I [tweeted](https://twitter.com/gvanrossum/status/1297687557715529728) about this and got a few tips that I might have to follow up if the current corpus is not sufficiently useful. - [ ] [Meg Risdal from Kaggle might have some](https://twitter.com/MeganRisdal/status/1297692573260083200) - [ ] I got a [tip](https://twitter.com/YadKonrad/status/1297698662055735297) about [Buggify](https://github.com/roovyshapiro/Buggify) (looks immature) - [ ] [Apparently](https://twitter.com/dmoisset/status/1297819956676108288) @Tobias-Kohn may have [something](https://pretalx.com/pyconuk-2019/talk/G93RFU/) - [ ] [David Kramer says Sentry might have something](https://twitter.com/zeeg/status/1298027165905125376) (DM for access) (Also [Data Dreadnought could help](https://twitter.com/DataDreadnought/status/1298043679635103744)) FWIW I am currently using a corpus made available by the authors of [this paper](https://arxiv.org/abs/1907.07803).