Analyses of software mentions and dependencies
The software-mentions dataset is a collection of ML-identified mentions of software detected in about 24,000,000 academic papers.
If you want to extract the .parquet tables yourself, or work with the original dataset, see Extracting Tables. Otherwise, you can download the tables in a friendlier format from (INSERT LOCATION).