This repository contains the research corpus and analysis tools for a thesis project examining 3D printing literature through the Lens.org Scholarly API. The project analyzes 31,000+ scholarly articles to understand the evolution and impact of 3D printing technology across various fields.
RESEARCH/
├── 1. Fos_3d_printing/ # Field of Study analysis for 3D printing
├── 2. PDF_download/ # PDF retrieval and management
├── 3. Patent_enrich/ # Patent enrichment analysis
├── 4. Patent_csv_match/ # Patent-article citation matching
├── 5. troubleshooting patent api/ # Patent API exploration
├── 6. Patent_API_matching/ # Direct API-based matching
└── 7. PDF_converter/ # PDF processing utilities
- Comprehensive Metadata Extraction: Analyzes 31,016 3D printing scholarly articles
- Patent-Article Matching: Links scholarly articles to patent citations (387,593 matches found)
- Field Analysis: Maps interdisciplinary connections across engineering, materials science, and biomedical fields
- API Integration: Leverages Lens.org Scholarly and Patent APIs for data retrieval
- Clone the repository:
git clone https://github.com/Thisiswallz/THESIS_CORPUS.git
cd THESIS_CORPUS
- Create a .env file with your Lens.org API key:
API=your_lens_api_key_here
- Install dependencies:
pip install -r requirements.txt
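As a rough illustration of how a script might pick up the key from the .env file created above, here is a minimal standard-library sketch. It is not taken from the repository's scripts. The `API` variable name comes from the .env step; `parse_env`, `load_api_key`, and `build_headers` are hypothetical helper names, and the bearer-token header is an assumption about how the Lens.org API expects the key, so check it against the Lens.org API documentation before relying on it.

```python
import os
from pathlib import Path


def parse_env(text):
    """Parse simple KEY=VALUE lines; blank lines and # comments are skipped."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_api_key(path=".env", name="API"):
    """Return the key from the .env file, falling back to the environment."""
    env_file = Path(path)
    file_values = parse_env(env_file.read_text(encoding="utf-8")) if env_file.exists() else {}
    key = file_values.get(name) or os.environ.get(name)
    if not key:
        raise RuntimeError(f"No {name} entry found in {path} or the environment")
    return key


def build_headers(api_key):
    """Assumed request headers: bearer-token auth with a JSON body."""
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
```

A script would then call `build_headers(load_api_key())` once at startup and pass the result to whichever HTTP client it uses. Reading the file by hand avoids an extra dependency; if requirements.txt already provides a dotenv loader, that is the simpler choice.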
cd RESEARCH/1.\ Fos_3d_printing/Scripts/
python3 11_final_extraction.py
cd RESEARCH/4.\ Patent_csv_match/scripts/production/
./run_full_82k_production.sh
- Core Dataset: 31,016 documents with 3D printing field tags
- Extended Dataset: 106,797 documents with abstract mentions
- Patent Citations: 387,593 scholarly article citations found in patents
- Top Fields: Engineering, Mechanical Engineering, Materials Science, Biomedical Engineering
- MagID Coverage: 100% author-level identification for tracking
- Processed datasets are excluded from version control because of their size and for privacy reasons
- Scripts and documentation are provided for reproducibility
- Contact author for access to processed datasets
See CLAUDE.md for detailed API documentation and field mappings.
This research project is part of a thesis. Please contact the author for usage permissions.
Wallace - Thesis Research 2025
- Lens.org for API access
- 3D printing research community