Skip to content

Thisiswallz/THESIS_CORPUS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

THESIS_CORPUS: 3D Printing Scholarly Research Analysis

Overview

This repository contains the research corpus and analysis tools for a thesis project examining 3D printing literature through the Lens.org Scholarly API. The project analyzes 31,000+ scholarly articles to understand the evolution and impact of 3D printing technology across various fields.

Project Structure

RESEARCH/
├── 1. Fos_3d_printing/      # Field of Study analysis for 3D printing
├── 2. PDF_download/          # PDF retrieval and management
├── 3. Patent_enrich/         # Patent enrichment analysis
├── 4. Patent_csv_match/      # Patent-article citation matching
├── 5. troubleshooting patent api/ # Patent API exploration
├── 6. Patent_API_matching/   # Direct API-based matching
└── 7. PDF_converter/         # PDF processing utilities

Key Features

  • Comprehensive Metadata Extraction: Analyzes 31,016 3D printing scholarly articles
  • Patent-Article Matching: Links scholarly articles to patent citations (387,593 matches found)
  • Field Analysis: Maps interdisciplinary connections across engineering, materials science, and biomedical fields
  • API Integration: Leverages Lens.org Scholarly and Patent APIs for data retrieval

Setup

  1. Clone the repository:
git clone https://github.com/Thisiswallz/THESIS_CORPUS.git
cd THESIS_CORPUS
  1. Create a .env file with your Lens.org API key:
API=your_lens_api_key_here
  1. Install dependencies:
pip install -r requirements.txt

Usage

Extract 3D Printing Metadata

cd RESEARCH/1.\ Fos_3d_printing/Scripts/
python3 11_final_extraction.py

Run Patent-Article Matching

cd RESEARCH/4.\ Patent_csv_match/scripts/production/
./run_full_82k_production.sh

Key Findings

  • Core Dataset: 31,016 documents with 3D printing field tags
  • Extended Dataset: 106,797 documents with abstract mentions
  • Patent Citations: 387,593 scholarly article citations found in patents
  • Top Fields: Engineering, Mechanical Engineering, Materials Science, Biomedical Engineering
  • MagID Coverage: 100% author-level identification for tracking

Data Availability

  • Processed datasets are excluded from version control for size and privacy
  • Scripts and documentation are provided for reproducibility
  • Contact author for access to processed datasets

Documentation

See CLAUDE.md for detailed API documentation and field mappings.

License

This research project is part of a thesis work. Please contact the author for usage permissions.

Author

Wallace - Thesis Research 2025

Acknowledgments

  • Lens.org for API access
  • 3D printing research community

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published