Instapaper Scraper

This script allows users to scrape all saved Instapaper bookmarks and export them as CSV data.

Features

Scrapes all bookmarks from your Instapaper home page.
Support scraping bookmarks from specific Instapaper folders
Export bookmarks metadata in CSV format

Requirements

The following Python libraries are required:

requests
beautifulsoup4
python-dotenv

Install all dependencies using the provided requirements.txt:

pip install -r requirements.txt

Setup

Clone or download the repository.

Create an .env file in the project root with your credentials:

INSTAPAPER_USERNAME=your_username
INSTAPAPER_PASSWORD=your_password

Run the script:

# Output to console
python scrape.py

# Save output to CSV file
python scrape.py > bookmarks.csv

Environment Variables

INSTAPAPER_USERNAME: Your Instapaper account username
INSTAPAPER_PASSWORD: Your Instapaper account password
ENABLE_FOLDER_MODE: Set to 'true' to scrape a specific folder
FOLDER_ID_AND_SLUG: The folder ID and slug when folder mode is enabled

How It Works

Authenticate: Logs into Instapaper using your credentials
Extract Bookmarks: Fetches bookmarks from either homepage or specific folder
Process Pages: Iterates through all available pages of bookmarks
Output CSV: Prints bookmark data in CSV format with headers

Example Output

The script outputs CSV data with the following structure:

page,id,title,url
Page 1,999901234,"Article 1",https://www.example.com/page-1/
Page 1,999002345,"Article 2",https://www.example.com/page-2/

Acknowledgments

This script is forked and modified from:

Major modifications include:

Switched from HTML/PDF downloads to CSV export format
Added environment variable support using python-dotenv
Implemented folder-specific bookmark scraping
Enhanced documentation and examples

Disclaimer

This script requires valid Instapaper credentials. Be cautious when using personal account information and ensure compliance with Instapaper’s Terms of Service.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
scrape-transactions.py		scrape-transactions.py
scrape.py		scrape.py
styles.css		styles.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Instapaper Scraper

Features

Requirements

Setup

Environment Variables

How It Works

Example Output

Acknowledgments

Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

chriskyfung/InstapaperScraper

Folders and files

Latest commit

History

Repository files navigation

Instapaper Scraper

Features

Requirements

Setup

Environment Variables

How It Works

Example Output

Acknowledgments

Disclaimer

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages