mytaxi-scraper

A tool that scrapes metadata out of mytaxi receipt pdfs.

Problem

If you use taxis a lot in germany, and use mytaxi for this purpose, these receipts will appear familiar to you:

This tool extracts meta information from these PDF receipts for further analysis

Usage

Extract data

Make sure you have python3
Install pdfminer.six (pip3 install pdfminer.six)
Prepare a directory that contains your pdf files
python3 write_json.py <<path/to/directory/with/trailing/slash/>>
You have a metadata.json file with the metadata for all your taxi rides

Create a heatmap of your visited spots

Make sure to obtain a Google Maps API key. Set the environment variable:

export GMAPS_API_KEY=<<Your API Key>>

Exactly as the extract script, but python3 analyze.py <<path/to/directory/with/trailing/slash/>>

The script uses Google Maps API to find the coordinates of the addresses of your rides. Once it has done that, it stores the results of the pdf parsing and the location queries in a pickle file. If you want to refresh the data, just delete the pickle file. The script will produce an html file with a google map of your rides and hotspots.

Use as library in your own program

Copy the file extract.py into your python project, then:

from extract import parse_bill

metadata = parse_bill('path/to/my/bill.pdf')
print(metadata)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
analyze.py		analyze.py
demo.png		demo.png
extract.py		extract.py
heatmap.jpg		heatmap.jpg
write_excel.py		write_excel.py
write_json.py		write_json.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mytaxi-scraper

Problem

Usage

Extract data

Create a heatmap of your visited spots

Use as library in your own program

About

Releases

Packages

Languages

ThomasDebrunner/mytaxi-scraper

Folders and files

Latest commit

History

Repository files navigation

mytaxi-scraper

Problem

Usage

Extract data

Create a heatmap of your visited spots

Use as library in your own program

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages