process.json not written, always restarting from scratch #1581

@Frank-Steiner

Hi,
I've just fetched the latest code and tried to download invoices from Amazon (great idea to automate this!!!). My call is

docudigger scrape amazon -u xxx -p yyy -l ~/.docudigger --fileDestinationFolder ~/.docudigger/amazon

This works fine and fetches the PDFs from 4 years. But when I call it again with --yearFilter 2025 --onlyNew I get these messages:

[info] [2025-10-24 17:10:28] [scrape:amazon]:   OnlyNew activated.
[info] [2025-10-24 17:10:28] [scrape:amazon]:   Getting last run
[warn] [2025-10-24 17:10:28] [scrape:amazon]:   process.json not found. Full run needed. OnlyNew deactivated. 

and there is indeed no process.json anywhere in ~/.docudigger or anywhere else in $HOME. Should this file be filled during the scrape, or is it written all at once at the end?
Any idea why it could be missing?
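
A search along the following lines can confirm that the file really is absent. This is only a minimal sketch: the helper name find_process_json is hypothetical and not part of docudigger, and it assumes the file would be named exactly process.json, as in the warning above.

```shell
# Hypothetical helper (not part of docudigger): print the path of every
# regular file named process.json below the given directory.
find_process_json() {
    # $1: directory to search; falls back to $HOME when omitted.
    # Errors such as unreadable subdirectories are silenced.
    find "${1:-$HOME}" -type f -name 'process.json' 2>/dev/null
}
```

Calling find_process_json ~/.docudigger checks the log folder used above, and calling it without an argument searches all of $HOME. Empty output means no such file exists in the searched tree.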

cu,
Frank

Labels: bug, plugin: amazon