Skip to content

notvillers/BarcodeScanSplit

Repository files navigation

BarcodeScanSplit

Reads pdf files from directory, split the pages and renames them by the barcode found on each page.

Extra requirements

Arguments

Using arguments

  • -s, --source : Directory containing PDF files.
  • -d, --destination: Directory to store output files.
  • -b, --backup: Directory to store backup files.
  • -l, --log: Directory to store log files.
  • -t, --temp: Temporary directory to store split.
  • -i, --image: Temporary directory to store the images.
  • -m, --mode: Processing mode. (single, multi)
  • -p, --processes: Maximum number of processes to run, by default is the number of CPU threads.
  • -f, --prefixes: Prefixes for OCR reading if barcode not found (ex.: 'KSZ,EKSZ')
  • -r, --ratio: Image ratio to check for OCR, only neccessary if --text-prefixes is given (ex.: 0.4 means it scans from top to bottom 40% of the image)

(If any of the arguments left empty the script will read its pair from the default config file.)

Running

Windows:

Example usage:

    python splitter.py -s docs -d out -m multi -p 4
    powershell.exe .\splitter.ps1
    .\splitter.bat

(splitter.bat can't get arguments at the moment.)

Unix/Linux/Mac:

Example usage:

    python3 splitter.py -s docs -d out -m multi -p 4
    ./barcodescansplit.sh -s docs -d out -m multi -p 4
    

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •