Reads pdf files from directory, split the pages and renames them by the barcode found on each page.
-s,--source: Directory containing PDF files.-d,--destination: Directory to store output files.-b,--backup: Directory to store backup files.-l,--log: Directory to store log files.-t,--temp: Temporary directory to store split.-i,--image: Temporary directory to store the images.-m,--mode: Processing mode. (single,multi)-p,--processes: Maximum number of processes to run, by default is the number of CPU threads.-f,--prefixes: Prefixes for OCR reading if barcode not found (ex.: 'KSZ,EKSZ')-r,--ratio: Image ratio to check for OCR, only neccessary if--text-prefixesis given (ex.: 0.4 means it scans from top to bottom 40% of the image)
(If any of the arguments left empty the script will read its pair from the default config file.)
Example usage:
python splitter.py -s docs -d out -m multi -p 4
powershell.exe .\splitter.ps1
.\splitter.bat
(splitter.bat can't get arguments at the moment.)
Example usage:
python3 splitter.py -s docs -d out -m multi -p 4
./barcodescansplit.sh -s docs -d out -m multi -p 4