Synthetically generate random text documents with ground truth!
- Atleast Python 3.7
pip install -r dependencies.txt
- Check
documentation/Installation
for further instructions
python generate.py <template.json> <num_samples> <output_folder>
Check the templates/
folder for sample document templates.
python augment.py <config.json> <input_folder> <num_epochs> <output_folder> <num_workers>
Check documentation/Augmentation
for more details.