Synthtiger: korean dataset for doctr training #1098
khawar-islam
started this conversation in
General
Replies: 1 comment 4 replies
-
Hi @khawar-islam no need to keep masks and coords. Mh vertical could be used multiline is Not supported in doctr. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi @felixdittrich92 , thank you for you suggestion. After your suggestion, I have worked on
https://github.com/clovaai/synthtiger
repo and generate a new korean synthetic dataset. I have attached the .zip for your confirmation then i will be able to generate full 5 Million images dataset. I have changed some paramters for doctr training, please have a look on configuration and .zip fileWhile preparing training data, we do not need
masks
andcoords.txt
information. Am i right?Can i generate the vertical and multi line images for doctr training? Is that OK
results.zip
Beta Was this translation helpful? Give feedback.
All reactions