Add as_recognition_dataset to VisionDataset #618
felixdittrich92
started this conversation in
Ideas
Replies: 4 comments
-
Hi @felixdittrich92! Yup, I agree but I think the best way to achieve this would be to have a constructor flag for each dataset:
What do you think? |
Beta Was this translation helpful? Give feedback.
0 replies
-
@fg-mindee |
Beta Was this translation helpful? Give feedback.
0 replies
-
I was thinking about something quite simple: from doctr.datasets import FutureOCRDataset
ocr_ds = FutureOCRDataset("/path/to/root")
detection_ds = FutureOCRDataset("/path/to/root", target_task="text-detection")
recognition_ds = FutureOCRDataset("/path/to/root", target_task="text-recognition") |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
🚀 The feature
Add a method to the VisionDataset which make it possible to use each dataset also for recognition task.
Todo would be crop each box as image with there plain label and all exsisting chars as vocab.
A must have would be to merge multible datasets
@fg-mindee
wdyt ?
Motivation, pitch
Provide datasets also for recognition would be a big benefit for training part and i think a very good way to validate the results
Alternatives
No response
Additional context
No response
Beta Was this translation helpful? Give feedback.
All reactions