-
Notifications
You must be signed in to change notification settings - Fork 501
Open
Labels
error casesSome error/test case for future improvementsSome error/test case for future improvementsmodels:fulltextmodels:segmentation
Milestone
Description
Hello, I am using Grobid for my project and I am working with PDF Drug Labels. I have noticed a few things that happen when the pdf is extracted into xml:
- It often times does not extract the text that comes right after an image
- It sometimes captures a new head into the preceding header. For example after extracting section 12.3, it extracts section 12.4 as a continuation of the preceding header.
Could this be looked at please?
Metadata
Metadata
Assignees
Labels
error casesSome error/test case for future improvementsSome error/test case for future improvementsmodels:fulltextmodels:segmentation