[REQUEST] - docling #294

Open
thistleknot opened this issue Sep 16, 2024 · 1 comment · May be fixed by #471
Labels
enhancement New feature or request

Comments

@thistleknot

Reference Issues

No response

Summary

docling supports automatic parsing of PDFs that contain tables. I've found it very beneficial.
https://github.com/DS4SD/docling/issues

Basic Example

Automatic table extraction from PDFs.
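
A minimal sketch of what this could look like, assuming docling's `DocumentConverter` API (`convert()` followed by `document.export_to_markdown()`, as in docling's documented quick-start; names may differ between versions). `pdf_to_markdown` and `extract_markdown_tables` are hypothetical helpers for illustration only, not part of docling or of this project's readers.

```python
from typing import List


def pdf_to_markdown(pdf_path: str) -> str:
    """Convert a PDF to Markdown with docling; tables come out as Markdown tables."""
    # Imported lazily so the helper below can be used without docling installed.
    # Assumption: docling exposes DocumentConverter at this import path.
    from docling.document_converter import DocumentConverter

    converter = DocumentConverter()
    result = converter.convert(pdf_path)
    return result.document.export_to_markdown()


def extract_markdown_tables(markdown: str) -> List[str]:
    """Hypothetical helper: return each pipe-delimited table block as one string."""
    tables: List[str] = []
    current: List[str] = []
    for line in markdown.splitlines():
        stripped = line.strip()
        if stripped.startswith("|"):
            # Still inside a table: collect the row.
            current.append(stripped)
        elif current:
            # A non-table line closes the table collected so far.
            tables.append("\n".join(current))
            current = []
    if current:
        tables.append("\n".join(current))
    return tables
```

A reader built on this would call `pdf_to_markdown("report.pdf")` (the path is a placeholder) and pass the result on as document text; `extract_markdown_tables` only shows how the extracted tables could be pulled out separately if needed.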

Drawbacks

Needs GPU access.
Changes the format of the incoming document (PDF to Markdown), but I've found PDFs much easier to read once they have been converted to Markdown. It uses layout detection plus vision transformers to translate tables into Markdown representations.

Additional information

No response

@thistleknot thistleknot added the enhancement New feature or request label Sep 16, 2024
@cin-albert
Collaborator

cin-albert commented Sep 26, 2024

Thanks for the request. I'm working on it and will add it to the readers soon.

@cin-albert cin-albert linked a pull request Nov 6, 2024 that will close this issue

2 participants