Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(feat) Integrate Docling for documents conversion #4763

Open
tobitege opened this issue Nov 5, 2024 · 0 comments
Open

(feat) Integrate Docling for documents conversion #4763

tobitege opened this issue Nov 5, 2024 · 0 comments
Labels
agent quality Problems with specific agents enhancement New feature or request

Comments

@tobitege
Copy link
Collaborator

tobitege commented Nov 5, 2024

What problem or use case are you trying to solve?

Integrate IBM Deep Search' "Docling" (MIT) for e.g. agent skills to read/convert multitude of
documents (file/URL) to e.g. markdown:

https://github.com/DS4SD/docling

From their examples:

from docling.document_converter import DocumentConverter

source = "https://arxiv.org/pdf/2408.09869"  # document per local path or URL
converter = DocumentConverter()
result = converter.convert(source)
print(result.document.export_to_markdown())  # output: "## Docling Technical Report[...]"

If a PDF was converted to markdown before sending it to the LLM (maybe as tool calls),
it could bring down cost and/or open up other uses for the results.

@tobitege tobitege added enhancement New feature or request agent quality Problems with specific agents labels Nov 5, 2024
@tobitege tobitege changed the title Integrate Docling for doc conversion (feat) Integrate Docling for documents conversion Nov 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
agent quality Problems with specific agents enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant