Using Docling with custom layout and table recognition models #250

Open
ALIYoussef opened this issue Nov 5, 2024 · 2 comments
Comments

@ALIYoussef

Is it possible to use docling with custom models for layout and table recognition?

I would like to use the pipeline but replace the existing models with my own models for layout and table recognition. Does the documentation have an example of using custom AI models?

@dolfim-ibm
Contributor

The choice of models is made at the pipeline level. For example, the PDF pipeline (called StandardPdfPipeline) is defined in docling/pipeline/standard_pdf_pipeline.py.

You can make your own pipeline with different models, or simply extend an existing one with additional models. We have an example that extends the PDF pipeline with an image understanding model. See https://ds4sd.github.io/docling/examples/develop_picture_enrichment/.
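To make the "pipeline owns the model list" idea concrete, here is a minimal, library-independent sketch. It deliberately does not import docling: `Page`, `SketchPdfPipeline`, `CustomModelsPipeline`, the `build_pipe` attribute, and the model functions are all hypothetical names chosen for illustration, so check docling/pipeline/standard_pdf_pipeline.py and the linked example for the real classes and signatures.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class Page:
    """Hypothetical stand-in for a parsed page (not a docling type)."""
    number: int
    annotations: List[str] = field(default_factory=list)


# In this sketch a "model" is any callable that consumes pages and yields them back.
PageModel = Callable[[Iterable[Page]], Iterable[Page]]


def default_layout_model(pages: Iterable[Page]) -> Iterable[Page]:
    for page in pages:
        page.annotations.append("layout:default")
        yield page


def default_table_model(pages: Iterable[Page]) -> Iterable[Page]:
    for page in pages:
        page.annotations.append("tables:default")
        yield page


class SketchPdfPipeline:
    """Toy pipeline: the constructor decides which models run, and in what order."""

    def __init__(self) -> None:
        self.build_pipe: List[PageModel] = [default_layout_model, default_table_model]

    def run(self, pages: Iterable[Page]) -> List[Page]:
        stream = pages
        for model in self.build_pipe:
            # Chain the models so each page passes through every stage in order.
            stream = model(stream)
        return list(stream)


def my_layout_model(pages: Iterable[Page]) -> Iterable[Page]:
    # Placeholder for your own layout model's inference call.
    for page in pages:
        page.annotations.append("layout:custom")
        yield page


def my_table_model(pages: Iterable[Page]) -> Iterable[Page]:
    # Placeholder for your own table recognition model's inference call.
    for page in pages:
        page.annotations.append("tables:custom")
        yield page


class CustomModelsPipeline(SketchPdfPipeline):
    """Subclass that swaps in custom models while reusing the run logic."""

    def __init__(self) -> None:
        super().__init__()
        self.build_pipe = [my_layout_model, my_table_model]


if __name__ == "__main__":
    for page in CustomModelsPipeline().run([Page(1), Page(2)]):
        print(page.number, page.annotations)
```

The point of the subclass is that only the constructor changes: the custom models are expected to accept and return the same page objects as the defaults, so the rest of the pipeline does not need to know which models are plugged in.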

@PeterStaar-IBM
Contributor

@ALIYoussef You can of course also contribute an extension to docling via a PR.

4 participants
@ALIYoussef @PeterStaar-IBM @dolfim-ibm and others