Using Docling with costume layout and table recognition models #250

ALIYoussef · 2024-11-05T19:49:42Z

Is it possible to use the docling with costume models for layout and table recognition.

I would like to use the pipeline by replacing the existing models with my own models for layout and table recognition. I am wondering if the documentation has any example about using costume AI models.

dolfim-ibm · 2024-11-06T07:40:18Z

The choice of the models is done at the Pipeline level. For example, the PDF pipeline (called StandardPdfPipeline) is defined in docling/pipeline/standard_pdf_pipeline.py.

You can make your own pipeline with different models, or simply extend with others. We have an example which extends the PDF pipeline with an image understanding model. See https://ds4sd.github.io/docling/examples/develop_picture_enrichment/.

PeterStaar-IBM · 2024-11-06T09:17:39Z

@ALIYoussef You can of course also provide extension to docling via a PR.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using Docling with costume layout and table recognition models #250

Using Docling with costume layout and table recognition models #250

ALIYoussef commented Nov 5, 2024

dolfim-ibm commented Nov 6, 2024

PeterStaar-IBM commented Nov 6, 2024

Using Docling with costume layout and table recognition models #250

Using Docling with costume layout and table recognition models #250

Comments

ALIYoussef commented Nov 5, 2024

dolfim-ibm commented Nov 6, 2024

PeterStaar-IBM commented Nov 6, 2024