Supported Languages #41
-
Hi, What languages are supported?
Thank you |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
We should distinguish between 1) programmatic documents and 2) scanned documents. In the first case, we are language independent, we have tested Asian languages with good success. We are actually extending Docling with a simpler way to change OCR backend and customize the parameters. For the moment changing the config requires you to make a new |
Beta Was this translation helpful? Give feedback.
We should distinguish between 1) programmatic documents and 2) scanned documents.
In the first case, we are language independent, we have tested Asian languages with good success.
In the second case, we depend on the underlying OCR engine. At the moment we have binding for EasyOCR which has support for 80+ languages. On their website you find the language parameters to provide.
We are actually extending Docling with a simpler way to change OCR backend and customize the parameters. For the moment changing the config requires you to make a new
ModelPipeline
object.