pdfOCR 4.1.0

@nanouh nanouh released this 02 Sep 12:22

pdfOCR is our add-on for iText Core to perform OCR on documents and images.

This release of pdfOCR introduces a major change: a new built-in OCR engine. It adds the pdfocr-onnxtr module, which implements the OnnxTR library for OCR tasks and has specific requirements for model predictors and resource management. The new engine significantly improves recognition accuracy for English text and other Latin-based languages.

The Open Neural Network Exchange (ONNX) is an open standard format for machine learning models that enables interoperability across frameworks and tools. OnnxTR is a Python OCR library that wraps the popular OCR tool doctr and adds support for ONNX models.

OnnxTR makes OCR processing faster and more accessible by running optimized ONNX models without requiring heavy frameworks. This allows OCR to be integrated into applications with low resource consumption and high processing speed across multiple platforms, helped by its modular design and lightweight dependencies. Using the existing pdfOCR API, we’ve simply added another OCR engine alongside the existing pdfocr-tesseract4 module.

In addition, pdfOCR now directly supports PDF files as input. This can be a big benefit for OCR workflows, as it removes the need to first process PDFs with iText Core to extract the images from scanned documents.

You can find full details linked from the release notes on the iText Knowledge Base.