Skip to content

[Feature]: more detailed information for: lots of diacritics - possibly poor OCR #1566

@clach04

Description

@clach04

Describe the proposed feature

I had a game manual that had 3 pages (out of 88) that generate:

   26 [tesseract] lots of diacritics - possibly poor OCR   tesseract.py:241
   68 [tesseract] lots of diacritics - possibly poor OCR   tesseract.py:241
   12 [tesseract] lots of diacritics - possibly poor OCR   tesseract.py:241

I'd prefer to see the raw message from Tesseract. This isn't specific to the diacritics message (EDIT opened #1567).

What are people's opinions on this?

If positive, I could look at doing a PR.

Turned out for those pages, the OCR warnings were from some screenshots 😆

BTW, thanks for this awesome tool. The installation instructions are really great.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions