ocr-python

Star

Here are 408 public repositories matching this topic...

hiroi-sora / Umi-OCR

Star

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

screenshot qt ocr qml ocr-python paddleocr umi-ocr

Updated Apr 26, 2025
Python

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】

ocr pytorch chinese-character-recognition ocr-python english-character-recognition

Updated Nov 30, 2024
Python

CatchTheTornado / text-extract-api

Star

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

api pdf json ocr extract anonymization pii ocr-python llm

Updated Apr 29, 2025
Python

hiroi-sora / Umi-OCR_v2

Star

结束和新的开始

qt ocr qml ocr-python paddleocr

Updated Nov 19, 2023
QML

Psarpei / Multi-Type-TD-TSR

Star

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

nlp computer-science machine-learning natural-language-processing ocr computer-vision deep-learning algorithms machine-learning-algorithms image-processing nlp-machine-learning ocr-recognition computer-vision-algorithms ocr-python table-detection computer-vision-opencv table-detection-using-deep-learning table-structure-recognition

Updated Sep 5, 2022
Jupyter Notebook

maxent-ai / ocrpy

Star

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

python nlp aws information-retrieval ocr computer-vision deep-learning azure cv image-processing transformers tesseract-ocr google-vision-api semantic-search ocr-python

Updated Nov 3, 2023
Jupyter Notebook

MrZilinXiao / Hyper-Table-OCR

Star

A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.

ocr deep-learning table-extraction ocr-python table-ocr

Updated Jan 10, 2023
C++

nathanaday / RealTime-OCR

Star

Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.

python ocr multithreading cv2 opencv-python pytesseract ocr-python

Updated Jan 30, 2023
Python

ankandrew / fast-plate-ocr

Star

Lightweight & fast OCR models for license plate text recognition.

ocr tensorflow keras pytorch license-plate plate-recognition onnx license-plate-recognition jax ocr-python albumentations plate-ocr license-plate-reader keras3 license-plate-check license-plate-ocr

Updated May 11, 2025
Python

ilic5000 / pabkvizgenerator

Star

Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in the Balkan region.

python opencv computer-vision tesseract quiz-game quiz-app ocr-python easyocr

Updated Sep 26, 2022
Python

blueaxis / Cloe

Star

Manga OCR snipping application for desktop

ocr pyqt5 ocr-python snipping-tool manga-ocr

Updated Jan 7, 2023
Python

prp-e / persian_ocr_project

Star

A FLOSS software for Persian Optical Character Recognition

ocr ocr-recognition ocr-python

Updated Jun 19, 2024
Jupyter Notebook

nainiayoub / pdf-text-data-extractor

Star

PDF text data extraction web app with OCR for scanned documents

python pdf ocr text-extraction pdf-to-text ocr-text-reader ocr-python streamlit streamlit-webapp

Updated Jun 5, 2024
Python

shibing624 / imgocr

Star

Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理，可实现 CPU 上毫秒级的 OCR 精准预测，通用场景中英文OCR达到开源SOTA。

ocr ocr-python chinese-ocr

Updated Jan 22, 2025
Python

kartikgill / Easter2

Star

Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION

ocr handwriting-ocr python3 optical-character-recognition htr handwriting-recognition handwritten-text-recognition ocr-python iam-dataset easter2

Updated Apr 25, 2023
Jupyter Notebook

genieincodebottle / parsemypdf

Star

Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.