Poor Image OCR API 😎

A REST API for extracting text from images using Optical Character Recognition (OCR).
Built with Node.js, Express, and Tesseract.js.

Features

Extract text from images using OCR
Accurate text extraction with confidence scores
Multi-language support (English, Spanish, French, German, etc.)
RESTful API endpoints
File validation (type and size)
Interactive Swagger/OpenAPI documentation
Secure file uploads
Rate limiting and security rules
Efficient memory usage with worker pooling

Requirements

Node.js >= 18.0.0
npm or yarn

Installation

npm install

Environment Configuration

Create a .env file:

NODE_ENV=development
PORT=3000
HOST=localhost
MAX_FILE_SIZE=10485760
ALLOWED_FILE_TYPES=image/png,image/jpeg,image/jpg,image/webp
RATE_LIMIT_WINDOW_MS=900000
RATE_LIMIT_MAX_REQUESTS=100
CORS_ORIGIN=*
LOG_LEVEL=info

# MongoDB Configuration
MONGODB_URI=mongodb://localhost:27017/ocr-stats
MONGODB_MAX_POOL_SIZE=10
MONGODB_MIN_POOL_SIZE=2

# Stats Secret Token (IMPORTANT: Change this!)
STATS_SECRET_TOKEN=your-super-secret-token-here-change-this

Running the Server

Development

npm run dev

Production

npm start

Server URL: http://localhost:3000 API Docs: http://localhost:3000/api-docs

API Endpoints

OCR Processing

POST /api/ocr/process

Request

image (file, required): PNG, JPG, JPEG, WEBP (Max 10MB)
language (string, optional): OCR language (default: eng)

Response

{
  "success": true,
  "message": "Image processed successfully",
  "data": {
    "text": "Extracted text from image...",
    "confidence": 95.67,
    "language": "eng",
    "processingTime": 1234
  }
}

Health Check

GET /api/health

{
  "success": true,
  "message": "Server is running",
  "timestamp": "2025-11-10T10:30:45.123Z",
  "uptime": 123.45
}

Statistics (Protected)

All statistics endpoints require authentication via STATS_SECRET_TOKEN.

GET /api/stats

Get overall statistics with optional filters.

Headers:

x-stats-token: Your secret stats token

Query Parameters:

startDate (optional): Filter from date (ISO 8601)
endDate (optional): Filter to date (ISO 8601)
endpoint (optional): Filter by endpoint
success (optional): Filter by success status (true/false)

Response:

{
  "success": true,
  "message": "Statistics retrieved successfully",
  "data": {
    "overall": {
      "totalRequests": 1234,
      "successfulRequests": 1200,
      "failedRequests": 34,
      "successRate": 97.25,
      "avgResponseTime": 1523.45,
      "minResponseTime": 234,
      "maxResponseTime": 5432,
      "avgConfidence": 94.32,
      "totalDataProcessed": 15728640
    },
    "byEndpoint": [...],
    "hourly": [...]
  }
}

GET /api/stats/recent

Get recent requests with optional filters.

Headers:

x-stats-token: Your secret stats token

Query Parameters:

limit (optional): Number of results (default: 100)
startDate (optional): Filter from date
endDate (optional): Filter to date
endpoint (optional): Filter by endpoint
success (optional): Filter by success status

DELETE /api/stats/cleanup

Delete old statistics.

Headers:

x-stats-token: Your secret stats token

Query Parameters:

daysToKeep (optional): Days to keep (default: 90)

Supported Languages

eng – English
spa – Spanish
fra – French
deu – German
rus – Russian
chi_sim – Chinese (Simplified)
jpn – Japanese

Additional Tesseract traineddata files can be installed for more languages.

Technologies Used

Node.js
Express.js
Tesseract.js
MongoDB + Mongoose
Multer (file upload)
Helmet (security)
Swagger (API documentation)

Security

Helmet HTTP headers protection
CORS configuration
Rate limiting (100 requests / 15 min)
File type and size validation
Input validation

Project Structure

image-ocr/
  src/
    config/          # Configuration files
      db.config.js   # MongoDB connection
      env.config.js  # Environment config
      swagger.config.js
    controllers/     # Route controllers
    middleware/      # Express middlewares
      auth.js        # Stats token authentication
    models/          # MongoDB models
      stats.model.js # Statistics schema
    routes/          # API routes
    services/        # Business logic
      ocr.service.js
      stats.service.js
    utils/           # Helper utilities
    app.js           # Express app setup
    server.js        # Server entry point
  uploads/           # Temporary uploaded files
  .env               # Environment variables
  package.json
  README.md

Usage Examples

cURL

curl -X POST http://localhost:3000/api/ocr/process \
  -F "image=@/path/to/image.png" \
  -F "language=eng"

JavaScript (Fetch API)

const formData = new FormData();
formData.append('image', fileInput.files[0]);
formData.append('language', 'eng');

fetch('http://localhost:3000/api/ocr/process', {
  method: 'POST',
  body: formData
})
  .then(res => res.json())
  .then(data => console.log(data))
  .catch(err => console.error(err));

Get Statistics

curl -X GET http://localhost:3000/api/stats \
  -H "x-stats-token: your-super-secret-token-here-change-this"

Or with query parameters:

curl -X GET "http://localhost:3000/api/stats?startDate=2024-01-01&endDate=2024-12-31" \
  -H "x-stats-token: your-super-secret-token-here-change-this"

Python (requests)

import requests

url = 'http://localhost:3000/api/ocr/process'
files = {'image': open('image.png', 'rb')}
data = {'language': 'eng'}

response = requests.post(url, files=files, data=data)
print(response.json())

License

MIT

Author: Yaxyobek (Yakhyobek) Year: 2025

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
eng.traineddata		eng.traineddata
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Poor Image OCR API 😎

Features

Requirements

Installation

Environment Configuration

Running the Server

Development

Production

API Endpoints

OCR Processing

Request

Response

Health Check

Statistics (Protected)

Supported Languages

Technologies Used

Security

Project Structure

Usage Examples

cURL

JavaScript (Fetch API)

Get Statistics

Python (requests)

License

About

Uh oh!

Releases

Packages

Languages

yaxyobekuz/image-ocr

Folders and files

Latest commit

History

Repository files navigation

Poor Image OCR API 😎

Features

Requirements

Installation

Environment Configuration

Running the Server

Development

Production

API Endpoints

OCR Processing

Request

Response

Health Check

Statistics (Protected)

Supported Languages

Technologies Used

Security

Project Structure

Usage Examples

cURL

JavaScript (Fetch API)

Get Statistics

Python (requests)

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages