OCR Fails to Detect and Classify Checkbox States in Structured Forms

### 🚀 The feature, motivation and pitch

### Description:
The current OCR pipeline is unable to accurately detect and classify the states of checkboxes in scanned or photographed document forms. This affects data extraction quality where form checkboxes are used to capture user selections.

### Expected Functionality:
#### OCR should:
- Detect all checkbox elements in the form
- Classify checkbox states:
- Empty: Rectangular border with empty interior
- Checked: Contains ✓, or “v” shape
- Crossed: Contains “x” or diagonal line(s)
- Filled: Darkened or shaded interior
- Partial: Unclear or incomplete mark
- Associate checkbox rows/columns with corresponding labels 

### Steps taken:

- Tried prompt-engineering around layout inference and checkbox keyword detection – unsuccessful
- OCR returns text only, ignoring graphic elements entirely.

##### Sample file tried

![Image](https://github.com/user-attachments/assets/751bab28-1165-4968-b31c-be324ef75e8f)

### Alternatives

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

OCR Fails to Detect and Classify Checkbox States in Structured Forms #217

🚀 The feature, motivation and pitch

Description:

Expected Functionality:

OCR should:

Steps taken:

Sample file tried

Alternatives

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

OCR Fails to Detect and Classify Checkbox States in Structured Forms #217

Description

🚀 The feature, motivation and pitch

Description:

Expected Functionality:

OCR should:

Steps taken:

Sample file tried

Alternatives

Additional context

Activity

aman-17 commented on Jul 10, 2025

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Issue actions