Tatar language data quality issues #61

@rsabirov

Hello,

Where is the data for the Tatar language coming from?

I see a lot of garbage there; I could barely find any Tatar words.

I would like to improve this.

  1. Do you have a page with guidance on how to train the model?
  2. Once I train it, should I create a PR to that repo with just the model itself? Where are you storing the raw data for training?

Follow-up to tesseract-ocr/langdata#305
