Skip to content

Arabic text extracted from PDF is reversed (both words and word's chars) #1938

Open
@Mahmoud-A-Noor

Description

@Mahmoud-A-Noor

Bug

when i extract text from pdf that contains arabic content i get arabic text in the wrong direction and chars of each word is reversed
...

Steps to reproduce

just try to extract text from any arabic pdf
...

Docling version

latest version
...

Python version

3.13
...

Image Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions