I'm using docling to load documents and then export them with export_to_markdown. My data has many tables, and I've discovered that some cells contain the "|" character, which breaks the table structure in the exported Markdown. I would expect docling to escape those characters when exporting, or at least to offer an option to do so (if not by default).
I considered contributing, but I am unsure where a change like that would fit. Perhaps a boolean escape_breaking parameter could be added here, which would then replace any occurrences of "|" using a regex. Perhaps that is too crude. Could a docling-core developer point me in the right direction?
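To make the proposal concrete, here is a minimal sketch of the escaping idea, assuming GitHub-flavored Markdown, where a literal pipe inside a table cell is written as "\|". The names escape_pipes and rows_to_markdown are hypothetical helpers for illustration only. They are not part of docling or docling-core, and the sketch builds the table by hand rather than through tabulate. Only the escape_breaking flag name comes from the suggestion above.

```python
import re

# Matches a "|" that is not already preceded by a backslash,
# so cells that were escaped earlier are not escaped twice.
_UNESCAPED_PIPE = re.compile(r"(?<!\\)\|")


def escape_pipes(cell: str) -> str:
    """Hypothetical helper: escape pipes that would break a Markdown table row."""
    return _UNESCAPED_PIPE.sub(r"\\|", cell)


def rows_to_markdown(header, rows, escape_breaking=True):
    """Hypothetical helper: render a simple pipe table, optionally escaping cells."""

    def fmt(cells):
        texts = [str(c) for c in cells]
        if escape_breaking:
            texts = [escape_pipes(t) for t in texts]
        return "| " + " | ".join(texts) + " |"

    lines = [fmt(header), "| " + " | ".join("---" for _ in header) + " |"]
    lines.extend(fmt(row) for row in rows)
    return "\n".join(lines)


if __name__ == "__main__":
    print(rows_to_markdown(["expr", "meaning"], [["a|b", "a or b"]]))
```

With escape_breaking=False the cell "a|b" would be emitted unchanged and read as two columns. The sketch does not handle a pipe preceded by an escaped backslash, so it may indeed be too crude for a real implementation.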
Kind regards,
@isakcodes Thanks for reporting this issue and for your willingness to contribute!
Indeed, the problem you reported is being addressed in #61.
Also note that we rely on the tabulate library, which has an open issue on the same topic: astanin/python-tabulate#241
I see there is another open issue mentioning this.