Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug: Markdown parsing error #83

Open
jamesvillarrubia opened this issue Aug 7, 2024 · 0 comments
Open

Bug: Markdown parsing error #83

jamesvillarrubia opened this issue Aug 7, 2024 · 0 comments
Assignees

Comments

@jamesvillarrubia
Copy link
Collaborator

jamesvillarrubia commented Aug 7, 2024

The following markdown:

A horizontal rule follows.

***

also

size  material      color
----  ------------  ------------
9     leather       brown
10    hemp canvas   natural
11    glass         transparent

produces the following error:

parser-1    | error uploading file, stacktrace: Traceback (most recent call last):
parser-1    |   File "/app/nlm_ingestor/ingestion_daemon/__main__.py", line 48, in parse_document
parser-1    |     return_dict, _ = ingestor_api.ingest_document(
parser-1    |                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
parser-1    |   File "/app/nlm_ingestor/ingestor/ingestor_api.py", line 41, in ingest_document
parser-1    |     ingestor = markdown_parser.MarkdownDocument(doc_location)
parser-1    |                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
parser-1    |   File "/app/nlm_ingestor/file_parser/markdown_parser.py", line 163, in __init__
parser-1    |     self.blocks, self.html_str = parse_markdown_to_blocks(markdown_text)
parser-1    |                                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
parser-1    |   File "/app/nlm_ingestor/file_parser/markdown_parser.py", line 37, in parse_markdown_to_blocks
parser-1    |     cur_blocks = {
parser-1    |                  ^
parser-1    | KeyError: 'thematic_break'
parser-1    | Traceback (most recent call last):
parser-1    |   File "/app/nlm_ingestor/ingestion_daemon/__main__.py", line 48, in parse_document
parser-1    |     return_dict, _ = ingestor_api.ingest_document(
parser-1    |                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
parser-1    |   File "/app/nlm_ingestor/ingestor/ingestor_api.py", line 41, in ingest_document
parser-1    |     ingestor = markdown_parser.MarkdownDocument(doc_location)
parser-1    |                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
parser-1    |   File "/app/nlm_ingestor/file_parser/markdown_parser.py", line 163, in __init__
parser-1    |     self.blocks, self.html_str = parse_markdown_to_blocks(markdown_text)
parser-1    |                                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
parser-1    |   File "/app/nlm_ingestor/file_parser/markdown_parser.py", line 37, in parse_markdown_to_blocks
parser-1    |     cur_blocks = {
parser-1    |                  ^
parser-1    | KeyError: 'thematic_break'
parser-1    | 192.168.65.1 - - [07/Aug/2024 00:49:13] "POST /api/parseDocument?renderFormat=all HTTP/1.1" 500 -
@jamesvillarrubia jamesvillarrubia self-assigned this Aug 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant