Bug - handling last sentence in task1_converter.py #38

antonyscerri · 2020-01-22T22:39:11Z

Hi

The conversion script task1_converter.py does not handle the last line in all the deft_files where they dont end in a blank line (which is all those in the train subdirectory). The code isn't checking for a new sentences concatenation after going through all the lines.

This brings up another question which is the corpus size in terms of sentences. I've not been able to match up with the figures in the paper against any of the sets of files in this repo, so i wanted to check how many sentences should there in fact be in total.

Thanks

Tony

…prefixes, fixed task1_converter to always handle final lines as mentioned in #38

sashaspala pushed a commit that referenced this issue Jan 28, 2020

Fixed minor tokenization errors, fixed #37 and added appropriate BIO …

63c8d6b

…prefixes, fixed task1_converter to always handle final lines as mentioned in #38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug - handling last sentence in task1_converter.py #38

Bug - handling last sentence in task1_converter.py #38

antonyscerri commented Jan 22, 2020

Bug - handling last sentence in task1_converter.py #38

Bug - handling last sentence in task1_converter.py #38

Comments

antonyscerri commented Jan 22, 2020