Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

new optional input (corpus weights) #2

Open
wants to merge 14 commits into
base: master
Choose a base branch
from

Conversation

lzlniu
Copy link

@lzlniu lzlniu commented Feb 9, 2022

Specify optional argument --corpus-weights=filename.tsv to input the weights
The new optional input file should be in TSV format and better to have 2 columns. But it's ok for it to have more or fewer columns, if fewer, then the corpus weight will not be specify and set to 1.0, if more, then >2 columns will not be read.
It's also not necessary for each PubMed ID that occur in the "documents.tsv" to appear in this corpus weights file (not appear PubMed ID's corpus weight will be set to 1.0).

Fix the standard copy constructor of Document Class

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant