
Commit 9242312

Add semantic mining citation, update evo 1.5 config references (#105)
* Update README with citation
* Update acr_sample.json evo 1.5 name
* Update gene_completion.json evo 1.5 name
* Update operon_completion.json evo 1.5 name
* Update toxin_antitoxin_sample.json evo 1.5 name
1 parent 8ffd8d8 commit 9242312
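
The commit renames the Evo 1.5 model identifier in four sample configs, from `evo_1.5_8k_hf` to `evo-1.5-8k-base` (per the diffs in this commit). A minimal sketch for checking that no config in a directory still uses the old identifier is given here. It assumes each config is a flat JSON object with a top-level `model_name` key; the helper name `find_stale_configs` is hypothetical and not part of the repository.

```python
import json
import tempfile
from pathlib import Path

OLD_NAME = "evo_1.5_8k_hf"    # identifier removed by this commit
NEW_NAME = "evo-1.5-8k-base"  # identifier added by this commit


def find_stale_configs(config_dir):
    """Return sorted file names of JSON configs whose model_name is still OLD_NAME.

    Assumption: each config is a JSON object with a top-level "model_name" key.
    """
    stale = []
    for path in sorted(Path(config_dir).glob("*.json")):
        with path.open() as handle:
            config = json.load(handle)
        if config.get("model_name") == OLD_NAME:
            stale.append(path.name)
    return stale


if __name__ == "__main__":
    # Demo on a throwaway directory so the sketch runs without the repository.
    with tempfile.TemporaryDirectory() as tmp:
        (Path(tmp) / "updated.json").write_text(json.dumps({"model_name": NEW_NAME}))
        (Path(tmp) / "stale.json").write_text(json.dumps({"model_name": OLD_NAME}))
        print(find_stale_configs(tmp))  # prints ['stale.json']
```

In a checkout of the repository, the same function could be pointed at `semantic_mining/sample_configs`; an empty list would indicate that every sample config there uses the new name.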

5 files changed, +21 -5 lines changed

semantic_mining/README.md

+17 -1
@@ -74,4 +74,20 @@ Reference sequences for the operon and gene completion scripts are provided in t
 ## Related Documentation
 
 For more comprehensive documentation, please refer to:
-- The [Main Repository README](https://github.com/evo-design/evo/blob/main/README.md) for setting up generation with Evo
+- The [Main Repository README](https://github.com/evo-design/evo/blob/main/README.md) for setting up generation with Evo
+
+## Citation
+
+Please cite the following publication when referencing semantic mining or Evo 1.5.
+
+```
+@article {merchant2024semantic,
+author = {Merchant, Aditi T and King, Samuel H and Nguyen, Eric and Hie, Brian L},
+title = {Semantic mining of functional de novo genes from a genomic language model},
+year = {2024},
+doi = {10.1101/2024.12.17.628962},
+publisher = {Cold Spring Harbor Laboratory},
+URL = {https://www.biorxiv.org/content/early/2024/12/18/2024.12.17.628962},
+journal = {bioRxiv}
+}
+```

semantic_mining/sample_configs/acr_sample.json

+1 -1
@@ -9,7 +9,7 @@
 "output_folds_file": "/semantic_mining/test/ouput_folds.csv",
 "output_filtered_folds": "/semantic_mining/test/filtered_folds.csv",
 
-"model_name": "evo_1.5_8k_hf",
+"model_name": "evo-1.5-8k-base",
 "n_tokens": 1000,
 "temperature": 0.7,
 "top_k": 4,

semantic_mining/sample_configs/gene_completion.json

+1 -1
@@ -13,7 +13,7 @@
 "segmasker_path": "/usr/local/bin/ncbi-blast-2.16.0+/bin/segmasker",
 "mafft_path": "/usr/local/bin/mafft/bin/mafft",
 
-"model_name": "evo_1.5_8k_hf",
+"model_name": "evo-1.5-8k-base",
 "n_tokens": 2500,
 "temperature": 0.7,
 "top_k": 4,

semantic_mining/sample_configs/operon_completion.json

+1 -1
@@ -13,7 +13,7 @@
 "segmasker_path": "/usr/local/bin/ncbi-blast-2.16.0+/bin/segmasker",
 "mafft_path": "/usr/local/bin/mafft/bin/mafft",
 
-"model_name": "evo_1.5_8k_hf",
+"model_name": "evo-1.5-8k-base",
 "n_tokens": 2500,
 "temperature": 0.7,
 "top_k": 4,

semantic_mining/sample_configs/toxin_antitoxin_sample.json

+1 -1
@@ -11,7 +11,7 @@
 "shared_protein_pairs_csv": "/semantic_mining/test/paired_candidates.csv",
 "cofold_fasta": "/semantic_mining/test/paired_candidates.fasta",
 
-"model_name": "evo_1.5_8k_hf",
+"model_name": "evo-1.5-8k-base",
 "n_tokens": 2000,
 "temperature": 0.7,
 "top_k": 4,
