-
Notifications
You must be signed in to change notification settings - Fork 31.2k
Open
Labels
Description
Model description
Evo2 is a Decoder only StripedHyena2-based causal LM that models DNA in the same fashion frontier models are based on pretrained only models. Huggingface has similar models available but Evo2 is state of the are on long range dependency modeling and down stream variant prediction.
Evo2 in huggingface would allow Bio x ML folks to experiment with many down stream tasks such as variant prediction, cancer modeling, genomic design and more.
Evo2 uses a DNA base pair tokenizer.
Open source status
- The model implementation is available
- The model weights are available
Provide useful links for the implementation
@ArcInstite originally implemented a non HF compatible version here: https://github.com/ArcInstitute/evo2