Skip to content

Add Evo2 Genomic foundation model #42209

@McClain-Thiel

Description

@McClain-Thiel

Model description

Evo2 is a Decoder only StripedHyena2-based causal LM that models DNA in the same fashion frontier models are based on pretrained only models. Huggingface has similar models available but Evo2 is state of the are on long range dependency modeling and down stream variant prediction.

Evo2 in huggingface would allow Bio x ML folks to experiment with many down stream tasks such as variant prediction, cancer modeling, genomic design and more.

Evo2 uses a DNA base pair tokenizer.

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

@ArcInstite originally implemented a non HF compatible version here: https://github.com/ArcInstitute/evo2

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions