_2025_Schulze_abundance-model

This repository contains data and scripts to reproduce results from the following preprint:

Thea K. Schulze, Lasse M. Blaabjerg, Matteo Cagiada, Kresten Lindorff-Larsen (2025) Supervised learning of protein variant effects across large-scale mutagenesis datasets. bioRxiv 2025.04.02.646878; doi: https://doi.org/10.1101/2025.04.02.646878

We provide example scripts and data to train and validate supervised models against VAMP-seq abundance scores:

/output/models.zip: A selection of pretrained models corresponding to those reported in the preprint.
/output/feature_set_1: Examples of scripts to run training and validation pipeline for different versions of our model architecture.
/data/df_vamp.csv: All training data (VAMP-seq scores for six different proteins) and model input features (structure-based features, including ESM-IF scores).

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
output		output
scripts		scripts
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

_2025_Schulze_abundance-model

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

KULL-Centre/_2025_Schulze_abundance-model

Folders and files

Latest commit

History

Repository files navigation

_2025_Schulze_abundance-model

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages