Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Availability of benchmarking datasets from espaloma 0.3.0 #180

Open
LeifSeute opened this issue Jul 21, 2023 · 3 comments
Open

Availability of benchmarking datasets from espaloma 0.3.0 #180

LeifSeute opened this issue Jul 21, 2023 · 3 comments
Labels
paper 🧻 Issues that reference a paper question ❓ Further information is requested reproducibility 🔬 Question about how to reproduce something

Comments

@LeifSeute
Copy link

Hello there!

I would like to reproduce the results from table 1 of the espaloma 0.3.0 paper.
image

Is there a way to obtain the datasets used for creating this table including bonded and nonbonded energies stored for the respective classical forcefields or directly as espaloma graphs? If one loads the data from spice or QC archive, it cannot be parametrized with amberff14sb since the information on residues is missing. For the other forcefields, one has to re-calculate the partial charges in this case.

@mikemhenry
Copy link
Contributor

@LeifSeute Thank you for raising this issue! I will defer to @yuanqing-wang and @kntkb to answer this one

@mikemhenry mikemhenry added question ❓ Further information is requested reproducibility 🔬 Question about how to reproduce something paper 🧻 Issues that reference a paper labels Jul 21, 2023
@kntkb
Copy link
Contributor

kntkb commented Jul 24, 2023

@LeifSeute Thank you for your interest. A pre-filtered dataset ready for training and more information can be found here.

@LeifSeute
Copy link
Author

Thank you for your answer. Unfortunately, I can only find scripts to download data that does not include the nonbonded contribution to the energies and gradients calculated from gaff-2.11 and openff-2.0.0, which are needed to add them to the bonded contributions predicted by espaloma.
For a part of the dataset, I re-calculated them myself, however, this is relatively comp. expensive and I think that this is not economical since these calculations must have been done already to obtain the table referenced above.

Could you provide the full dataset (containing nonbonded energies from said classical force fields) for download, e.g. as hdf5 file like it is the case for the spice dataset (https://zenodo.org/record/7258940)?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
paper 🧻 Issues that reference a paper question ❓ Further information is requested reproducibility 🔬 Question about how to reproduce something
Projects
None yet
Development

No branches or pull requests

3 participants