Export basic datasets to hdf5 #196
Open
Description
This PR adds the ability to export basic datasets to HDF5 files. Currently we extract the energies and gradients, but more properties could be included in the future.
Layout
Each molecule has its own group, named by its fixed hydrogen layer InChIKey, with the following datasets:
- `smiles`: The mapped explicit-hydrogen SMILES, which can be used to construct the molecule via the openff-toolkit using `Molecule.from_mapped_smiles`.
- `atomic_numbers`: An array of the atomic numbers, type int16.
- `charge`: The total charge on the molecule, calculated by the openff-toolkit as the sum of formal charges, type int16.
- `specification`: The `method:basis` used to compute the results, type h5py string.
- `energies`: An array of energies for the molecule in units of `hartree`, in the same order as the conformations, so `energies[i]` corresponds to `conformations[i]`, type float64.
- `conformations`: An array of conformations for the molecule in `bohr`, type float64.
- `gradients`: An array of gradients in units of `hartree / bohr`, in the same order as the conformations, so `gradients[i]` corresponds to `conformations[i]`, type float64.
Todos
Notable points that this PR has either accomplished or will accomplish.
Questions
Status
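For reviewers who want to inspect an exported file, here is a minimal sketch of the group layout described in the Layout section, written and read back with h5py. It is not the export code from this PR. The helper names `write_molecule` and `read_molecule`, the group key, the specification string, and all numeric values are hypothetical placeholders. The array shapes, `(n_conformers, n_atoms, 3)` for conformations and gradients, are an assumption, since the description only gives dtypes and units.

```python
import os
import tempfile

import h5py
import numpy as np


def write_molecule(h5_file, inchikey, smiles, atomic_numbers, charge,
                   specification, energies, conformations, gradients):
    """Write one molecule group (hypothetical helper, not the PR's exporter)."""
    group = h5_file.create_group(inchikey)
    # Variable-length strings use the h5py string dtype.
    group.create_dataset("smiles", data=smiles, dtype=h5py.string_dtype())
    group.create_dataset("specification", data=specification,
                         dtype=h5py.string_dtype())
    group.create_dataset("atomic_numbers",
                         data=np.asarray(atomic_numbers, dtype=np.int16))
    group.create_dataset("charge", data=charge, dtype=np.int16)
    # energies[i] and gradients[i] belong to conformations[i].
    group.create_dataset("energies", data=np.asarray(energies, dtype=np.float64))
    group.create_dataset("conformations",
                         data=np.asarray(conformations, dtype=np.float64))
    group.create_dataset("gradients",
                         data=np.asarray(gradients, dtype=np.float64))
    return group


def read_molecule(h5_file, inchikey):
    """Read one molecule group back into a plain dictionary."""
    group = h5_file[inchikey]
    return {
        "smiles": group["smiles"].asstr()[()],
        "specification": group["specification"].asstr()[()],
        "atomic_numbers": group["atomic_numbers"][()],
        "charge": int(group["charge"][()]),
        "energies": group["energies"][()],            # hartree
        "conformations": group["conformations"][()],  # bohr
        "gradients": group["gradients"][()],          # hartree / bohr
    }


if __name__ == "__main__":
    n_conformers, n_atoms = 2, 3  # toy water-like molecule, placeholder data
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "dataset.hdf5")
        with h5py.File(path, "w") as f:
            write_molecule(
                f,
                inchikey="EXAMPLE-KEY",           # placeholder group name
                smiles="[H:2][O:1][H:3]",
                atomic_numbers=[8, 1, 1],
                charge=0,
                specification="method:basis",     # placeholder specification
                energies=[-1.0, -2.0],            # placeholder values
                conformations=np.zeros((n_conformers, n_atoms, 3)),
                gradients=np.zeros((n_conformers, n_atoms, 3)),
            )
        with h5py.File(path, "r") as f:
            for key in f:
                record = read_molecule(f, key)
                for i, energy in enumerate(record["energies"]):
                    print(key, i, energy, record["gradients"][i].shape)
```

Pairing by index in the final loop relies on the ordering guarantee stated in the layout: entry `i` of `energies` and `gradients` refers to entry `i` of `conformations`.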