Save model info #336

ElliottKasoar · 2024-10-22T18:05:43Z

Resolves #327

For model_path (or equivalent inputs e,g, model), this adds mlip_model as an info label when structures are output.

For strings and paths, this is fine. For already-loaded torch models, it's less clear what the best label is, so I'm open to alternatives.

This will also interact with any implementation of #240, but we can deal with that when we come to it.

@alinelena, was the suggested info['{arch}_{model}'] = model replacing info["arch"] or additional? To me it doesn't make sense to have "{arch}_model" as well as "arch", which is how I originally interpreted it, but perhaps it makes sense to combine them into a single info label, which I just realised may be what you meant.

janus_core/helpers/utils.py

alinelena · 2024-10-31T06:36:00Z

we need to think about this one, what happens if we have two calculations, both arch=mace but model=small and model=large, how will the extxyz look and similarly if we have some models passed by paths(do we get only the basename for mlip?). I think will be nice to document all this structure of what we want and then review the code accordingly

ElliottKasoar · 2024-10-31T12:04:07Z

we need to think about this one, what happens if we have two calculations, both arch=mace but model=small and model=large, how will the extxyz look and similarly if we have some models passed by paths(do we get only the basename for mlip?). I think will be nice to document all this structure of what we want and then review the code accordingly

Good question. At the moment I think having the same arch is relatively clean, if not perfect, since info["arch]", info["mlip_model"], and labelled results should all just be replaced. This means we can't save both sets of results, but at least I don't think it should be a confusing mix of both.

E.g. arch=mace, mlip_model=small, mace_energy=x -> arch=mace, mlip_model=large, mace_energy=y

When we change architecture, this is a bit more complicated, since we'd change info["arch]", info["mlip_model"], but keep both sets of results.

E.g. arch=mace_off, mlip_model=small, mace_off_energy=x -> arch=mace_mp, mlip_model=large, mace_off_energy=x, mace_mp_energy=y.

I implemented it this way for now since it means mlip_model essentially works the same as arch already does - we can only keep the last one, even if we add new results.

We could change both to potentially be lists (if more than one value), either as two individual lists or a combined entry {arch}_model={model}.

This does make things more complicated though, since we'd then want to do something like convert repeated {arch}_{property} entries e.g. mace_energy into a list too, but only if the model has changed...

similarly if we have some models passed by paths(do we get only the basename for mlip?)

Regarding paths, at the moment model paths are saved as str(Path(model_path).expanduser()) e.g. ~ expands to mlip_model=/home/ek/.cache/mace/46jrkm3v, but you might get mlip_model=46jrkm3v for a relative model path. Maybe we should change this to absolute.

alinelena · 2024-10-31T15:03:20Z

ok I think we need to document very well the behaviour of this with examples since there is no obvious solution.

docs/source/user_guide/python.rst

tests/test_singlepoint_cli.py

Co-authored-by: Jacob Wilkins <[email protected]>

ElliottKasoar added the enhancement New/improved feature or request label Oct 22, 2024

ElliottKasoar requested review from alinelena and oerc0122 October 22, 2024 18:05

ElliottKasoar commented Oct 22, 2024

View reviewed changes

janus_core/helpers/utils.py Outdated Show resolved Hide resolved

janus_core/helpers/utils.py Outdated Show resolved Hide resolved

ElliottKasoar force-pushed the main branch from 6dc02b2 to 4781620 Compare October 28, 2024 11:25

ElliottKasoar force-pushed the update-arch-info branch from 65cd5f1 to f2f38c5 Compare October 31, 2024 12:04

ElliottKasoar added 3 commits November 4, 2024 19:35

Save model info

1521ee2

Test singlepoint model info saved

d70118e

Fix saving model

3984edd

ElliottKasoar force-pushed the update-arch-info branch from f2f38c5 to 4b2fc67 Compare November 4, 2024 19:43

ElliottKasoar added 2 commits November 4, 2024 20:06

Change info label for consistency

ccd2ee2

Describe output info

778e777

ElliottKasoar force-pushed the update-arch-info branch from 4b2fc67 to 778e777 Compare November 4, 2024 20:06

Add reference to aiida mlip

7f6ecff

oerc0122 reviewed Nov 8, 2024

View reviewed changes

docs/source/user_guide/python.rst Outdated Show resolved Hide resolved

tests/test_singlepoint_cli.py Outdated Show resolved Hide resolved

Update docs/source/user_guide/python.rst

ee72b37

Co-authored-by: Jacob Wilkins <[email protected]>

ElliottKasoar force-pushed the update-arch-info branch from b186c38 to ee72b37 Compare November 8, 2024 16:00

alinelena approved these changes Nov 14, 2024

View reviewed changes

alinelena merged commit 563742a into stfc:main Nov 14, 2024
8 checks passed

ElliottKasoar deleted the update-arch-info branch November 14, 2024 16:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Save model info #336

Save model info #336

Uh oh!

ElliottKasoar commented Oct 22, 2024

Uh oh!

Uh oh!

Uh oh!

alinelena commented Oct 31, 2024

Uh oh!

ElliottKasoar commented Oct 31, 2024

Uh oh!

alinelena commented Oct 31, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Save model info #336

Save model info #336

Uh oh!

Conversation

ElliottKasoar commented Oct 22, 2024

Uh oh!

Uh oh!

Uh oh!

alinelena commented Oct 31, 2024

Uh oh!

ElliottKasoar commented Oct 31, 2024

Uh oh!

alinelena commented Oct 31, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants