
[WIP] LAMMPS Flows #1185


Open · wants to merge 207 commits into main

Conversation


@vir-k01 vir-k01 commented Apr 22, 2025

Summary

This is an effort that picks up from #173 to incorporate workflows to run LAMMPS in atomate2. Quite a bit of the initial code was taken from the atomate2-lammps add-on written by @ml-evs and @gbrunin. The input set generator and templates have been moved to pymatgen.io.lammps, and a concurrent PR has been opened to integrate them into pymatgen. Also tagging @esoteric-ephemera, who helped structure some of the code here, and @davidwaroquiers for their interest in this PR.

  • Function to call LAMMPS based on settings provided in the atomate2.yaml config file, including running in parallel with MPI.
  • Base Maker that generates inputs, runs LAMMPS, and parses the outputs into a LAMMPS TaskDoc.
  • Implemented sets and makers for common MD simulations (NVE/NVT/NPT), and a job to perform geometry minimization under an applied pressure.
  • Implemented a flow to melt, then quench, then thermalize a structure, suitable for creating liquids/glasses/general phase transformations.
  • Wrote a converter to parse LAMMPS dump files into ASE/pymatgen trajectories (this may need tweaking to match how other workflows handle this step).
  • Implemented a CustomLammpsMaker that takes a user-written input file (for jobs that aren't a simple combination of NVE/NVT/NPT steps, or for more complicated LAMMPS simulations) and user-specified settings. This maker is a port of the LAMMPS implementation in atomate, and I expect it to be the most used by existing LAMMPS users; see the usage sketch after this list.
  • Added mock_lammps and basic tests for the sets, jobs, and schemas.
  • Added a notebook under tutorials showing how to set up and use the makers.
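
A minimal usage sketch of the makers described above. The import paths, maker names, and keyword arguments here are assumptions for illustration only, not necessarily the final API:

# Hypothetical usage sketch: module paths and maker/keyword names are placeholders.
from jobflow import run_locally
from pymatgen.core import Structure

from atomate2.lammps.flows.core import MeltQuenchThermalizeMaker  # assumed path/name
from atomate2.lammps.jobs.core import CustomLammpsMaker           # assumed path/name

structure = Structure.from_file("POSCAR")

# Pre-built flow: melt, then quench, then thermalize the structure
flow = MeltQuenchThermalizeMaker().make(structure)
run_locally(flow, create_folders=True)

# Or pipe a hand-written LAMMPS input file into a custom job
custom_job = CustomLammpsMaker(input_file="in.lammps").make(structure)  # assumed kwarg
run_locally(custom_job, create_folders=True)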

TODO

  • These flows are primarily designed around solids (interfaced through the pymatgen Structure) with forcefields that rely on pair_styles (including MLIPs), and as such all design decisions, units, default values, and validation checks are tuned for solids. I'm open to suggestions on how the current implementation can be extended to molecules. (Any changes in this regard will also require changes to the pymatgen PR.)
  • Dump files are presently stored in their entirety as strings in the job store, and are additionally parsed and stored as an ASE/pymatgen trajectory if the user requests it (see the sketch after this list). This avoids parsing and storing exceedingly large dump files as the heavier trajectory objects; however, storing the raw files as strings could itself become prohibitively expensive for large classical MD simulations. Any suggestions on how to deal with such problems are appreciated!
  • I haven't tested these flows with Kokkos/GPU or the other LAMMPS add-ons yet.
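
A sketch of the dump-to-trajectory conversion mentioned above (illustrative only, not the PR's converter); it assumes a text-format dump file named dump.lammpstrj:

from ase.io import read
from pymatgen.core.trajectory import Trajectory
from pymatgen.io.ase import AseAtomsAdaptor

# Read every frame of a LAMMPS text dump as ASE Atoms objects
frames = read("dump.lammpstrj", format="lammps-dump-text", index=":")

# Convert to pymatgen Structures and wrap them as a Trajectory
# (NB: mapping LAMMPS atom types back to elements may need extra handling)
structures = [AseAtomsAdaptor.get_structure(atoms) for atoms in frames]
traj = Trajectory.from_structures(structures, constant_lattice=False)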

vir-k01 and others added 30 commits October 24, 2024 16:54
…cture, copied over all the work done on atomate2-lammps (by @ml-evs and @gbrunin at Matgenix). The basic functions have been implemented; the remaining tasks are to generate the right input sets for the wide range of LAMMPS calculations that can be done, and to write a task doc that can handle the outputs of these calculations. Will update run.py once I get it to work with a compiled version of LAMMPS for a simple test case.
… are probably better expressed in the LAMMPS_CMD, which is specified through the environment's atomate2.yaml file.
…rs from pmg, better handling of inputs to the makers. TODO: make init more readable, and allow for better management of how upstream generators call base set generator
… based on atomate2.ase.utils.TrajectoryObserver. Also accounted for reading in molecules and saving as a trajectory.
…s to json files for easier access, added utility funcs to process settings dicts
…ings to allow restart keyword to be provided in template
…and langevin/berendsen for now. Added nph as a thermostat too.
…for langevin, need for nve integrator for nvt/npt with non-nose-hoover
… take in TaskState and StoreTrajectoryOption objects from emmet for consistency
Member

JaGeo commented Apr 22, 2025

@vir-k01 Thank you!

Before I check out the code in more detail, a naive question: I am not an expert user of LAMMPS, but as far as I know, a Python interface to LAMMPS can be compiled. Are there drawbacks to using this interface rather than input files?

Author

vir-k01 commented Apr 23, 2025

@JaGeo Yes, that's a very valid question. I personally haven't used the Python interface to LAMMPS much, but from what I know of it, it's essentially equivalent to writing templates, since it does not provide actual objects to work with (other than the command-line runner). I'm sure the LAMMPS Python interface has its uses, but in the context of this PR I think templated input files offer a lot more flexibility: anything that isn't a simple NVT/NPT MD run is difficult to express as a structured input set. This way, the user can prepare their input file with whatever approach they're comfortable with (the pymatgen interface to LAMMPS, the native Python LAMMPS interface, the ASE interface, or just a text editor) and pipe that input file into the flows here.
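
To illustrate the "prepare the input however you like and pipe it in" idea, here is a sketch (not this PR's implementation) that fills a user-written template with plain Python before handing it to a CustomLammpsMaker-style job:

from string import Template

# in.template is a user-written LAMMPS input with $-placeholders, e.g.:
#   velocity all create $temperature 42
#   fix 1 all nvt temp $temperature $temperature 0.1
#   run $nsteps
with open("in.template") as f:
    template = Template(f.read())

# Fill the placeholders; safe_substitute leaves unknown keys untouched
with open("in.lammps", "w") as f:
    f.write(template.safe_substitute(temperature=300.0, nsteps=100000))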

Member

JaGeo commented Apr 23, 2025

@vir-k01 Thank you very much for the answer! That sounds very good!

Contributor

@gpetretto gpetretto left a comment


Hi @vir-k01,
thanks a lot for all this work.
Since we are planning to use this, I have made a first review of the code and left some comments.

I have already mentioned it in the comments, but my main concern is how to handle the connection between different jobs. At the moment it seems that the only automation allowed would be passing the output Structure of one Job to the next one. However, for this kind of MD simulation it seems that all the additional information (like velocities and thermostat state) would be necessary for a meaningful connection.
One potential solution would be to use the "restart" feature in LAMMPS. From a quick test, the size of the generated restart file (or of the "data" file from write_data) is relatively small (~50 KB for a system with ~400 atoms). It may be an idea to always write the restart at the end, so that the jobs are always composable, or alternatively at least to ease the addition of write_restart/write_data to the input file in case one wants to chain jobs; a sketch of this idea follows below.
Is there any other way to better ensure the transfer of the required information between two different executions of LAMMPS?
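
A sketch of the composability idea (not part of the PR; the helper name and file names are only illustrative). write_restart, write_data, and read_restart are standard LAMMPS commands:

RESTART_FOOTER = """
write_restart final.restart
write_data    final.data
"""

def append_restart_footer(input_file: str = "in.lammps") -> None:
    # append the footer so every job leaves behind a restartable state
    with open(input_file, "a") as f:
        f.write(RESTART_FOOTER)

# A follow-up job's input could then begin with
#   read_restart final.restart
# instead of reading a fresh data file, carrying over velocities and
# (for supported fixes) thermostat state.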

import numpy as np


class LammpsNVESet(BaseLammpsSetGenerator):
Contributor


The generators should probably be made as dataclasses. Otherwise all the attributes below will be seen as class attributes, instead of instance attributes.

Author


Oh shoot, I did not realize that. I wrote out the __init__ functions this way for the core set generators to allow the user to provide keywords they normally would (such as temperature, nsteps, etc. for NVT) without having to create a LammpsSettings object beforehand. I can make this change, but won't that still require writing out an __init__ function, since this logic can't be moved into __post_init__? (See the sketch below for one possibility.)
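
A minimal sketch of how the keyword convenience could survive the move to dataclasses (illustrative only; the class and field names are placeholders, and the real base class would differ):

from dataclasses import dataclass, field


@dataclass
class BaseLammpsSetGenerator:  # stand-in for the actual base class
    settings: dict = field(default_factory=dict)


@dataclass
class LammpsNVTSet(BaseLammpsSetGenerator):
    temperature: float = 300.0
    nsteps: int = 10000

    def __post_init__(self) -> None:
        # fold the convenience keywords into the settings dict,
        # which is what the hand-written __init__ currently does
        self.settings.setdefault("temperature", self.temperature)
        self.settings.setdefault("nsteps", self.nsteps)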

Lammps input set for NVE MD simulations.
"""
ensemble : MDEnsemble = MDEnsemble.nve
settings : dict = {}
Contributor


Only a dictionary? Why not also allow LammpsSettings, as in the base class?

"""
Lammps input set for NVE MD simulations.
"""
ensemble : MDEnsemble = MDEnsemble.nve
Contributor


Is there a point to this argument, or at least to having it be an atomate2.ase.md.MDEnsemble? I could understand if there were a base class with common code shared among LammpsNVESet, LammpsNVTSet, etc., with each subclass just setting the value of ensemble. But since the class is called LammpsNVESet, is there any other meaningful value to set for ensemble here? The same goes for the other generators.

ensemble : MDEnsemble = MDEnsemble.nve
settings : dict = {}

def __init__(self, **kwargs):
Contributor


Probably making these generators dataclasses and replacing this with a __post_init__ would remove the need for a good part of the code in the __init__?

'''
def __init__(self, dumpfile, store_md_outputs : StoreTrajectoryOption = StoreTrajectoryOption.NO, read_index: str | int = ':') -> None:
self.store_md_outputs = store_md_outputs
self.traj = read(dumpfile, index=read_index) if isinstance(read_index, str) else [read(dumpfile, index=read_index)]
Contributor


This is not entirely correct. In fact, if read_index contains an integer index in the form of a string (for example "1"), read still returns a single Atoms object. I understand that this is a particular case, but it still makes the simple isinstance(read_index, str) check potentially incorrect. A sketch of a more robust normalization follows below.
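
A sketch of a more robust normalization (illustrative only): branch on the return value of read rather than on the type of read_index:

from ase import Atoms
from ase.io import read


def read_frames(dumpfile: str, read_index: str | int = ":") -> list[Atoms]:
    # ase.io.read returns a single Atoms for integer-like indices (including "1")
    # and a list of Atoms for slice strings such as ":", so check the result instead
    result = read(dumpfile, index=read_index)
    return result if isinstance(result, list) else [result]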
