
gradient accumulation #54


Merged — rousseab merged 2 commits into main from gradient_accumulation on Jun 6, 2024

Conversation

sblackburn86 (Collaborator):

Adding a kwarg to allow for gradient accumulation.
In the normal MACE and the MLP score network there is no batchnorm, so gradient accumulation is exact: summing the gradients of several micro-batches is equivalent to computing the gradient of one larger batch. Here, simply passing an argument to the pytorch-lightning Trainer does the trick.

TBD what happens with DiffusionMACE and the o3.batchnorm, whose batch statistics are computed per micro-batch rather than per accumulated batch...
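A minimal sketch of the pytorch-lightning mechanism this relies on, the Trainer's accumulate_grad_batches kwarg; the toy module, dataset, and the value 4 are illustrative assumptions, not this repository's code:

```python
import pytorch_lightning as pl
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

class ToyScoreNetwork(pl.LightningModule):
    """Batchnorm-free toy model, so gradient accumulation is exact."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(8, 16), nn.SiLU(), nn.Linear(16, 1))

    def training_step(self, batch, batch_idx):
        x, y = batch
        return nn.functional.mse_loss(self.net(x), y)

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-3)

data = TensorDataset(torch.randn(64, 8), torch.randn(64, 1))

# accumulate_grad_batches=4: gradients from 4 consecutive micro-batches
# of size 8 are accumulated before each optimizer step, emulating an
# effective batch size of 32.
trainer = pl.Trainer(max_epochs=1, accumulate_grad_batches=4,
                     logger=False, enable_checkpointing=False)
trainer.fit(ToyScoreNetwork(), DataLoader(data, batch_size=8))
```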

@rousseab (Collaborator) left a comment:


The PR only shows changes in the config files. The new argument is not passed to the Trainer... Maybe changes in train_diffusion.py were not committed?
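For illustration, the wiring being asked about might look like this in train_diffusion.py; the hyper_params dict and its key names are assumptions, not the PR's actual code:

```python
import pytorch_lightning as pl

# Hypothetical: hyper-parameters as parsed from the experiment config file.
hyper_params = {"max_epoch": 10, "accumulate_grad_batches": 4}

trainer = pl.Trainer(
    max_epochs=hyper_params["max_epoch"],
    # Fall back to 1 (i.e. no accumulation) when the config omits the key.
    accumulate_grad_batches=hyper_params.get("accumulate_grad_batches", 1),
)
```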

@rousseab (Collaborator) left a comment:


LGTM!

rousseab merged commit 095167b into main on Jun 6, 2024
1 check passed
rousseab deleted the gradient_accumulation branch on June 6, 2024 at 18:13