-
Notifications
You must be signed in to change notification settings - Fork 32
Open
Description
During training, I found the training procedure crashes when running
Lumina-mGPT/xllmx/util/ckpt.py
Line 91 in 104abe4
consolidated_model_state_dict = {key: val.to(save_dtype) for key, val in model.state_dict().items()} |
And the error is:
AssertionError: FSDP assumes model.norm.weight is in the state_dict but the state_dict only has odict_keys
Metadata
Metadata
Assignees
Labels
No labels