We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In order to enable 1D sharding, we need to add these to the CLI:
--fsdp "full_shard" --fsdp_config ~/fsdp_config.json
and the fsdp_config.json file is
{ "fsdp_transformer_layer_cls_to_wrap": [ "LlamaDecoderLayer" ], "xla": true, "xla_fsdp_v2": true, "xla_fsdp_grad_ckpt": true }
However, in order to enable 2D sharding, we need to instead put this on the CLI:
--spmd_2d_sharding 2
(remember to remove fsdp related flags)
This is very confusing.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
In order to enable 1D sharding, we need to add these to the CLI:
and the fsdp_config.json file is
However, in order to enable 2D sharding, we need to instead put this on the CLI:
(remember to remove fsdp related flags)
This is very confusing.
The text was updated successfully, but these errors were encountered: