add load checkpoint support for virtual table #3037


Closed
wants to merge 1 commit

Conversation

bobbyliujb

Summary:
X-link: pytorch/FBGEMM#4250

After all of the rebasing and landing, trunk is still missing some of the changes needed for checkpoint loading:

* change `create_virtual_table_global_metadata` to respect `local_weight_count` on each rank, or fall back to the param size as the number of rows on each rank
* register a `load_state_dict` post-hook (via `register_load_state_dict_post_hook`) in `ShardedEmbeddingCollection` so that loading skips the weight tensor
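The first bullet can be sketched roughly as follows. This is a minimal illustration of the fallback logic described above, not torchrec's actual `create_virtual_table_global_metadata`; the function and argument names here are hypothetical.

```python
# Hypothetical sketch: choose the number of rows for each rank of a virtual
# table, preferring the rank's reported local_weight_count and falling back
# to the local param size when no count is available.
from typing import List, Optional


def rows_per_rank(
    local_weight_counts: List[Optional[int]],
    param_rows_per_rank: List[int],
) -> List[int]:
    # Prefer the count each rank reports for its virtual table; otherwise
    # use the size of the local parameter as the number of rows on that rank.
    return [
        count if count is not None else rows
        for count, rows in zip(local_weight_counts, param_rows_per_rank)
    ]


# e.g. rank 1 reported no count, so its param size (512) is used:
print(rows_per_rank([100, None, 300], [512, 512, 512]))  # [100, 512, 300]
```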
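For the second bullet, `register_load_state_dict_post_hook` is the standard `torch.nn.Module` API for this: the hook receives the `incompatible_keys` result, and removing a key from `missing_keys` makes a strict load succeed even though the checkpoint carries no entry for it. Below is a standalone sketch of the pattern on a toy module; `VirtualTableModule` is illustrative and not torchrec's `ShardedEmbeddingCollection`.

```python
# Hypothetical sketch: ignore a virtual-table weight tensor during strict
# load_state_dict by clearing it from missing_keys in a post-hook.
import torch
import torch.nn as nn


class VirtualTableModule(nn.Module):
    def __init__(self) -> None:
        super().__init__()
        # Stand-in for a virtual-table weight that is materialized elsewhere
        # and must not be read from the checkpoint.
        self.weight = nn.Parameter(torch.zeros(4, 8))
        self.register_load_state_dict_post_hook(self._ignore_weight)

    @staticmethod
    def _ignore_weight(module: nn.Module, incompatible_keys) -> None:
        # Dropping the key from missing_keys suppresses the strict-loading
        # error for the absent "weight" entry.
        incompatible_keys.missing_keys[:] = [
            k for k in incompatible_keys.missing_keys if not k.endswith("weight")
        ]


m = VirtualTableModule()
m.load_state_dict({}, strict=True)  # no error: the hook dropped the key
```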

Differential Revision:
D75843542

Privacy Context Container: L1138451

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 3, 2025
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D75843542

facebook-github-bot pushed a commit to pytorch/FBGEMM that referenced this pull request Jun 4, 2025
Summary:
X-link: pytorch/torchrec#3037

X-link: facebookresearch/FBGEMM#1329

Pull Request resolved: #4250

After all of the rebasing and landing, trunk is still missing some of the changes needed for checkpoint loading:
* change `create_virtual_table_global_metadata` to respect `local_weight_count` on each rank, or fall back to the param size as the number of rows on each rank
* register a `load_state_dict` post-hook (via `register_load_state_dict_post_hook`) in `ShardedEmbeddingCollection` so that loading skips the weight tensor

Reviewed By: emlin

Differential Revision:
D75843542

Privacy Context Container: L1138451

fbshipit-source-id: 8b3c8d76bb2e7ba2137c8899de2c03d534f1365c
Labels: CLA Signed, fb-exported