
add load checkpoint support for virtual table #4250


Closed · wants to merge 1 commit

Conversation

bobbyliujb

Summary:
After all of the rebasing and landing, trunk is still missing some of the changes needed for checkpoint loading:

* change `create_virtual_table_global_metadata` to respect `local_weight_count` on each rank, or simply use the param size as the number of rows on each rank
* use `register_load_state_dict_post_hook` in `ShardedEmbeddingCollection` to register a hook that skips loading the weight tensor (see the sketches after this list)
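Below are two illustrative sketches of what these changes could look like. They are minimal approximations, not torchrec's actual implementation: `build_virtual_table_shards`, `local_row_counts`, `emb_dim`, and `VirtualTableModule` are hypothetical names invented for the example.

First, a sketch of per-rank shard metadata that respects each rank's local row count instead of assuming an even split:

```python
# Hypothetical sketch: build per-rank shard metadata for a virtual table from
# each rank's local row count. The real logic lives in torchrec's
# create_virtual_table_global_metadata; names here are illustrative.
from torch.distributed._shard.metadata import ShardMetadata


def build_virtual_table_shards(local_row_counts, emb_dim):
    shards = []
    row_offset = 0
    for rank, num_rows in enumerate(local_row_counts):
        shards.append(
            ShardMetadata(
                shard_offsets=[row_offset, 0],    # rows stack across ranks
                shard_sizes=[num_rows, emb_dim],  # param size gives the row count
                placement=f"rank:{rank}/cuda:{rank}",
            )
        )
        row_offset += num_rows
    return shards
```

Second, a sketch of the hook registration. `register_load_state_dict_post_hook` is the standard `nn.Module` API; the hook receives the module and an `incompatible_keys` object whose `missing_keys`/`unexpected_keys` lists can be edited in place, which is how the weight tensor can be excluded from strict-loading checks:

```python
import torch
import torch.nn as nn


class VirtualTableModule(nn.Module):
    """Hypothetical stand-in for ShardedEmbeddingCollection."""

    def __init__(self):
        super().__init__()
        # Virtual-table weight: materialized elsewhere, not read from checkpoints.
        self.weight = nn.Parameter(torch.empty(0))
        self.register_load_state_dict_post_hook(self._ignore_virtual_weight)

    @staticmethod
    def _ignore_virtual_weight(module, incompatible_keys):
        # Drop the virtual weight from missing_keys so load_state_dict(strict=True)
        # succeeds even though the checkpoint does not carry this tensor.
        incompatible_keys.missing_keys[:] = [
            k for k in incompatible_keys.missing_keys if not k.endswith("weight")
        ]


m = VirtualTableModule()
m.load_state_dict({}, strict=True)  # no missing-key error for the virtual weight
```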

Differential Revision: D75843542

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D75843542


netlify bot commented Jun 3, 2025

Deploy Preview for pytorch-fbgemm-docs failed.

🔨 Latest commit: 504dec1
🔍 Latest deploy log: https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/683f6e9333afae000831661e


bobbyliujb pushed a commit to bobbyliujb/torchrec that referenced this pull request Jun 3, 2025
facebook-github-bot pushed a commit to pytorch/torchrec that referenced this pull request Jun 4, 2025
Summary:
Pull Request resolved: #3037

X-link: facebookresearch/FBGEMM#1329

X-link: pytorch/FBGEMM#4250

After all of the rebasing and landing, trunk is still missing some of the changes needed for checkpoint loading:
* change `create_virtual_table_global_metadata` to respect `local_weight_count` on each rank, or simply use the param size as the number of rows on each rank
* use `register_load_state_dict_post_hook` in `ShardedEmbeddingCollection` to register a hook that skips loading the weight tensor

Reviewed By: emlin

Differential Revision: D75843542

Privacy Context Container: L1138451

fbshipit-source-id: 8b3c8d76bb2e7ba2137c8899de2c03d534f1365c
@facebook-github-bot
Contributor

This pull request has been merged in ee0264c.
