Skip to content

inference eviction interfaces #4500

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 2 commits into from

Conversation

chenyuzhcy
Copy link

Summary: Add a few inference eviction interfaces. The are interfaces breaking changes, but since model is not in production and it's not affecting other models so it should be fine.

Reviewed By: duduyi2013

Differential Revision: D78010843

Chenyu Zhang added 2 commits July 15, 2025 15:50
Summary:
X-link: facebookresearch/FBGEMM#1553

Add chunk loading for kv weights, the code path is shared for both initial load and inplace update. We use different flag to control the chunk size.

Reviewed By: emlin

Differential Revision:
D76995700

Privacy Context Container: L1299884
Summary: Add a few inference eviction interfaces. The are interfaces breaking changes, but since model is not in production and it's not affecting other models so it should be fine.

Reviewed By: duduyi2013

Differential Revision: D78010843
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78010843

Copy link

netlify bot commented Jul 15, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 77b9b43
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/6876db440591c200089e2877
😎 Deploy Preview https://deploy-preview-4500--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 10363db.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants