int4 kv #3878

Closed

Aya-ZIbra
Contributor

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/968

Enables the int4 KV cache for Llama 4 numeric evals.
Changes:

  1. k_norm
  2. Zero-init dequantization
  3. Add NoPE for int4
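
For readers new to int4 KV caches, the following is a minimal, generic sketch of 4-bit quantization and dequantization of a single row of values. It is not the kernel changed in this PR: the function names `quantize_int4` and `dequantize_int4`, the per-row scale/shift scheme, and the nibble order are all assumptions chosen for illustration.

```python
from typing import List, Tuple


def quantize_int4(row: List[float]) -> Tuple[bytes, float, float]:
    """Asymmetric 4-bit quantization of one row (hypothetical helper).

    Returns (packed_bytes, scale, shift) where value ~= code * scale + shift.
    """
    if len(row) % 2 != 0:
        raise ValueError("row length must be even to pack two values per byte")
    lo, hi = min(row), max(row)
    # 4 bits give 16 levels (codes 0..15); fall back to 1.0 for a constant row.
    scale = (hi - lo) / 15.0 or 1.0
    codes = [min(15, max(0, round((x - lo) / scale))) for x in row]
    # Pack two 4-bit codes per byte: even index in the low nibble (assumed order).
    packed = bytes(codes[i] | (codes[i + 1] << 4) for i in range(0, len(codes), 2))
    return packed, scale, lo


def dequantize_int4(packed: bytes, scale: float, shift: float) -> List[float]:
    """Unpack both nibbles of each byte and map the codes back to floats."""
    out: List[float] = []
    for byte in packed:
        out.append((byte & 0x0F) * scale + shift)
        out.append((byte >> 4) * scale + shift)
    return out
```

With this scheme each value is stored in half a byte, and the round-trip error per element is bounded by roughly half of `scale`.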

Differential Revision: D70508737

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D70508737

netlify bot commented Mar 25, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 6da6145
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/681037fe15289100083051a6
😎 Deploy Preview https://deploy-preview-3878--pytorch-fbgemm-docs.netlify.app

Aya-ZIbra added a commit to Aya-ZIbra/FBGEMM that referenced this pull request Apr 7, 2025
Summary:

X-link: facebookresearch/FBGEMM#968

Enabling int4 KV for LLama4 numeric evals
Changes:
1) k_norm
2) zero init dequantization.
3) Add NoPE for int4

Reviewed By: summerdengfb

Differential Revision: D70508737
Aya-ZIbra added a commit to Aya-ZIbra/FBGEMM that referenced this pull request Apr 7, 2025
Summary:
Pull Request resolved: pytorch#3878

X-link: facebookresearch/FBGEMM#968

Enabling int4 KV for LLama4 numeric evals
Changes:
1) k_norm
2) zero init dequantization.
3) Add NoPE for int4

Reviewed By: summerdengfb

Differential Revision: D70508737
Aya-ZIbra added a commit to Aya-ZIbra/FBGEMM that referenced this pull request Apr 28, 2025
Summary:

X-link: facebookresearch/FBGEMM#968

Enabling int4 KV for LLama4 numeric evals
Changes:
1) k_norm
2) zero init dequantization.
3) Add NoPE for int4

Reviewed By: summerdengfb

Differential Revision: D70508737
Aya-ZIbra added a commit to Aya-ZIbra/FBGEMM that referenced this pull request Apr 28, 2025
Summary:
Pull Request resolved: pytorch#3878

X-link: facebookresearch/FBGEMM#968

Enabling int4 KV for LLama4 numeric evals
Changes:
1) k_norm
2) zero init dequantization.
3) Add NoPE for int4

Reviewed By: summerdengfb

Differential Revision: D70508737
@Aya-ZIbra Aya-ZIbra force-pushed the export-D70508737 branch 2 times, most recently from 7cb5fc1 to 0b84c01 Compare April 29, 2025 02:10
Aya-ZIbra added a commit to Aya-ZIbra/FBGEMM that referenced this pull request Apr 29, 2025
Summary:

X-link: facebookresearch/FBGEMM#968

Enabling int4 KV for LLama4 numeric evals
Changes:
1) k_norm
2) zero init dequantization.
3) Add NoPE for int4

Reviewed By: summerdengfb

Differential Revision: D70508737
Aya-ZIbra added a commit to Aya-ZIbra/FBGEMM that referenced this pull request Apr 29, 2025
Summary:
Pull Request resolved: pytorch#3878

X-link: facebookresearch/FBGEMM#968

Enabling int4 KV for LLama4 numeric evals
Changes:
1) k_norm
2) zero init dequantization.
3) Add NoPE for int4

Reviewed By: summerdengfb

Differential Revision: D70508737
@facebook-github-bot
Contributor

This pull request has been merged in ab78c2f.
