int4 kv #3878
This pull request was exported from Phabricator. Differential Revision: D70508737
Force-pushed from 4a1240c to 72f8d33.
Summary:
X-link: facebookresearch/FBGEMM#968
Enabling int4 KV for Llama 4 numeric evals.
Changes:
1) k_norm
2) zero init dequantization
3) Add NoPE for int4
Reviewed By: summerdengfb
Differential Revision: D70508737
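For readers unfamiliar with int4 KV caches, the following is a minimal sketch of the general idea only: each row of the K or V cache is stored as 4-bit codes plus a per-row scale and shift, and is expanded back to floats before attention. It does not reproduce the FBGEMM kernels touched by this PR, and it does not cover k_norm or NoPE. The function names `quantize_int4` and `dequantize_int4`, the per-row asymmetric scheme, and the nibble packing order are all assumptions made for illustration.

```python
from typing import List, Tuple


def quantize_int4(values: List[float]) -> Tuple[bytes, float, float]:
    """Hypothetical helper: asymmetric int4 quantization of one cache row.

    Returns (packed_codes, scale, shift). This is an illustrative scheme,
    not the layout used by FBGEMM.
    """
    if len(values) % 2 != 0:
        raise ValueError("row length must be even to pack two nibbles per byte")
    lo, hi = min(values), max(values)
    # 4 bits give 16 levels (codes 0..15); guard against a constant row.
    scale = (hi - lo) / 15.0 if hi > lo else 1.0
    shift = lo
    codes = [min(15, max(0, round((v - shift) / scale))) for v in values]
    # Pack two 4-bit codes per byte: even index goes in the low nibble.
    packed = bytes(codes[i] | (codes[i + 1] << 4) for i in range(0, len(codes), 2))
    return packed, scale, shift


def dequantize_int4(packed: bytes, scale: float, shift: float) -> List[float]:
    """Hypothetical helper: expand packed int4 codes back to floats."""
    out: List[float] = []
    for byte in packed:
        out.append((byte & 0x0F) * scale + shift)  # low nibble first
        out.append((byte >> 4) * scale + shift)    # then high nibble
    return out


row = [-1.0, -0.5, 0.0, 0.25, 0.5, 1.0]
packed, scale, shift = quantize_int4(row)
restored = dequantize_int4(packed, scale, shift)
```

In this sketch the packed row takes half a byte per element, and the reconstruction error of each element is bounded by half the scale.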
Force-pushed from 72f8d33 to 47b69e6.
Force-pushed from 47b69e6 to 79a49b0.
Force-pushed from 79a49b0 to 79b7582.
Force-pushed from 79b7582 to 59f51d9.
Force-pushed from 59f51d9 to d32650f.
Force-pushed from d32650f to 009f75a.
Force-pushed from 009f75a to 5796fc5.
Force-pushed from 7cb5fc1 to 0b84c01.
Force-pushed from 0b84c01 to b07514d.
Force-pushed from b07514d to 6da6145.
This pull request has been merged in ab78c2f.
Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/968
Enabling int4 KV for Llama 4 numeric evals.
Changes:
1) k_norm
2) zero init dequantization
3) Add NoPE for int4
Reviewed By: summerdengfb
Differential Revision: D70508737