[ET-VK] Modify quantized linear naive shader to linearly dispatch work to improve performance. #10116

trivedivivek · 2025-04-11T19:51:37Z

Stack from ghstack (oldest at bottom):

This diff changes naive quantized linear mat mul op to use push constant instead of uniform buffers and change dispatch pattern to linear to improve performance.

Differential Revision: D72862490

…k to improve performance. This diff changes naive quantized linear mat mul op to use push constant instead of uniform buffers and change dispatch pattern to linear to improve performance. Differential Revision: [D72862490](https://our.internmc.facebook.com/intern/diff/D72862490/) [ghstack-poisoned]

pytorch-bot · 2025-04-11T19:51:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10116

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 07cf9b1 with merge base c352672 ():

NEW FAILURES - The following jobs have failed:

Check Labels / Check labels (gh)
RuntimeError: Error checking labels: PR does not have required labels
Lint / lintrunner / linux-job (gh)
>>> Lint for backends/vulkan/runtime/graph/ops/impl/QuantizedLinearInt8.cpp:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-04-11T19:52:09Z

This PR needs a `release notes:` label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

facebook-github-bot · 2025-04-11T19:52:39Z

This pull request was exported from Phabricator. Differential Revision: D72862490

…ispatch work to improve performance." This diff changes naive quantized linear mat mul op to use push constant instead of uniform buffers and change dispatch pattern to linear to improve performance. Differential Revision: [D72862490](https://our.internmc.facebook.com/intern/diff/D72862490/) [ghstack-poisoned]

facebook-github-bot · 2025-04-14T05:13:54Z

This pull request was exported from Phabricator. Differential Revision: D72862490

…ispatch work to improve performance." This diff changes naive quantized linear mat mul op to use push constant instead of uniform buffers and change dispatch pattern to linear to improve performance. Differential Revision: [D72862490](https://our.internmc.facebook.com/intern/diff/D72862490/) [ghstack-poisoned]

facebook-github-bot · 2025-04-14T14:30:11Z

This pull request was exported from Phabricator. Differential Revision: D72862490

trivedivivek requested a review from SS-JIA as a code owner April 11, 2025 19:51

trivedivivek mentioned this pull request Apr 8, 2025

[ET-VK] Minor performance improvements to native layer norm. #9892

Open

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 11, 2025

This was referenced Apr 9, 2025

[ET-VK] Tuning native layer norm local workgroup size to improve thread occupancy during reduce. #9984

Open

[ET-VK] Minor improvement to permute op. #10117

Open

facebook-github-bot added the fb-exported label Apr 11, 2025

SS-JIA approved these changes Apr 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ET-VK] Modify quantized linear naive shader to linearly dispatch work to improve performance. #10116

[ET-VK] Modify quantized linear naive shader to linearly dispatch work to improve performance. #10116

trivedivivek commented Apr 11, 2025 •

edited

Loading

pytorch-bot bot commented Apr 11, 2025 •

edited

Loading

github-actions bot commented Apr 11, 2025

facebook-github-bot commented Apr 11, 2025

facebook-github-bot commented Apr 14, 2025

facebook-github-bot commented Apr 14, 2025

[ET-VK] Modify quantized linear naive shader to linearly dispatch work to improve performance. #10116

Are you sure you want to change the base?

[ET-VK] Modify quantized linear naive shader to linearly dispatch work to improve performance. #10116

Conversation

trivedivivek commented Apr 11, 2025 • edited Loading

pytorch-bot bot commented Apr 11, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10116

❌ 2 New Failures

github-actions bot commented Apr 11, 2025

This PR needs a release notes: label

facebook-github-bot commented Apr 11, 2025

facebook-github-bot commented Apr 14, 2025

facebook-github-bot commented Apr 14, 2025

trivedivivek commented Apr 11, 2025 •

edited

Loading

pytorch-bot bot commented Apr 11, 2025 •

edited

Loading

This PR needs a `release notes:` label