
Add asymmetric support for Int8Tensor + SmoothQuant#3900

Merged
jcaip merged 4 commits into main from gh/jcaip/12/head
Feb 26, 2026

Conversation

@jcaip
Contributor

@jcaip jcaip commented Feb 17, 2026

Stack from ghstack (oldest at bottom):

Summary:

This PR adds support for asymmetric quantization in
Int8Tensor by adding new optional tensor attributes,
`zero_point` and `act_zero_point`, for the weight and activation
respectively.

It also adds support for asymmetric quantization in our SmoothQuant
implementation.
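
To illustrate the role of a `zero_point`, here is a minimal sketch of per-tensor asymmetric int8 quantization in plain PyTorch. The helper names are made up for this example; this is not the torchao `Int8Tensor` implementation.

```python
import torch

def asymmetric_quantize_int8(x: torch.Tensor):
    """Quantize a float tensor to int8 with a scale and a zero point."""
    qmin, qmax = -128, 127
    x_min, x_max = x.min(), x.max()
    # Asymmetric mapping: stretch [x_min, x_max] over the full int8 range.
    # The zero point shifts the integer grid so that x_min lands near qmin.
    scale = (x_max - x_min) / (qmax - qmin)
    zero_point = torch.round(qmin - x_min / scale).clamp(qmin, qmax)
    qdata = torch.clamp(torch.round(x / scale) + zero_point, qmin, qmax).to(torch.int8)
    return qdata, scale, zero_point.to(torch.int8)

def dequantize_int8(qdata, scale, zero_point):
    # Undo the shift before rescaling; cast first to avoid int8 overflow.
    return (qdata.to(torch.float32) - zero_point.to(torch.float32)) * scale

torch.manual_seed(0)
x = torch.randn(4, 8)
qdata, scale, zero_point = asymmetric_quantize_int8(x)
x_hat = dequantize_int8(qdata, scale, zero_point)
# Round-trip error is bounded by the quantization step size.
assert (x - x_hat).abs().max() <= 2 * scale
```

Symmetric quantization is the special case where the zero point is pinned at 0, which is why `zero_point` (and `act_zero_point` for dynamically quantized activations) can remain optional attributes.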

Test Plan:

```
pytest test/quantization/quantize_/workflows/int8/test_int8_tensor.py
pytest test/prototype/test_smoothquant.py
```

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D94258324
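
For context, the core SmoothQuant trick that this asymmetric support plugs into can be sketched as follows (plain PyTorch with illustrative names, not the torchao implementation): a per-input-channel smoothing factor migrates activation outliers into the weight, leaving the matmul mathematically unchanged while making both factors easier to quantize.

```python
import torch

def smoothing_factor(x_absmax: torch.Tensor, w_absmax: torch.Tensor,
                     alpha: float = 0.5) -> torch.Tensor:
    """Per-input-channel factor s = x_absmax^alpha / w_absmax^(1 - alpha)."""
    eps = 1e-5
    return (x_absmax.clamp(min=eps) ** alpha) / (w_absmax.clamp(min=eps) ** (1 - alpha))

torch.manual_seed(0)
x = torch.randn(16, 64)   # activations: (batch, in_features)
w = torch.randn(32, 64)   # weight: (out_features, in_features)
s = smoothing_factor(x.abs().amax(dim=0), w.abs().amax(dim=0))

# Folding s into the weight leaves the linear layer's output unchanged:
# (x / s) @ (w * s)^T == x @ w^T, channel by channel.
y_ref = x @ w.t()
y_smoothed = (x / s) @ (w * s).t()
assert torch.allclose(y_ref, y_smoothed, atol=1e-4)
```

After smoothing, the scaled activations `x / s` have a tighter dynamic range, and quantizing them asymmetrically (with an activation zero point) captures any remaining offset in their distribution.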

@pytorch-bot

pytorch-bot bot commented Feb 17, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3900

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit dd40b73 with merge base 5906856:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jcaip added a commit that referenced this pull request Feb 17, 2026
ghstack-source-id: 5163aa0
Pull Request resolved: #3900
@meta-cla meta-cla bot added the CLA Signed label Feb 17, 2026
jcaip added a commit that referenced this pull request Feb 17, 2026
ghstack-source-id: a146822
Pull Request resolved: #3900
@jcaip jcaip added the module: inference label Feb 17, 2026
```python
tensor: torch.Tensor,
quant_kwargs: QuantizeTensorKwargs,
scale: Optional[torch.Tensor] = None,
zero_point: Optional[torch.Tensor] = None,
```
Contributor
nit: `weight_zero_point` for clear naming?

Contributor Author
I think we should keep this consistent with `scale`; `act_zero_point` is used to denote the activation zero point.

Contributor

ok sounds good to me.

```diff
 int8 quantized tensor with plain layout.

-Currently only Symmetric quantization is supported.
+Supports both symmetric and asymmetric quantization.
```
Contributor
Maybe drop this docstring? The description inside the Tensor Attributes section seems sufficient to me.

Contributor Author

sounds good, will remove

```python
act_quant_kwargs: flags for dynamic activation quantization
"""

tensor_data_names = ["qdata", "scale"]
```


shouldn't `zero_point` be required for `MappingType.ASYMMETRIC` here?
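
A minimal sketch of the invariant being asked about, with hypothetical names (the real `MappingType` lives in torchao; it is re-declared here only to keep the example self-contained, and the check itself is illustrative, not the merged code):

```python
from enum import Enum, auto
from typing import Optional

import torch

class MappingType(Enum):
    SYMMETRIC = auto()
    ASYMMETRIC = auto()

def check_zero_point(mapping_type: MappingType,
                     zero_point: Optional[torch.Tensor]) -> None:
    # Asymmetric quantization shifts the integer grid, so a zero point
    # is mandatory; symmetric quantization centers the grid at zero,
    # so a zero point should not be present.
    if mapping_type is MappingType.ASYMMETRIC and zero_point is None:
        raise ValueError("zero_point is required for MappingType.ASYMMETRIC")
    if mapping_type is MappingType.SYMMETRIC and zero_point is not None:
        raise ValueError("zero_point must be None for MappingType.SYMMETRIC")
```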

```python
)
assert config.version == 2, f"Unexpected version: {config.version}"

# TODO: Symmentric/Asymmetric choice for weight quantization
```


nit: Symmentric -> Symmetric

@hossein1387 hossein1387 left a comment

other than the typo, LGTM.

@jcaip jcaip changed the base branch from gh/jcaip/12/base to main February 24, 2026 19:58
@jcaip
Contributor Author

jcaip commented Feb 24, 2026

@jcaip has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Removed TODO comment regarding symmetric/asymmetric weight quantization.
Removed mention of symmetric quantization support from docstring.
@Xia-Weiwen
Collaborator

CC @cyxlily

@jcaip jcaip merged commit 8d65522 into main Feb 26, 2026
34 of 36 checks passed