[mlir-gen] Introduce basic support for quantization ops (2/n). #1089

shahidact · 2025-08-26T14:16:59Z

This patch extends the mixed-precision support and adds basic infrastructure to define or create quantization kernels, to be refined as we get more clarity on the kinds of primitive operations required.

Adding a runtime test is out of scope for this patch.

tools/mlir-gen/MLIRGen.cpp

adam-smnk · 2025-08-27T13:03:41Z

General question, shouldn't we migrate to lighthouse's gen?

test/Integration/mlir-gen-matmul.mlir

rolfmorel · 2025-08-27T14:18:14Z

General question, shouldn't we migrate to lighthouse's gen?

That's indeed a good question. Might be a good example to dogfood how we facilitate downstream projects interacting with lighthouse.

rengolin · 2025-08-27T14:21:04Z

General question, shouldn't we migrate to lighthouse's gen?

That's indeed a good question. Might be a good example to dogfood how we facilitate downstream projects interacting with lighthouse.

For now, I'd duplicate, because we don't know what we can build with this work. Once we're happy with the result, we should bring this up to the lighthouse and help talk about quantization upstream.

adam-smnk

General structure seems fine.
I assume that the sequence of quant ops has been tested and is correct.

rengolin · 2025-09-01T09:35:26Z

I'd like to see at least two back-and-forth execution tests:

I8 -> F32 -> I8: results must be identical
F32 -> I8 -> F32: results should be equal to within a small delta
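
For illustration, the two round-trip properties requested above can be sketched in plain Python. This is only a sketch under assumptions: the `quantize` and `dequantize` helpers, the symmetric per-tensor `scale`, and the zero `zero_point` are hypothetical and are not the ops or APIs that mlir-gen emits.

```python
# Hypothetical helpers for illustration; not the ops or APIs used by mlir-gen.
I8_MIN, I8_MAX = -128, 127


def quantize(x, scale, zero_point=0):
    """Map a float to int8: round to the nearest step, shift, then saturate."""
    q = round(x / scale) + zero_point
    return max(I8_MIN, min(I8_MAX, q))


def dequantize(q, scale, zero_point=0):
    """Map an int8 value back to a float."""
    return (q - zero_point) * scale


scale = 0.05  # assumed per-tensor scale, chosen only for this example

# I8 -> F32 -> I8: every representable int8 value must survive unchanged.
i8_values = list(range(I8_MIN, I8_MAX + 1))
round_trip_i8 = [quantize(dequantize(q, scale), scale) for q in i8_values]
assert round_trip_i8 == i8_values

# F32 -> I8 -> F32: in-range inputs come back within half a quantization step.
f32_values = [-6.0, -1.234, 0.0, 0.026, 3.14159, 6.3]
round_trip_f32 = [dequantize(quantize(x, scale), scale) for x in f32_values]
max_err = max(abs(a - b) for a, b in zip(f32_values, round_trip_f32))
assert max_err <= scale / 2 + 1e-9
```

The "small delta" in the second case is bounded by half the scale only for inputs inside the representable range; values that saturate at -128 or 127 can differ by more.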

rengolin · 2025-09-01T09:36:16Z

Test failures are due to NFS migration and will be fixed in time

shahidact requested review from rengolin, adam-smnk, rolfmorel and arun-thmn August 26, 2025 14:17

rengolin reviewed Aug 27, 2025

tools/mlir-gen/MLIRGen.cpp (outdated)

tools/mlir-gen/MLIRGen.cpp (outdated)

adam-smnk reviewed Aug 27, 2025

test/Integration/mlir-gen-matmul.mlir (outdated)

shahidact added 2 commits August 28, 2025 00:01

Refactor the code, simplify conditional test case checks.

772484d

Extend existing createLayer() API to support quantization ops generation.

c49808a

adam-smnk approved these changes Sep 1, 2025

- Adds quant-dequant (f32->i8->f32) validation kernel generation and runtime test.
- Adds '-print-input' flag to print input arguments for visual inspection.
- Refactored and updated the corresponding APIs.

f39300c
