Conversation

Arkar-Hema
Contributor

BackwardFoldScaleAxisToGemm

BackwardFoldScaleAxisToGemm is an optimization pass in ONNX-MLIR that merges a BatchNormalization layer into a preceding Gemm (General Matrix Multiplication) layer when specific conditions are met.
Goal: to simplify the model and improve runtime efficiency by statically folding the BatchNormalization into the Gemm operation, removing redundant computation at inference time.

Conditions:

  • The Gemm op must have transB = 0 (i.e., the weight matrix is used without transposition)
  • The BatchNormalization must be in inference mode

Original computation:
Output = BatchNorm(Gemm(X, W, B))

After pass computation (a code sketch follows the list below):
NewOutput = Gemm(X, W × γ, B × γ + β)
where:

  • X = Input to Gemm
  • W = Weights in Gemm
  • B = Bias in Gemm
  • γ = Scale in BatchNormalization
  • β = Bias in BatchNormalization
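
A minimal numeric sketch of the before/after computation (hypothetical shapes and values; as the discussion below points out, the two forms agree exactly only when the BatchNorm statistics are trivial, i.e. mean = 0, var = 1, eps = 0):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((2, 4))   # Gemm input
W = rng.standard_normal((4, 3))   # Gemm weights (transB = 0)
B = rng.standard_normal(3)        # Gemm bias
gamma = rng.standard_normal(3)    # BatchNorm scale (γ)
beta = rng.standard_normal(3)     # BatchNorm bias (β)
mean, var, eps = np.zeros(3), np.ones(3), 0.0   # trivial statistics

# Original: Output = BatchNorm(Gemm(X, W, B))
Z = X @ W + B
output = gamma * (Z - mean) / np.sqrt(var + eps) + beta

# Folded: NewOutput = Gemm(X, W × γ, B × γ + β)
new_output = X @ (W * gamma) + (B * gamma + beta)

assert np.allclose(output, new_output)  # only holds for the trivial statistics above
```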

@Arkar-Hema
Contributor Author

Can any one of the admins verify this patch?

// (BatchNorm) Y = scale * (Z - mean) / sqrt(var + eps) + bias
//
// This transformation corresponds to a recomposition:
// Y = A * (scale * B) + (scale * bias + C)
Collaborator

@Arkar-Hema could you elaborate on how you derived this formula, in which mean, var, and eps are canceled?

Contributor Author

Assume mean=0, var=1 and eps=0 (which is usually the case in pre-compiled and normalised models):

Y ≈ scale × Z + bias
Substituting Z = A × B + C:
Y ≈ scale × (A × B + C) + bias
Y ≈ A × (scale × B) + (scale × C + bias)

Collaborator

Assume mean=0, var=1 and eps=0 (which is usually the case in pre-compiled and normalised models):

Then, you have to define this assumption in the constraint part of the rewriting rule. Otherwise, the rewriting rule produces a wrong result.

Anyway, my recommendation is to handle the general case where mean, var, and eps are constants (not necessarily the concrete values 0, 1, and 0, respectively). New scale and bias values for the matmul can be easily computed from the mean, var, eps, scale, and bias of BatchNorm; in inference mode these are all constants and will be folded automatically by the compiler into a single constant.
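
For reference, a hedged sketch of that general-case fold (the variable names and helper function here are illustrative, not the pass's actual implementation):

```python
import numpy as np

def fold_batchnorm_into_gemm(W, B, gamma, beta, mean, var, eps):
    """Fold inference-mode BatchNorm into a preceding Gemm (transB = 0).

    BatchNorm(X @ W + B) = gamma * (X @ W + B - mean) / sqrt(var + eps) + beta
                         = X @ (W * s) + ((B - mean) * s + beta),  s = gamma / sqrt(var + eps)
    """
    s = gamma / np.sqrt(var + eps)
    new_W = W * s                   # scale each output channel (column) of W
    new_B = (B - mean) * s + beta   # mean/var/eps fold into the new bias
    return new_W, new_B

# Quick check with arbitrary constant statistics
rng = np.random.default_rng(1)
X, W = rng.standard_normal((2, 4)), rng.standard_normal((4, 3))
B, gamma, beta = rng.standard_normal(3), rng.standard_normal(3), rng.standard_normal(3)
mean, var, eps = rng.standard_normal(3), rng.random(3) + 0.1, 1e-5

new_W, new_B = fold_batchnorm_into_gemm(W, B, gamma, beta, mean, var, eps)
reference = gamma * (X @ W + B - mean) / np.sqrt(var + eps) + beta
assert np.allclose(X @ new_W + new_B, reference)
```

In the actual rewrite these new scale and bias values would appear as constant expressions that the compiler's constant folding reduces to single constants, as noted above.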

@jenkins-droid
Collaborator

Can one of the admins verify this patch?
