[Codegen][CPU] Enable scalable transfer lowerings #18170

MacDue · 2024-08-09T13:24:36Z

This enables a general scalable lowering for transfer_write(transpose) when ArmSME is not available. The ArmSME dialect already had its own (more specific) lowerings for cases like this, which is why these lowerings are disabled when SME is available.

Depends on: llvm/llvm-project#101353

This enables a general scalable lowering for `transfer_write(transpose)` when ArmSME is _not_ available. The ArmSME dialect already had its own (more specific) lowerings for cases like this, which is why these lowerings are disabled when SME is available. Depends on: llvm/llvm-project#101353 Signed-off-by: Benjamin Maxwell <[email protected]>

Signed-off-by: Benjamin Maxwell <[email protected]>

c-rhodes

LGTM cheers

hanhanW · 2024-08-15T16:38:41Z

compiler/src/iree/compiler/Codegen/LLVMCPU/test/vector_lowering.mlir

+// CHECK-LABEL: func.func @scalable_transpose_store
+// CHECK-NOT: vector.transpose
+// CHECK: vector.store {{.*}} : memref<?x?xf32>, vector<4xf32>
+// CHECK-NOT: vector.transpose


Why do we need the second CHECK-NOT?

CHECK-NOT only checks between two matches. The first checks between func.func @scalable_transpose_store and vector.store, the second checks from vector.store to the end of the function (IIRC).

Yeah, I understand what it is happening. My point is that the check of vector.store already shows that the lowering happens. In this case, why do we need to check if there is a vector.transpose followed by it?

I also want to check that the transpose (which is not directly supported) is eliminated.

hanhanW

LGTM, just one question about the test.

MacDue force-pushed the enable_scalable_transfers branch 3 times, most recently from 09a58cd to b28f396 Compare August 14, 2024 09:16

MacDue marked this pull request as ready for review August 14, 2024 09:16

MacDue requested review from hanhanW and MaheshRavishankar as code owners August 14, 2024 09:16

MacDue requested review from c-rhodes and banach-space August 14, 2024 09:16

MacDue added 2 commits August 15, 2024 09:08

Add test

645ab7d

Signed-off-by: Benjamin Maxwell <[email protected]>

MacDue force-pushed the enable_scalable_transfers branch from b28f396 to 645ab7d Compare August 15, 2024 09:08

c-rhodes approved these changes Aug 15, 2024

View reviewed changes

hanhanW reviewed Aug 15, 2024

View reviewed changes

hanhanW approved these changes Aug 15, 2024

View reviewed changes

c-rhodes merged commit 8a1d78b into iree-org:main Aug 16, 2024
36 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Codegen][CPU] Enable scalable transfer lowerings #18170

[Codegen][CPU] Enable scalable transfer lowerings #18170

MacDue commented Aug 9, 2024

c-rhodes left a comment

hanhanW Aug 15, 2024

MacDue Aug 15, 2024

hanhanW Aug 15, 2024

MacDue Aug 15, 2024

hanhanW left a comment

[Codegen][CPU] Enable scalable transfer lowerings #18170

[Codegen][CPU] Enable scalable transfer lowerings #18170

Conversation

MacDue commented Aug 9, 2024

c-rhodes left a comment

Choose a reason for hiding this comment

hanhanW Aug 15, 2024

Choose a reason for hiding this comment

MacDue Aug 15, 2024

Choose a reason for hiding this comment

hanhanW Aug 15, 2024

Choose a reason for hiding this comment

MacDue Aug 15, 2024

Choose a reason for hiding this comment

hanhanW left a comment

Choose a reason for hiding this comment