Add NpuSyncOp generation to AIEDmaToNpu #1114

AndraBisca · 2024-03-08T15:05:36Z

Designs that use objectfifos can rely on the ShimDMAAllocationOps generated by the objectfifo lowering to produce the corresponding NpuSyncOps. Other designs will still require that the sync be added explicitly.

This PR works in relation with the MLIR-AIR channel-to-objectfifo lowering to ensure that the NpuSyncOps are added at the MIR-AIE level, after the mapping decisions done by the objectififo lowering.

lib/Dialect/AIEX/Transforms/AIEDmaToIpu.cpp

reference_designs/ipu-xrt/matrix_multiplication/aie2.py

reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py

github-actions · 2024-03-08T15:18:30Z

Coverage Report

Created: 2024-06-04 22:05

Click here for information about interpreting this report.

Filename	Function Coverage	Line Coverage	Region Coverage	Branch Coverage
home/runner/work/mlir-aie/mlir-aie/lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp	100.00%	95.86%	90.65%	80.56%
Totals	100.00%	95.86%	90.65%	80.56%

Generated by llvm-cov -- llvm version 14.0.0

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py

reference_designs/ipu-xrt/matrix_multiplication/aie2.py

reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py

lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp

programming_examples/ml/bottleneck/aie2.py

programming_examples/ml/resnet/layers_conv2_x/aie2.py

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp

programming_examples/ml/bottleneck/aie2.py

programming_examples/ml/resnet/layers_conv2_x/aie2.py

programming_examples/ml/bottleneck/aie2.py

programming_examples/ml/resnet/layers_conv2_x/aie2.py

…u-sync

fifield · 2024-06-05T15:03:16Z

lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp

+    for (auto dma : dmas) {
+      if (auto infoOp =
+              getAllocOpForSymbol(shimDmaAllocOps, dma.getMetadata())) {
+        if (infoOp->getChannelDir() == AIE::DMAChannelDir::S2MM) {
+          // Found dma op copying results to host
+          OpBuilder builder(dma);
+          auto col = builder.getI32IntegerAttr(infoOp->getCol());
+          auto row = builder.getI32IntegerAttr(0);
+          auto dir = builder.getI32IntegerAttr(0);
+          auto chan = builder.getI32IntegerAttr(infoOp->getChannelIndex());
+          auto col_num = builder.getI32IntegerAttr(1);
+          auto row_num = builder.getI32IntegerAttr(1);
+          builder.setInsertionPoint(returnOp);
+          builder.create<AIEX::NpuSyncOp>(dma->getLoc(), col, row, dir, chan,
+                                          col_num, row_num);
+        }
+      }
+    }


I don't think we want to unconditionally add a sync to every outgoing memcpy. For example, what if we are collecting N output tiles at the shim and only need to sync at the end? The N-1 extraneous syncs will have a performance penalty vs. the single (manually inserted) sync at the end.

fifield · 2024-06-05T15:04:58Z

programming_examples/basic/dma_transpose/aie2.py

@@ -59,7 +59,6 @@ def sequence(A, B, C):
                npu_dma_memcpy_nd(
                    metadata="in", bd_id=1, mem=A, sizes=[1, K, M, 1], strides=[1, 1, K]
                )
-                npu_sync(column=0, row=0, direction=0, channel=0)


I'd prefer that the tests keep the sync explicit, but using aiex.npu.dma_wait instead of aiex.npu.sync

I'd prefer that the tests keep the sync explicit, but using aiex.npu.dma_wait instead of aiex.npu.sync

Should we close this PR in favor of #1791 ? Or is there still a desire to insert the sync/wait automatically?

abisca added 4 commits March 7, 2024 06:08

Add generation of ipu sync

932eea6

Move ipu_sync generationto AIEDmaToIpu and update tests

f223798

Merge branch 'main' of https://github.com/Xilinx/mlir-aie into ipu-sync

9d9f618

Revert e2e changes

9e6f582

github-actions bot reviewed Mar 8, 2024

View reviewed changes

lib/Dialect/AIEX/Transforms/AIEDmaToIpu.cpp Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 8, 2024

View reviewed changes

reference_designs/ipu-xrt/matrix_multiplication/aie2.py Outdated Show resolved Hide resolved

reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py Outdated Show resolved Hide resolved

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py Outdated Show resolved Hide resolved

AndraBisca and others added 4 commits March 8, 2024 16:25

Update lib/Dialect/AIEX/Transforms/AIEDmaToIpu.cpp

812ec66

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update reference_designs/ipu-xrt/matrix_multiplication/aie2.py

6642c30

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py

fef09fd

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py

3b88caf

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

github-actions bot reviewed Mar 8, 2024

View reviewed changes

reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py Outdated Show resolved Hide resolved

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 8, 2024

View reviewed changes

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 8, 2024

View reviewed changes

reference_designs/ipu-xrt/matrix_multiplication/aie2.py Outdated Show resolved Hide resolved

reference_designs/ipu-xrt/matrix_multiplication_column/aie2.py Outdated Show resolved Hide resolved

reference_designs/ipu-xrt/matrix_vector_multiplication/aie2.py Outdated Show resolved Hide resolved

abisca added 3 commits March 13, 2024 10:49

Merge branch 'main' of https://github.com/Xilinx/mlir-aie into ipu-sync

1521c11

Merge fixes

9fb0964

Fixed conflicts

56cf237

github-actions bot reviewed Jun 4, 2024

View reviewed changes

lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp Outdated Show resolved Hide resolved

github-actions bot reviewed Jun 4, 2024

View reviewed changes

programming_examples/ml/bottleneck/aie2.py Outdated Show resolved Hide resolved

programming_examples/ml/resnet/layers_conv2_x/aie2.py Outdated Show resolved Hide resolved

abisca and others added 4 commits June 4, 2024 15:03

Remove conflicts

0133741

Update lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp

6b05733

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update programming_examples/ml/bottleneck/aie2.py

9f72509

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update programming_examples/ml/resnet/layers_conv2_x/aie2.py

5122553

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

github-actions bot reviewed Jun 4, 2024

View reviewed changes

lib/Dialect/AIEX/Transforms/AIEDmaToNpu.cpp Outdated Show resolved Hide resolved

github-actions bot reviewed Jun 4, 2024

View reviewed changes

programming_examples/ml/bottleneck/aie2.py Outdated Show resolved Hide resolved

programming_examples/ml/resnet/layers_conv2_x/aie2.py Outdated Show resolved Hide resolved

github-actions bot reviewed Jun 4, 2024

View reviewed changes

programming_examples/ml/resnet/layers_conv2_x/aie2.py Outdated Show resolved Hide resolved

github-actions bot reviewed Jun 4, 2024

View reviewed changes

programming_examples/ml/bottleneck/aie2.py Outdated Show resolved Hide resolved

programming_examples/ml/resnet/layers_conv2_x/aie2.py Outdated Show resolved Hide resolved

abisca added 3 commits June 4, 2024 15:08

Remove leftover syncs

6edffdc

Merge branch 'ipu-sync' of https://github.com/Xilinx/mlir-aie into ip…

5f98b63

…u-sync

Removed another sync

56eddbb

AndraBisca marked this pull request as ready for review June 4, 2024 22:04

AndraBisca requested review from denolf, jgmelber, fifield and makslevental as code owners June 4, 2024 22:04

fifield changed the title ~~Add IpuSyncOp generation to AIEDmaToIpu~~ Add NpuSyncOp generation to AIEDmaToNpu Jun 5, 2024

fifield requested changes Jun 5, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NpuSyncOp generation to AIEDmaToNpu #1114

Add NpuSyncOp generation to AIEDmaToNpu #1114

AndraBisca commented Mar 8, 2024 •

edited

Loading

github-actions bot commented Mar 8, 2024 •

edited

Loading

fifield Jun 5, 2024

fifield Jun 5, 2024

fifield Sep 25, 2024

Add NpuSyncOp generation to AIEDmaToNpu #1114

Are you sure you want to change the base?

Add NpuSyncOp generation to AIEDmaToNpu #1114

Conversation

AndraBisca commented Mar 8, 2024 • edited Loading

github-actions bot commented Mar 8, 2024 • edited Loading

Coverage Report

Created: 2024-06-04 22:05

Generated by llvm-cov -- llvm version 14.0.0

fifield Jun 5, 2024

Choose a reason for hiding this comment

fifield Jun 5, 2024

Choose a reason for hiding this comment

fifield Sep 25, 2024

Choose a reason for hiding this comment

AndraBisca commented Mar 8, 2024 •

edited

Loading

github-actions bot commented Mar 8, 2024 •

edited

Loading