Iree multi reduction support #1346

Sameeranjoshi · 2025-08-08T01:29:18Z

Single commit fix for #1306

Unroll vector.multi_reduction into vector.reduction, and convert transfer_read/transfer_write ops into loads/stores.
Add correctness checking for the Reduction tests in run.py.
Set the tiling sizes in the copy pipeline to 32, since those shapes are legalized in Peano.
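
As a rough illustration of what the unrolling means semantically (this is not the MLIR pattern from the PR), a 2-D multi-reduction over the inner dimension can be computed as one 1-D reduction per row. The sketch below models that in plain Python; the function names are hypothetical, and the add combiner with an accumulator operand is an assumption.

```python
from functools import reduce
import operator


def multi_reduction_add(matrix, acc):
    """Model of a 2-D multi-reduction <add> over the inner dimension:
    out[i] = acc[i] + sum of row i."""
    return [a + sum(row) for row, a in zip(matrix, acc)]


def unrolled_reduction_add(matrix, acc):
    """Same result, unrolled into one 1-D reduction per row, which is the
    kind of computation a single 1-D reduction op expresses."""
    out = []
    for i, row in enumerate(matrix):
        row_sum = reduce(operator.add, row, 0)  # 1-D reduction of row i
        out.append(acc[i] + row_sum)            # combine with accumulator
    return out
```

Both functions should agree on any input; the unrolled form just makes the per-row structure explicit.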

…enerates vector.extract + vector.add + vector.insert, need vector.transfer_write instead.

This is a partial fix; the program still fails with an out-of-program-memory error on a core.
1. Solves the reduction legalizer issue.
2. Converts the transfer_reads into loads.
3. Applies a 2D->1D flattening pattern on the vector type.
4. Solves the stack size problem by bumping the stack size.
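
The 2D->1D flattening step relies on a 2-D vector being viewable as a 1-D vector with the same number of elements. A minimal Python model of that index mapping follows; the helper names are hypothetical and row-major order is assumed.

```python
def flatten_2d(vec2d):
    """Flatten a rows x cols nested list into a 1-D list (row-major order)."""
    return [x for row in vec2d for x in row]


def linear_index(i, j, cols):
    """Position of element (i, j) in the flattened, row-major 1-D vector."""
    return i * cols + j
```

For example, element (1, 2) of a 2x3 vector ends up at position 1 * 3 + 2 = 5 of the flattened vector.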

1. Selected appropriate tile sizes. Peano vectorizes for <32xbf16>, so we choose the tile sizes such that we always generate legal shapes.
2. Found the bounds that work for both bf16 and f32.
3. Tested the patch and cleaned up the patterns.
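
The tile-size constraint in item 1 can be sketched as follows. This is not the selection logic from the PR, which is not shown here. The helper name is hypothetical, the default of 32 lanes comes from the <32xbf16> shape mentioned above, and the rule "a multiple of the lane count that evenly divides the dimension" is an assumption about what counts as a legal shape.

```python
def legal_tile_sizes(dim_size, lanes=32, max_tile=None):
    """List candidate tile sizes for a dimension of size `dim_size`.

    A candidate is a multiple of `lanes` that evenly divides `dim_size`
    and does not exceed `max_tile` (if given). Assumed rule, see lead-in.
    """
    upper = dim_size if max_tile is None else min(dim_size, max_tile)
    return [t for t in range(lanes, upper + 1, lanes) if dim_size % t == 0]
```

Under these assumptions, a dimension of 128 admits tiles of 32, 64 and 128, while a dimension of 48 admits none and would need padding or a different strategy.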

F32:
Correctness: Pass
Benchmark: Fails with the program memory (PM) issue (run_benchmarks=true)

BF16:
Correctness: Fails
Benchmarks: Pass (but this might not be correct, as the results are wrong)

Run commands:
python run.py delete_out_reduction $IREE_DIR --xrt_dir=$XRT_DIR --peano_dir=$PEANO_DIR \
  --target_device="npu4" --xrt_lite_n_core_rows=$XRT_LITE_N_CORE_ROWS \
  --xrt_lite_n_core_cols=$XRT_LITE_N_CORE_COLS --tests Reduction

…ests run.

newling · 2025-08-08T15:11:25Z

Nice! FFR, accompanying peano changes: Xilinx/llvm-aie#604

Sameeranjoshi added 7 commits July 28, 2025 14:41

Solved part 1 for issue, transfer_reads are now generated, next: it generates vector.extract + vector.add + vector.insert, need vector.transfer_write instead.

ba83a9d

[Fixed] Fix for out of program memory error

e55287e


Testing setup

b442141

[typo] Fix compiler build error

177fc1b

[Reduction] Fix f32 not vectorized issue also made sure correctness tests run.

047235e

Sameeranjoshi requested review from MaheshRavishankar, yzhang93, Abhishek-Varma, jtuyls, newling and Yu-Zhewen as code owners August 8, 2025 01:29

Sameeranjoshi mentioned this pull request Aug 8, 2025

Direct codegen vectorized lowering of reduction operation #1306

Closed

Merge branch 'main' into iree-multi-reduction-support

c2abb91
