-
Notifications
You must be signed in to change notification settings - Fork 780
Closed
Description
Targeting a very simplistic function:
func.func @add_tensors(%arg0: tensor<2x32x32x32xi1>, %arg1: tensor<2x32x32x32xi1>) -> tensor<2x32x32x32xi1> {
%result = arith.xori %arg0, %arg1 : tensor<2x32x32x32xi1>
return %result : tensor<2x32x32x32xi1>
}
compile with:
iree-compile add.mlir --iree-hal-target-backends=llvm-cpu
Don't forget this patch:
diff --git a/compiler/src/iree/compiler/Utils/ElementPackingUtils.cpp b/compiler/src/iree/compiler/Utils/ElementPackingUtils.cpp
index a7fa1c3f7b..79445148e5 100644
--- a/compiler/src/iree/compiler/Utils/ElementPackingUtils.cpp
+++ b/compiler/src/iree/compiler/Utils/ElementPackingUtils.cpp
@@ -20,7 +20,7 @@ bool needToPackSubByteElementBitWidth(unsigned bitWidth) {
// trickiness and weirdness of packing and cross-byte access.
// Also disallow boolean values for now--they may require separate interface
// choices.
- return bitWidth < 8 && llvm::isPowerOf2_32(bitWidth) && bitWidth != 1;
+ return bitWidth < 8 && llvm::isPowerOf2_32(bitWidth);
}
bool needToPackSubByteElements(RankedTensorType shapedType) {
Metadata
Metadata
Assignees
Labels
No labels