Account for all the element types to determine vector sizes. (#8552)
The previous assumption was that all the element types have the same bitwidth. However, there are cases where the element types do not match, e.g., matmul i8xi8 -> i32. In those cases, overly large tiling sizes were selected, which triggered heavy optimization in LLVM.

This commit chooses the smallest vector size over all the element types. It also updates the first-level tiling logic to follow what is already done for generic ops.

The commit reduces compilation time from hours to 5 minutes for mobilebert-baseline-tf2-quant.mlir when targeting ARM.

Fixes #8540
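To illustrate the selection rule described above, here is a minimal Python sketch, not the code from this commit. The function names and the 128-bit native vector width are assumptions made for the example. It computes how many elements of each type fit in one native vector and takes the smallest count across all element types of the op.

```python
from typing import Iterable


def vector_size_in_elements(native_vector_bits: int, element_bits: int) -> int:
    """Number of elements of the given bitwidth that fit in one native vector.

    Hypothetical helper for illustration; clamps to at least 1 element.
    """
    return max(1, native_vector_bits // element_bits)


def min_vector_size(native_vector_bits: int, element_bitwidths: Iterable[int]) -> int:
    """Smallest per-type vector size over all element types of an op."""
    return min(
        vector_size_in_elements(native_vector_bits, bits)
        for bits in element_bitwidths
    )


# Assumed 128-bit native vectors; matmul i8 x i8 -> i32 has bitwidths 8, 8, 32.
NATIVE_VECTOR_BITS = 128
matmul_bitwidths = [8, 8, 32]

# Looking at a single operand type only (the old same-bitwidth assumption).
single_type_size = vector_size_in_elements(NATIVE_VECTOR_BITS, matmul_bitwidths[0])

# Accounting for every element type, including the i32 result.
all_types_size = min_vector_size(NATIVE_VECTOR_BITS, matmul_bitwidths)

print(single_type_size, all_types_size)
```

Under these assumptions, looking only at the i8 operand suggests 16 elements per vector, while accounting for the i32 result limits it to 4, which in turn yields smaller tile sizes.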
Showing 2 changed files with 97 additions and 27 deletions.