You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add support for in-place (and maybe not in place) methods of matmul accumulate operations. This would allow speeding up certain algorithms by keeping a running accumulator between matmul invocations.
This feature would require an option to load the out buffer to the CMMA accumulator matrices, rather than zeroing them out.
The text was updated successfully, but these errors were encountered:
Feature request
Add support for in-place (and maybe not in place) methods of matmul accumulate operations. This would allow speeding up certain algorithms by keeping a running accumulator between matmul invocations.
This feature would require an option to load the
out
buffer to the CMMA accumulator matrices, rather than zeroing them out.The text was updated successfully, but these errors were encountered: