You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#18 reduces the memory usage for use cases where you need the workspace for both forward and backward (training).
I haven't done any testing but it could be possible the forward workspace is smaller than the backward one, leading to to higher memory usage than necessary in pure forward(/inference) use cases.
This would also help seperate the automatic convolution algorithm detection for those uses cases, leading to quicker startup time (this should also be possible right now by using an Algo different than Auto, but would clearer).
The text was updated successfully, but these errors were encountered:
BREAKING CHANGE: All convolution functions now require
a SharedTensor<u8> workspace to be passed.
This allows for reuse of the workspace between different
convolution operations and a global shared workspace.
REFERENCE #27
#18 reduces the memory usage for use cases where you need the workspace for both forward and backward (training).
I haven't done any testing but it could be possible the forward workspace is smaller than the backward one, leading to to higher memory usage than necessary in pure forward(/inference) use cases.
This would also help seperate the automatic convolution algorithm detection for those uses cases, leading to quicker startup time (this should also be possible right now by using an Algo different than
Auto
, but would clearer).The text was updated successfully, but these errors were encountered: