You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current challenge is great to stress the extensibility of a language (lifting functions on variadic container inputs) but it does not really stress performance as a naïve for loop on the buffers with the same index "i" is enough.
NdArrays/tensors are very often sliced, creating a non-contiguous view over the memory which requires a stride-aware iteration scheme which is often quite costly.
I think it would be great to have another challenge with element-wise operations on sliced tensors.