Extra challenge: Sliced/non-contiguous tensor broadcasting.

The current challenge is great to stress the extensibility of a language (lifting functions on variadic container inputs) but it does not really stress performance as a naïve for loop on the buffers with the same index "i" is enough.

NdArrays/tensors are very often sliced, creating a non-contiguous view over the memory which requires a stride-aware iteration scheme which is often quite costly.

I think it would be great to have another challenge with element-wise operations on sliced tensors.

See the following benchmark from TRIOT (issue #7)

![image](https://user-images.githubusercontent.com/22738317/45947052-22680280-bff3-11e8-8d89-2c0214e62253.png)
![image](https://user-images.githubusercontent.com/22738317/45947058-27c54d00-bff3-11e8-990e-c3a5c43e5154.png)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extra challenge: Sliced/non-contiguous tensor broadcasting. #8

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Extra challenge: Sliced/non-contiguous tensor broadcasting. #8

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions