Skip to content

Extra challenge: Sliced/non-contiguous tensor broadcasting. #8

@mratsim

Description

@mratsim

The current challenge is great to stress the extensibility of a language (lifting functions on variadic container inputs) but it does not really stress performance as a naïve for loop on the buffers with the same index "i" is enough.

NdArrays/tensors are very often sliced, creating a non-contiguous view over the memory which requires a stride-aware iteration scheme which is often quite costly.

I think it would be great to have another challenge with element-wise operations on sliced tensors.

See the following benchmark from TRIOT (issue #7)

image
image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions