Reduced-precision data types like FP16 increase performance for compute-bound tasks such as matrix multiplication, dense linear algebra, and convolutions.
Maybe adding a general abstraction for them could be useful in alpaka?
In particular, the transform and transformReduce functions in alpaka3 seem to be good candidates for reduced-precision types, because they are templated and can already work with custom data types; a rough sketch follows below.
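
As a rough illustration (placeholder signatures only, not the actual alpaka3 API), the sketch below shows why templated transform / transformReduce style functions compose well with reduced precision: the element type is just a template parameter, so FP16 can be dropped in without touching the algorithm. It assumes a compiler with C++23 `<stdfloat>` support providing `std::float16_t`; accumulating the reduction in FP32 is the usual way to limit precision loss with FP16 inputs.

```cpp
#include <cstddef>
#include <stdfloat>   // std::float16_t (C++23, compiler support required)
#include <vector>
#include <iostream>

// Hypothetical host-side stand-ins for templated transform / transformReduce.
// Because the element type T is a template parameter, the same code path
// serves FP64, FP32, or reduced-precision FP16.
template <typename T, typename UnaryOp>
void transform(std::vector<T> const& in, std::vector<T>& out, UnaryOp op)
{
    for (std::size_t i = 0; i < in.size(); ++i)
        out[i] = op(in[i]);
}

template <typename Acc, typename T, typename UnaryOp, typename BinaryOp>
Acc transformReduce(std::vector<T> const& in, Acc init, UnaryOp op, BinaryOp reduce)
{
    // Accumulate in a wider type (e.g. Acc = float) so FP16 inputs do not
    // lose too much precision over a long reduction.
    Acc acc = init;
    for (T const& x : in)
        acc = reduce(acc, static_cast<Acc>(op(x)));
    return acc;
}

int main()
{
    std::vector<std::float16_t> in(8, std::float16_t{1.5f});
    std::vector<std::float16_t> out(in.size());

    // Element-wise scaling done entirely in FP16.
    transform(in, out, [](std::float16_t x) { return x * std::float16_t{2.0f}; });

    // Square each FP16 element, but sum in FP32.
    float sumSq = transformReduce(
        in, 0.0f,
        [](std::float16_t x) { return x * x; },
        [](float a, float b) { return a + b; });

    for (auto v : out)
        std::cout << static_cast<float>(v) << ' ';
    std::cout << "\nsum of squares: " << sumSq << '\n';
}
```

If alpaka exposed something along these lines, user code could switch between FP32 and FP16 buffers by changing only the element type, while hardware with native FP16 units does the heavy lifting.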
