Design name | Data type | Description |
---|---|---|
Eltwise Add | bfloat16 | An element by element addition of two vectors |
Eltwise Mul | i32 | An element by element multiplication of two vectors |
ReLU | bfloat16 | Rectified linear unit (ReLU) activation function on a vector |
Softmax | bfloat16 | Softmax operation on a matrix |
Conv2D | i8 | A single core 2D convolution for CNNs |
Conv2D+ReLU | i8 | A Conv2D with a ReLU fused at the vector register level |
Bottleneck | ui8 | A Bottleneck Residual Block is a variant of the residual block that utilizes three convolutions, using 1x1, 3x3, and 1x1 filter sizes, respectively. The implementation features fusing of multiple kernels and dataflow optimizations, highlighting the unique architectural capabilities of AI Engines |
ResNet | ui8 | ResNet with offloaded conv2_x layers. The implementation features depth-first implementation of multiple bottleneck blocks across multiple NPU columns. |
ml
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||