
Machine Learning Examples

| Design name | Data type | Description |
|-|-|-|
| Eltwise Add | bfloat16 | An element-wise addition of two vectors |
| Eltwise Mul | i32 | An element-wise multiplication of two vectors |
| ReLU | bfloat16 | Rectified linear unit (ReLU) activation function on a vector |
| Softmax | bfloat16 | Softmax operation on a matrix |
| Conv2D | i8 | A single-core 2D convolution for CNNs |
| Conv2D+ReLU | i8 | A Conv2D with a ReLU fused at the vector register level |
| Bottleneck | ui8 | A Bottleneck Residual Block is a variant of the residual block that uses three convolutions with 1x1, 3x3, and 1x1 filter sizes, respectively. The implementation fuses multiple kernels and applies dataflow optimizations, highlighting the unique architectural capabilities of AI Engines |
| ResNet | ui8 | ResNet with offloaded conv2_x layers. The implementation runs multiple bottleneck blocks depth-first across multiple NPU columns. |
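
For readers who want a quick reference for what the simpler designs in the table compute, the following is a minimal sketch in plain Python. It is not the AI Engine implementation: the function names are hypothetical, values are ordinary Python floats and ints rather than bfloat16 or i32, and applying softmax row by row is an assumption about how "softmax on a matrix" is meant.

```python
import math

def eltwise_add(a, b):
    """Element-wise sum of two equal-length vectors (hypothetical helper)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return [x + y for x, y in zip(a, b)]

def eltwise_mul(a, b):
    """Element-wise product of two equal-length vectors (hypothetical helper)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return [x * y for x, y in zip(a, b)]

def relu(v):
    """ReLU: negative entries become 0, others pass through unchanged."""
    return [max(0.0, x) for x in v]

def softmax_rows(m):
    """Softmax applied independently to each row (assumed axis)."""
    out = []
    for row in m:
        peak = max(row)  # subtract the row maximum for numerical stability
        exps = [math.exp(x - peak) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out
```

A reference like this can be useful for checking the output of a hardware design on small inputs, provided the comparison allows for the reduced precision of bfloat16.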