v1.2.1
This is a patch release containing following changes to v1.2:
- Improved GEMM performance for 1 thread (1fd2bc0)
- Fixed RNN cell backpropagation computations (4b15a0c)
- Fixed alpha and beta handling in vanilla RNN cell (70f8b87)
- Reduced sizes in performance profiling example to avoid memory overflow for systems with less than 2 GB memory (f6e2ef9)
- Fix correctness for strided convolution with 1x1 filter with non-matching source and destination formats (0405c9a)
- Removed lambda calls from OpenMP loops as a workaround for Intel C/C++ Compiler 19.1 (a603593)
- Added -O1 flag for backward convolution gtests as a workaround for Intel C/C++ Compiler 19.1 (495b91f)