Skip to content

v1.2.1

Choose a tag to compare

@vpirogov vpirogov released this 26 Feb 06:28
· 19 commits to rls-v1.2 since this release

This is a patch release containing following changes to v1.2:

  • Improved GEMM performance for 1 thread (1fd2bc0)
  • Fixed RNN cell backpropagation computations (4b15a0c)
  • Fixed alpha and beta handling in vanilla RNN cell (70f8b87)
  • Reduced sizes in performance profiling example to avoid memory overflow for systems with less than 2 GB memory (f6e2ef9)
  • Fix correctness for strided convolution with 1x1 filter with non-matching source and destination formats (0405c9a)
  • Removed lambda calls from OpenMP loops as a workaround for Intel C/C++ Compiler 19.1 (a603593)
  • Added -O1 flag for backward convolution gtests as a workaround for Intel C/C++ Compiler 19.1 (495b91f)