v0.20.2

@tprimak released this 16 Aug 22:49
· 15 commits to rls-v0.20 since this release

This is a patch release containing the following changes to Intel MKL-DNN v0.20.1:

  • Fixed an issue with bfloat16 instruction detection in Xbyak (b59bf2e)
  • Fixed an offset calculation issue in the weight update depthwise convolution fp32 and bfloat16 kernels (ddc54e5, 0982b25)
  • Added a check that the size of a generated kernel does not exceed the maximum allowed bound in fp32 forward and backward kernels (24abe20)
  • Various fixes in the RNN primitive:
    • Avoided the use of unaligned pointers in VEX instructions in the GRU cell (8eb14f5)
    • Addressed bugs in tests for RNNs (fa534ef, 3ac4db4)
    • Fixed potential integer overflow (35c5f8a)