Skip to content

v1.2.2

Choose a tag to compare

@tprimak tprimak released this 19 Mar 07:23
· 9 commits to rls-v1.2 since this release

This is a patch release containing following changes to v1.2.1:

  • Fixed overflow in transposition in bfloat16 weights gradient convolution (0d28389)
  • Added work around corrupted unique_ptr usage in scratchpad (91c89a9)
  • Fixed int8 deconvolution with int32 output on Intel AVX2 systems (ef2d652)
  • Fixed fixed segmentation fault in concat due to incorrect memory alighment #668 (7a0c3a9)
  • Fixed performance regression in no-copy gemm dispatching #525 (89a303b)
  • Fixed segmentation fault in fp32 weights gradient convolution with dilation and large padding (50546ad)
  • Fixed bfloat16/fp32 scalability for eltwise primitive (e281a4a)