v1.2.2
This is a patch release containing following changes to v1.2.1:
- Fixed overflow in transposition in bfloat16 weights gradient convolution (0d28389)
- Added work around corrupted unique_ptr usage in scratchpad (91c89a9)
- Fixed int8 deconvolution with int32 output on Intel AVX2 systems (ef2d652)
- Fixed fixed segmentation fault in concat due to incorrect memory alighment #668 (7a0c3a9)
- Fixed performance regression in no-copy gemm dispatching #525 (89a303b)
- Fixed segmentation fault in fp32 weights gradient convolution with dilation and large padding (50546ad)
- Fixed bfloat16/fp32 scalability for eltwise primitive (e281a4a)