This is a patch release containing following changes to v0.21.2:
- Reduced the upper-bound of memory requirement for gemm-based convolution to reduce the probability of OOM error (cd99749)
- Significantly reduced the size required for 1x1 convolution (5643445)
- Added new dummy stream (cba5823)