Highlights of AOCL-LAPACK 5.0 release
- Improved performance of the following APIs through AVX2 and AVX512 SIMD instructions:
- Double Precision SVD (DGESVD)
- LU Factorization/Solver routines for general matrices (DGETRF, ZGETRF, DGETRS, and DGESV)
- Matrix inverse routine DGETRI for small sizes
- Least Square solver DGELS for small sizes
- Double Precision Auxiliary routine and DLARFG
- Improved performance of the following APIs using local AOCL-BLAS optimized kernels:
- LU Factorization/Solver routines for band storage matrices (DGBTRF and DGBTRS)
- Option to set specific ISA code path at runtime through the AOCL_ENABLE_INSTRUCTIONS environment variable
- Sphinx-based AOCL-LAPACK API documentation
- pkgconfig support on Linux with CMake builds
- LAPACK API modifications:
- Updated AOCL-LAPACK APIs return type to match with corresponding netlib subroutine prototypes
- Removed xerbla and lsame definition from AOCL-LAPACK. Applications must invoke lsame from the BLAS library
- Test suite framework enhancements:
- Improved accuracy tests including testing with different input generation mechanisms
- Addition of extreme values, negative, and corner test cases
- Addition of cases to test numerical stability
- Support for LAPACKE interface test