Skip to content

AOCL-LAPACK 5.0

Latest
Compare
Choose a tag to compare
@pradeeptrgit pradeeptrgit released this 11 Oct 05:25

Highlights of AOCL-LAPACK 5.0 release

  • Improved performance of the following APIs through AVX2 and AVX512 SIMD instructions:
    • Double Precision SVD (DGESVD)
    • LU Factorization/Solver routines for general matrices (DGETRF, ZGETRF, DGETRS, and DGESV)
    • Matrix inverse routine DGETRI for small sizes
    • Least Square solver DGELS for small sizes
    • Double Precision Auxiliary routine and DLARFG
  • Improved performance of the following APIs using local AOCL-BLAS optimized kernels:
    • LU Factorization/Solver routines for band storage matrices (DGBTRF and DGBTRS)
  • Option to set specific ISA code path at runtime through the AOCL_ENABLE_INSTRUCTIONS environment variable
  • Sphinx-based AOCL-LAPACK API documentation
  • pkgconfig support on Linux with CMake builds
  • LAPACK API modifications:
    • Updated AOCL-LAPACK APIs return type to match with corresponding netlib subroutine prototypes
    • Removed xerbla and lsame definition from AOCL-LAPACK. Applications must invoke lsame from the BLAS library
  • Test suite framework enhancements:
    • Improved accuracy tests including testing with different input generation mechanisms
    • Addition of extreme values, negative, and corner test cases
    • Addition of cases to test numerical stability
    • Support for LAPACKE interface test