Skip to content

AMD Optimized libFLAME Version 3.0.1

Compare
Choose a tag to compare
@pradeeptrgit pradeeptrgit released this 06 Jul 03:39
· 1011 commits to master since this release

AMD Optimized libFLAME Version 3.0.1

Highlights of improvements on AMD EPYCTM processor family CPUs

  • Improved performance of LU, QR and Cholesky Factorization
  • Improved performance of routine that computes partial LDLT factorization of a symmetric matrix using packed storage: spffrt2 and spffrtx
  • Improved performance of routine that computes complete/incomplete LU factorization without pivoting of a general matrix: getrfnp and getrfnpi
  • Library function tracing and input logging support added