Revisions of openblas
Jiri Srain (jsrain)
committed
(revision 3)
- Add support for gcc8/9 building (jsc#SLE-8604). Note: this submission backports a change submitted to openSUSE:Factory. It doesn't introduce any functional change for SLE-15, it is to keep things in sync for Leap - and to ship missing flavors on PH.
Gustavo Yokoyama Ribeiro (gyribeiro)
committed
(revision 2)
- Update to version 0.3.7 (jsc#SLE-8492) common: * having the gmake special variables TARGET_ARCH or TARGET_MACH defined no longer causes build failures in ctest or utest * defining NO_AFFINITY or USE_TLS to zero in gmake builds no longer has the same effect as setting them to one * a new test program was added to allow checking the library for thread safety * a new option USE_LOCKING was added to ensure thread safety when OpenBLAS itself is built without multithreading but * will be called from multiple threads. * a build failure on Linux with glibc versions earlier than 2.5 was fixed * a runtime error with CPU enumeration (and NO_AFFINITY not set) on glibc 2.6 was fixed * NO_AFFINITY was added to the CMAKE options (and defaults to being active on Linux, as in the gmake builds) x86_64 * the build-time logic for detection of AVX512 availability in the processor and compiler was fixed * gmake builds on OSX now set the internal name of the library to libopenblas.0.dylib (consistent with CMAKE) * the Haswell DGEMM kernel received a significant speedup through improved prefetch and load instructions * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly increased by avoiding vpermpd instructions * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled to fix remaining errors in DGEMM, DSYMM and DTRMM POWER: * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970 * added optimized kernels for POWER9 single and double precision complex BLAS3 * added optimized kernels for POWER9 SGEMM and STRMM
Gustavo Yokoyama Ribeiro (gyribeiro)
committed
(revision 1)
initialize package
Displaying all 3 revisions