Revisions of pocl
- Update to version 1.6 * Support for LLVM 11. * CUDA kernels using constant __local blocks are now ABI incompatible with previous release. Users need to delete their pocl cache. * Improved debugging of OpenCL code with CPU driver. * Improved the PTX code generation for __local blocks. * Improved handling of command queue barriers * Fix LLVM loop vectorizing remarks printing (POCL_VECTORIZER_REMARKS=1). * Fix an issue in which the loop vectorizer produced code with invalid memory reads (issue #757). * Fix compilation error when CMake option SINGLE_LLVM_LIB is set to OFF. * Fix wrongly output dlerror (Undefined symbol) after dlopen, caused by a previous libdl call in an ICD loader * [CPU] safety margin of pocl's CPU driver local memory allocation has been reduced to a much more reasonable value * [CPU] buffer size for OpenCL printf is now configurable with PRINTF_BUFFER_SIZE CMake variable * [CPU] local memory size reported is now the size of last level of non-shared data cache (usually L1 or L2 depending on CPU), if hwloc can determine it. - Update patch link_against_libclang-cpp_so.patch
buildservice-autocommit
accepted
request 843814
from
Martin Hauke (mnhauke)
(revision 57)
baserev update by copy to link target
- remove broken installation workaround
buildservice-autocommit
accepted
request 817238
from
Martin Hauke (mnhauke)
(revision 55)
baserev update by copy to link target
- moved pocl.icd to /usr/share/OpenCL/vendors for real ...
- Update to version 1.5 * Added support for LLVM/Clang 10.0 - adjusted link_against_libclang-cpp_so.patch - move pocl.icd from /usr/etc/OpenCL/vendors to /usr/share/OpenCL/vendors (boo#1173005)
buildservice-autocommit
accepted
request 745273
from
Martin Hauke (mnhauke)
(revision 52)
baserev update by copy to link target
- Update to version 1.4 * Support for LLVM/Clang 8.0 and 9.0 * Support for LLVM older than 6.0 has been removed. * Improved SPIR and SPIR-V support for CPU device * pocl-accel: An example driver and support infrastructure for OpenCL 1.2 CL_DEVICE_TYPE_CUSTOM hardware accelerators. - Remove upstreamed fix_resources_path_version_dependency.patch - Fix build with single-component libclang-cpp.so, add link_against_libclang-cpp_so.patch
buildservice-autocommit
accepted
request 720127
from
Martin Hauke (mnhauke)
(revision 50)
baserev update by copy to link target
- Fix build with LTO enabled - Fixup build/runtime dependencies - Correct opencl header lookup
buildservice-autocommit
accepted
request 691907
from
Martin Hauke (mnhauke)
(revision 48)
baserev update by copy to link target
- Adjust required clang version (clang < 9) since clang 8 is now supported by upstream.
- Update to version 1.3 * Support for Clang/LLVM 8.0. Bug Fixes: * Fixed kernel debug symbol generation. * HSA: fix kernel caching. * Fix clCreateImage doesn't fail with unsupported image type. * Fix handle non-kernel functions with barriers properly. * Fix Unable to build pocl with CUDA support with LLVM 7 and host GCC 8.2. * Fix image format/size handling with multiple devices in context. * Fix padding issue with context arrays that manifested as unaligned access errors after autovectorization. Notable Internal Changes * Add group ids as hidden kernel arguments instead of digging them up from the context struct. * Ability to generate the final binary via separate assembly text + assembler call. Useful for supporting LLVM targets without direct binary emission support. * Use Clang's Driver API for launching the final linkage step. This way we utilize the toolchain registry with correct linkage steps required for the target at hand. * Add 'device_aux_functions' to the driver layer attributes. This can be used to retain device-specific functions required by the target across the pruning of unused globals. * The "default kernels" hack which was used to store kernel metadata, has been removed. Kernel metadata are now stored only once, in cl_program struct; every new cl_kernel structs holds only a pointer. * Major 'pthread' CPU driver cleanup. * Major Workgroup.cc cleanup. - Remove reproducible.patch (fixed upstream)
buildservice-autocommit
accepted
request 646209
from
Martin Hauke (mnhauke)
(revision 44)
baserev update by copy to link target
Add upstream reproducible.patch to make build result independent of build system CPU (boo#1110722)
buildservice-autocommit
accepted
request 638018
from
Dirk Stoecker (dstoecker)
(revision 42)
baserev update by copy to link target
- update to version 1.2 * Support for LLVM/Clang 7.0 and 6.0 * HWLOC 2.0 support - build kernels with distro support - detect and load cpu optimized code on runtime
buildservice-autocommit
accepted
request 627128
from
Martin Hauke (mnhauke)
(revision 40)
baserev update by copy to link target
Disable compile time CPU-detection instead always asume core2 (boo#1100677)
Displaying revisions 21 - 40 of 78