Please login to access the resource
Overview

Request 691606 accepted

- Update to version 1.3
* Support for Clang/LLVM 8.0.
Bug Fixes:
* Fixed kernel debug symbol generation.
* HSA: fix kernel caching.
* Fix clCreateImage doesn't fail with unsupported image type.
* Fix handle non-kernel functions with barriers properly.
* Fix Unable to build pocl with CUDA support with LLVM 7 and host
GCC 8.2.
* Fix image format/size handling with multiple devices in context.
* Fix padding issue with context arrays that manifested as unaligned
access errors after autovectorization.
Notable Internal Changes
* Add group ids as hidden kernel arguments instead of digging
them up from the context struct.
* Ability to generate the final binary via separate assembly text +
assembler call. Useful for supporting LLVM targets without direct
binary emission support.
* Use Clang's Driver API for launching the final linkage step. This
way we utilize the toolchain registry with correct linkage steps
required for the target at hand.
* Add 'device_aux_functions' to the driver layer attributes. This
can be used to retain device-specific functions required by the
target across the pruning of unused globals.
* The "default kernels" hack which was used to store kernel metadata,
has been removed. Kernel metadata are now stored only once, in
cl_program struct; every new cl_kernel structs holds only a pointer.
* Major 'pthread' CPU driver cleanup.
* Major Workgroup.cc cleanup.
- Remove reproducible.patch (fixed upstream)

Loading...

Atri Bhattacharya's avatar

OK with me. I am not the package manager, so I am not hitting the "Accept" button myself; but feel free to take this as my review and accept it yourself. Thanks!

Request History
Martin Hauke's avatar

mnhauke created request

- Update to version 1.3
* Support for Clang/LLVM 8.0.
Bug Fixes:
* Fixed kernel debug symbol generation.
* HSA: fix kernel caching.
* Fix clCreateImage doesn't fail with unsupported image type.
* Fix handle non-kernel functions with barriers properly.
* Fix Unable to build pocl with CUDA support with LLVM 7 and host
GCC 8.2.
* Fix image format/size handling with multiple devices in context.
* Fix padding issue with context arrays that manifested as unaligned
access errors after autovectorization.
Notable Internal Changes
* Add group ids as hidden kernel arguments instead of digging
them up from the context struct.
* Ability to generate the final binary via separate assembly text +
assembler call. Useful for supporting LLVM targets without direct
binary emission support.
* Use Clang's Driver API for launching the final linkage step. This
way we utilize the toolchain registry with correct linkage steps
required for the target at hand.
* Add 'device_aux_functions' to the driver layer attributes. This
can be used to retain device-specific functions required by the
target across the pruning of unused globals.
* The "default kernels" hack which was used to store kernel metadata,
has been removed. Kernel metadata are now stored only once, in
cl_program struct; every new cl_kernel structs holds only a pointer.
* Major 'pthread' CPU driver cleanup.
* Major Workgroup.cc cleanup.
- Remove reproducible.patch (fixed upstream)


Martin Hauke's avatar

mnhauke accepted request

OK

openSUSE Build Service is sponsored by