Mirror intel/llvm commits #2795

kbenzie · 2025-06-25T00:42:10Z

Automated changes by create-pull-request GitHub action

github-actions · 2025-06-25T01:33:56Z

Unified Runtime -> intel/llvm Repo Move Notice

Information

The source code of Unified Runtime has been moved to intel/llvm under the unified-runtime top-level directory;
all future development will now be carried out there. This was done in intel/llvm#17043.

The code will be mirrored to oneapi-src/unified-runtime and the specification will continue to be hosted at oneapi-src.github.io/unified-runtime.

The contribution guide will be updated with new instructions for contributing to Unified Runtime.

PR Migration

All open PRs including this one will be marked with the auto-close label and shall be automatically closed after 30 days.

Should you wish to continue with your PR you will need to migrate it to intel/llvm.
We have provided a script to help automate this process.

If your PR should remain open and not be closed automatically, you can remove the auto-close label.

This is an automated comment.

Enable origin tracking for host/shared/device USM, which provides more debug information about detected uninitialized memory.

When we try to exit the program while reporting an error, `exit(1)` can cause hangs in the SYCL runtime because it skips some steps that precede the SYCL shutdown in the atexit stage. `abort()` is more appropriate here because it stops the whole process right away, without running the SYCL shutdown in an unstable, early-exited program.
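The behavioural difference between the two calls can be illustrated with a small POSIX-only sketch. It forks a child that registers an `atexit` handler and then terminates through either `std::exit(1)` or `std::abort()`; the parent reports whether the handler ran. The names `handlerRanInChild`, `onExit`, and `g_pipeFd` are hypothetical and are not taken from the sanitizer code, and the one-byte write is only a stand-in for the much larger SYCL shutdown work.

```cpp
#include <cassert>
#include <cstdlib>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

// Write end of the pipe used by the child; hypothetical name for this sketch.
static int g_pipeFd = -1;

// Stand-in for shutdown work registered in the atexit stage.
static void onExit() {
    assert(g_pipeFd >= 0); // the handler is only registered after the fd is set
    const char mark = 'x';
    if (write(g_pipeFd, &mark, 1) != 1) {
        // Nothing useful to do on failure in this sketch.
    }
}

// Forks a child that terminates via abort() or exit(1) and returns true
// when the child's atexit handler ran before the process ended.
bool handlerRanInChild(bool useAbort) {
    int fds[2];
    if (pipe(fds) != 0)
        return false;

    pid_t pid = fork();
    if (pid < 0) {
        close(fds[0]);
        close(fds[1]);
        return false;
    }
    if (pid == 0) {
        // Child: register the handler, then terminate one way or the other.
        close(fds[0]);
        g_pipeFd = fds[1];
        std::atexit(onExit);
        if (useAbort)
            std::abort(); // raises SIGABRT; atexit handlers are not run
        std::exit(1);     // runs atexit handlers before terminating
    }

    // Parent: a byte arrives only if the handler ran; otherwise read() sees EOF.
    close(fds[1]);
    char mark = 0;
    bool ran = read(fds[0], &mark, 1) == 1;
    close(fds[0]);
    int status = 0;
    waitpid(pid, &status, 0);
    return ran;
}
```

Calling `handlerRanInChild(false)` exercises the `exit(1)` path, where registered handlers still run, while `handlerRanInChild(true)` exercises the `abort()` path, where they are skipped entirely.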

Level Zero is the only backend that supports 1D fetch, but it was marked as unsupported. This PR fixes that and adds corresponding tests. As with other fetch cases, O0 builds fail on Windows for L0 using 1D fetch (see intel/llvm#18919).

---------

Signed-off-by: JackAKirk <[email protected]>

Bug 1: a leak. When the private shadow failed to allocate, the already allocated private base was not freed.
Bug 2: a leak. The old private base was never freed.
Improvement: try to reuse the private base, just as we already try to reuse the private shadow.

---------

Co-authored-by: Copilot <[email protected]>
Co-authored-by: Kenneth Benzie (Benie) <[email protected]>
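The two leak fixes can be sketched as follows. This is a minimal illustration under stated assumptions, not the sanitizer's actual code: `FakeAllocator`, `PrivateMemory`, and `allocatePrivate` are hypothetical names, and the allocator merely counts live allocations so the cleanup paths can be observed. The reuse improvement is not shown.

```cpp
#include <cassert>
#include <cstddef>
#include <new>

// Hypothetical allocator that counts live allocations and can be told to fail.
struct FakeAllocator {
    int live = 0;        // number of allocations not yet released
    int calls = 0;       // number of allocate() calls so far
    int failOnCall = -1; // 1-based index of the allocate() call that fails

    void *allocate(std::size_t size) {
        if (++calls == failOnCall)
            return nullptr;
        ++live;
        return ::operator new(size);
    }

    void release(void *ptr) {
        if (!ptr)
            return;
        assert(live > 0);
        --live;
        ::operator delete(ptr);
    }
};

// Hypothetical pair of buffers standing in for private base and private shadow.
struct PrivateMemory {
    void *base = nullptr;
    void *shadow = nullptr;
};

// Allocates a new base/shadow pair. On failure nothing is leaked and the
// previous pair is left untouched; on success the previous pair is freed.
bool allocatePrivate(FakeAllocator &alloc, PrivateMemory &mem, std::size_t size) {
    void *base = alloc.allocate(size);
    if (!base)
        return false;

    void *shadow = alloc.allocate(size);
    if (!shadow) {
        alloc.release(base); // fix for bug 1: free the base when the shadow fails
        return false;
    }

    alloc.release(mem.base); // fix for bug 2: free the old base
    alloc.release(mem.shadow);
    mem.base = base;
    mem.shadow = shadow;
    return true;
}
```

With this shape, the count of live allocations stays at two after repeated successful calls and drops back to its previous value when the second allocation fails.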

This commit changes the HIP adapter to select the correct binary for the device when a bundle contains binaries built for multiple AMDGPU architectures. Like other adapters, the HIP adapter would previously select the first 'amdgcn' binary it came across. This works for the common case where the program was compiled for a single architecture, but may fail otherwise.

To support this, the SYCL runtime passes some extra information into urDeviceSelectBinary via the pre-existing 'pNext' field of ur_device_binary_t. It does this only for the HIP backend. The HIP adapter then parses this binary information as a clang offload bundle, which conveniently contains specific triple & architecture information for each binary. For this we re-use the code that the offload adapter was using, making it common and fixing a bug in the version matching logic.
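The change in selection policy can be sketched as below. This is an illustration only: `BundleEntry`, `selectFirstAmdgcn`, and `selectForDevice` are hypothetical names that do not appear in the adapter, the entries are assumed to have already been parsed out of the offload bundle, and plain string equality on the architecture is a simplification of whatever matching the adapter really performs.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical view of one binary in a bundle after parsing.
struct BundleEntry {
    std::string triple; // target triple, e.g. "amdgcn-amd-amdhsa"
    std::string arch;   // architecture, e.g. "gfx1030"
};

// Returns true when the triple starts with "amdgcn".
static bool isAmdgcn(const BundleEntry &entry) {
    return entry.triple.compare(0, 6, "amdgcn") == 0;
}

// Old policy: take the first 'amdgcn' binary, whatever its architecture.
// Returns the index of the chosen entry, or -1 when there is none.
int selectFirstAmdgcn(const std::vector<BundleEntry> &entries) {
    for (std::size_t i = 0; i < entries.size(); ++i)
        if (isAmdgcn(entries[i]))
            return static_cast<int>(i);
    return -1;
}

// New policy: take the 'amdgcn' binary whose architecture matches the device.
// Returns the index of the chosen entry, or -1 when nothing matches.
int selectForDevice(const std::vector<BundleEntry> &entries,
                    const std::string &deviceArch) {
    assert(!deviceArch.empty());
    for (std::size_t i = 0; i < entries.size(); ++i)
        if (isAmdgcn(entries[i]) && entries[i].arch == deviceArch)
            return static_cast<int>(i);
    return -1;
}
```

For a bundle holding a gfx906 binary followed by a gfx1030 binary, the old policy always returns index 0, while the new policy returns index 1 on a gfx1030 device and reports no match on a device whose architecture is absent from the bundle.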

urDeviceGetInfo is now able to retrieve the max memory bandwidth.

---------

Signed-off-by: Zhang, Winston <[email protected]>

Instead of using a global constructor to initialize the L0 adapter, do it in the first call to `urAdapterGet`. Likewise, instead of de-initializing it in a global destructor, do it in the last call to `urAdapterRelease`. Besides avoiding L0 initialization when the user is not using L0, this also allows `urAdapterRelease` to be called from a global destructor (as the SYCL runtime does) without worrying about global destructor order.
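The pattern described here is reference-counted lazy initialization. A minimal sketch follows, assuming a single adapter instance guarded by a mutex; `AdapterState`, `adapterGet`, `adapterRelease`, and the counters are hypothetical stand-ins, not the real L0 adapter types or the `urAdapterGet`/`urAdapterRelease` signatures.

```cpp
#include <cassert>
#include <mutex>

namespace sketch {

// Hypothetical stand-in for the adapter's global state.
struct AdapterState {
    bool initialized = false;
};

std::mutex g_mutex;                // guards the fields below
unsigned g_refCount = 0;           // outstanding adapterGet() calls
AdapterState *g_adapter = nullptr; // created lazily, never by a global constructor
int g_initCalls = 0;               // how many times initialization actually ran

// Analogue of urAdapterGet: the first caller performs the initialization.
AdapterState *adapterGet() {
    std::lock_guard<std::mutex> lock(g_mutex);
    if (g_refCount++ == 0) {
        g_adapter = new AdapterState();
        g_adapter->initialized = true; // expensive driver setup would go here
        ++g_initCalls;
    }
    return g_adapter;
}

// Analogue of urAdapterRelease: the last caller tears the state down.
void adapterRelease() {
    std::lock_guard<std::mutex> lock(g_mutex);
    assert(g_refCount > 0 && "release without a matching get");
    if (--g_refCount == 0) {
        delete g_adapter;
        g_adapter = nullptr;
    }
}

} // namespace sketch
```

Because teardown is tied to the final release rather than to static destruction, a caller that releases the adapter from its own global destructor still finds the state alive, whatever order the destructors run in.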

kbenzie requested a review from a team as a code owner June 25, 2025 00:42

github-actions bot added the auto-close label Jun 25, 2025

AllanZyne and others added 10 commits June 26, 2025 00:41

Enable origin tracking (#18693)

6f62eb4

Enable origin tracking for host/shared/device USM, which provides more debug information about detected uninitialized memory.

Remove the direct use of std::mutex in L0v2 (#19100)

0b278a2

Added support to retrieve maximum memory bandwidth (#18770)

c5da29d

urDeviceGetInfo is now able to retrieve the max memory bandwidth.

---------

Signed-off-by: Zhang, Winston <[email protected]>

Remove mutex for OpenCL urContextRelease (#19090)

6fb43e8

Update intel/llvm mirror base commit to 8678a739

cb4d997

github-actions bot force-pushed the mirror-commits- branch from aa6a556 to cb4d997 Compare June 26, 2025 00:41

aarongreig merged commit 6772259 into main Jun 26, 2025
