Implement work_group_static / work_group_scratch_memory #15061

Naghasan · 2024-08-13T20:33:38Z

The patch partially implements work_group_static and update proposal.

Implemented:

work_group_static to handle static allocation in kernel.
get_dynamic_work_group_memory to handle runtime allocation, but only on CUDA

work_group_static is implemented by exposing SYCLScope(WorkGroup), allowing the class to be decorated by the attribute and uses the same mechanism during lowering to place the variable in local memory.

get_dynamic_work_group_memory uses a new builtin function, __sycl_dynamicLocalMemoryPlaceholder , which is lowered into referencing a 0 sized array GV when targeting NVPTX. The approach for SPIR will need to differ from this lowering.

UR change oneapi-src/unified-runtime#1968

Implemented basic interface Implemented skeleton to lower dynamic case TODO: implement compile time kernel property update extension

…o place work_group_static in local memory.

Signed-off-by: Lukas Sommer <[email protected]>

sommerlukas

Changes in jit_compiler.cpp LGTM.

intel/llvm#15061 introduces a new property work_group_scratch_memory which allow the user to set a given amount of local memory to be used. In order to pass this information to the adaptor, the patch adds a new launch property to urEnqueueKernelLaunchCustomExp. The patch also changes the signature of urEnqueueKernelLaunchCustomExp to add global offset in order to maintain features when using this extension. Signed-off-by: Victor Lomuller <[email protected]>

aelovikov-intel

Do we need to add a test when the memory is requested but not used/eliminated?

sycl/source/detail/jit_compiler.cpp

intel/llvm#15061 introduces a new property work_group_scratch_memory which allow the user to set a given amount of local memory to be used. In order to pass this information to the adaptor, the patch adds a new launch property to urEnqueueKernelLaunchCustomExp. The patch also changes the signature of urEnqueueKernelLaunchCustomExp to add global offset in order to maintain features when using this extension. Signed-off-by: Victor Lomuller <[email protected]>

Naghasan · 2024-11-18T12:54:37Z

Do we need to add a test when the memory is requested but not used/eliminated?

oh missed that test ... I added one where the requested scratch memory is unused in source, thanks

intel/llvm#15061 introduces a new property work_group_scratch_memory which allow the user to set a given amount of local memory to be used. In order to pass this information to the adaptor, the patch adds a new launch property to urEnqueueKernelLaunchCustomExp. The patch also changes the signature of urEnqueueKernelLaunchCustomExp to add global offset in order to maintain features when using this extension. Signed-off-by: Victor Lomuller <[email protected]>

Naghasan added 7 commits August 5, 2024 23:00

WIP work_group_static

aa87907

Implemented basic interface Implemented skeleton to lower dynamic case TODO: implement compile time kernel property update extension

fist stab at a combined compile-runtime property

1a92e2d

clang format and use SYCLScope attr rather than local address space t…

7d686cf

…o place work_group_static in local memory.

Merge branch 'sycl' into work_group_static

e5fd086

add more testing

7746d8e

fix up interface and update extension proposal

cabdbf3

restrict test

1567b92

Naghasan mentioned this pull request Aug 13, 2024

Add new launch property to support work_group_scratch_memory oneapi-src/unified-runtime#1968

Open

sommerlukas self-requested a review August 14, 2024 07:20

Merge branch 'sycl' into work_group_static

817f293

Naghasan had a problem deploying to WindowsCILock August 14, 2024 10:01 — with GitHub Actions Failure

use https for UR

a392e4e

Naghasan had a problem deploying to WindowsCILock August 14, 2024 10:06 — with GitHub Actions Error

clang format

149f98f

Naghasan had a problem deploying to WindowsCILock August 14, 2024 10:08 — with GitHub Actions Error

clang-format using proper version

10edb4a

Naghasan had a problem deploying to WindowsCILock August 14, 2024 10:16 — with GitHub Actions Error

Naghasan added 2 commits August 14, 2024 11:47

change type for local memory size

57114bc

fix unit test

6cb38eb

Naghasan had a problem deploying to WindowsCILock August 14, 2024 11:20 — with GitHub Actions Error

fix testing

1982e4b

Naghasan had a problem deploying to WindowsCILock August 14, 2024 12:30 — with GitHub Actions Failure

Naghasan had a problem deploying to WindowsCILock August 14, 2024 14:07 — with GitHub Actions Failure

fix test and added few comments

8ddbb3c

Naghasan had a problem deploying to WindowsCILock August 14, 2024 19:30 — with GitHub Actions Failure

Naghasan temporarily deployed to WindowsCILock August 14, 2024 20:49 — with GitHub Actions Inactive

sommerlukas and others added 4 commits August 16, 2024 16:36

Split compile-time property key and value

4f465fd

Signed-off-by: Lukas Sommer <[email protected]>

quick clean up

08c716d

Merge branch 'sycl' into work_group_static

cdbd3f5

format

c65dee9

fix UR commit hash

9a505ae

Naghasan temporarily deployed to WindowsCILock November 13, 2024 14:47 — with GitHub Actions Inactive

maarquitos14 approved these changes Nov 13, 2024

View reviewed changes

Naghasan had a problem deploying to WindowsCILock November 13, 2024 15:44 — with GitHub Actions Error

Naghasan mentioned this pull request Nov 13, 2024

Investigate work_group_scratch_memory failure on gen12 #16072

Open

sommerlukas approved these changes Nov 13, 2024

View reviewed changes

add tracker and UR squash commit

9cefd48

Naghasan had a problem deploying to WindowsCILock November 13, 2024 15:56 — with GitHub Actions Error

remove eq/neq operator

eea2241

Naghasan temporarily deployed to WindowsCILock November 13, 2024 16:09 — with GitHub Actions Inactive

aelovikov-intel approved these changes Nov 13, 2024

View reviewed changes

sycl/source/detail/jit_compiler.cpp Outdated Show resolved Hide resolved

Naghasan temporarily deployed to WindowsCILock November 13, 2024 18:04 — with GitHub Actions Inactive

Naghasan added 2 commits November 18, 2024 12:52

Merge branch 'sycl' into work_group_static

c163fb1

address last feedbacks

430bd9c

Naghasan temporarily deployed to WindowsCILock November 18, 2024 12:53 — with GitHub Actions Inactive

Naghasan had a problem deploying to WindowsCILock November 18, 2024 14:15 — with GitHub Actions Error

fix save issue

44ada24

Naghasan temporarily deployed to WindowsCILock November 18, 2024 14:56 — with GitHub Actions Inactive

Naghasan temporarily deployed to WindowsCILock November 18, 2024 15:43 — with GitHub Actions Inactive

update ur tag

94bb685

Merge branch 'sycl' into work_group_static

ca0b450

Naghasan temporarily deployed to WindowsCILock November 19, 2024 15:39 — with GitHub Actions Inactive

Naghasan temporarily deployed to WindowsCILock November 19, 2024 16:55 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement work_group_static / work_group_scratch_memory #15061

Implement work_group_static / work_group_scratch_memory #15061

Naghasan commented Aug 13, 2024 •

edited

Loading

sommerlukas left a comment

aelovikov-intel left a comment

Naghasan commented Nov 18, 2024

Implement work_group_static / work_group_scratch_memory #15061

Are you sure you want to change the base?

Implement work_group_static / work_group_scratch_memory #15061

Conversation

Naghasan commented Aug 13, 2024 • edited Loading

sommerlukas left a comment

Choose a reason for hiding this comment

aelovikov-intel left a comment

Choose a reason for hiding this comment

Naghasan commented Nov 18, 2024

Naghasan commented Aug 13, 2024 •

edited

Loading