CDNA GPUs support only a wavefront size of 64, so for those GPUs allow
a sub-group size of 64. Some GPUs support both 32 and 64; for those (and
for all remaining GPUs) allow only 32.
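The rule in the commit message can be sketched as a small stand-alone check. This is only an illustration, not the clang implementation: the helper names `allowedSubGroupSize` and `checkReqdSubGroupSize` are hypothetical, and the wave64-only processor list (`gfx908`, `gfx90a`) is an assumed, non-exhaustive sample of CDNA targets. The warning text mirrors the `expected-warning` strings used in the tests.

```cpp
#include <cassert>
#include <optional>
#include <string>
#include <string_view>

// Assumption: a small, non-exhaustive sample of CDNA processors that
// only support wavefront size 64. The real list lives in the compiler.
constexpr std::string_view kWave64Only[] = {"gfx908", "gfx90a"};

// Hypothetical helper: the single sub-group size this sketch accepts
// for a given AMD target CPU (64 for wave64-only parts, otherwise 32).
inline unsigned allowedSubGroupSize(std::string_view targetCpu) {
  for (std::string_view cpu : kWave64Only)
    if (cpu == targetCpu)
      return 64;
  return 32;
}

// Hypothetical helper: returns std::nullopt when the requested size is
// accepted, otherwise the warning text (the attribute would be ignored).
inline std::optional<std::string>
checkReqdSubGroupSize(std::string_view targetCpu, unsigned requested) {
  const unsigned allowed = allowedSubGroupSize(targetCpu);
  if (requested == allowed)
    return std::nullopt;
  return "attribute argument " + std::to_string(requested) +
         " is invalid and will be ignored; amdgcn requires sub_group size " +
         std::to_string(allowed);
}

// Usage example: a wave64-only target accepts 64, any other target accepts 32.
inline void exampleUsage() {
  assert(!checkReqdSubGroupSize("gfx90a", 64));   // accepted, no warning
  assert(checkReqdSubGroupSize("gfx1030", 64));   // warned, attribute ignored
}
```

Keeping the decision in one function that returns the single allowed size means the warning can always name the size the target requires, which is what the test expectations check.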
+// Sub-group size is optimized for 32, warn (and ignore the attribute) if the
+// size is not 32.
+#include "sycl.hpp"
+
+int main() {
+
+  sycl::queue Q;
+
+  Q.submit([&](sycl::handler &h) {
+    h.single_task<class invalid_kernel>([=] [[sycl::reqd_sub_group_size(64)]] {}); // expected-warning {{attribute argument 64 is invalid and will be ignored; amdgcn requires sub_group size 32}}
+    h.single_task<class invalid_kernel_2>([=] [[sycl::reqd_sub_group_size(8)]] {}); // expected-warning {{attribute argument 8 is invalid and will be ignored; amdgcn requires sub_group size 32}}
+    h.single_task<class invalid_kernel>([=] [[sycl::reqd_sub_group_size(32)]] {}); // expected-warning {{attribute argument 32 is invalid and will be ignored; amdgcn requires sub_group size 64}}
+  });
+
+  Q.submit([&](sycl::handler &h) {
+    h.single_task<class invalid_kernel_2>([=] [[sycl::reqd_sub_group_size(8)]] {}); // expected-warning {{attribute argument 8 is invalid and will be ignored; amdgcn requires sub_group size 64}}
clang/test/SemaSYCL/reqd-sub-group-size-cuda.cpp (1 addition & 1 deletion)
@@ -9,7 +9,7 @@ int main() {
   sycl::queue Q;
 
   Q.submit([&](sycl::handler &h) {
-    h.single_task<class invalid_kernel>([=] [[sycl::reqd_sub_group_size(8)]] {}); // expected-warning {{attribute argument 8 is invalid and will be ignored; CUDA requires sub_group size 32}}
+    h.single_task<class invalid_kernel>([=] [[sycl::reqd_sub_group_size(8)]] {}); // expected-warning {{attribute argument 8 is invalid and will be ignored; nvptx requires sub_group size 32}}