Add `--target-cuda` argument for selecting CUDA architecture #2478

vlad-perevezentsev · 2025-06-10T19:33:25Z

This PR suggests adding --target-cuda argument to scripts/build_locally.py allowing to enable CUDA support and optionally specify the target architecture (e.g. sm_80).
If no architecture is specified, sm_50 is used by default.

$ python scripts/build_locally.py --target-cuda
# or
$ python scripts/build_locally.py --target-cuda=<arch>

The specified architecture is used to construct a SYCL alias target (e.g. nvidia_gpu_sm_80) and passed via -fsycl-targets option, following OneAPI for NVIDIA GPUs

Have you provided a meaningful PR description?
Have you added a test, reproducer or referred to an issue with a reproducer?
Have you tested your changes locally for CPU and GPU devices?
Have you made sure that new changes do not introduce compiler warnings?
Have you checked performance impact of proposed changes?
Have you added documentation for your changes, if necessary?
Have you added your changes to the changelog?

T_CUDA

github-actions · 2025-06-10T19:52:58Z

View rendered docs @ https://intelpython.github.io/dpnp/pull/2478/index.html

CMakeLists.txt

antonwolfy · 2025-06-10T19:47:00Z

CMakeLists.txt

@@ -87,8 +91,18 @@ set(_dpnp_sycl_target_compile_options)
 set(_dpnp_sycl_target_link_options)

 if ("x${DPNP_SYCL_TARGETS}" STREQUAL "x")
-    if(DPNP_TARGET_CUDA)
-        set(_dpnp_sycl_targets "nvptx64-nvidia-cuda,spir64-unknown-unknown")
+    if (DPNP_TARGET_CUDA)


It is not OFF by default now. Should this be updated?

Suggested change

if (DPNP_TARGET_CUDA)

if (NOT "x${DPNP_TARGET_CUDA}" STREQUAL "x")

The empty string is False for this check if (DPNP_TARGET_CUDA)
I added this check in case when DPNP_TARGET_CUDA is passed as 0, OFF, NO, FALSE, N via cmake-opts argument

in case when DPNP_TARGET_CUDA is passed as 0, OFF, NO, FALSE, N via cmake-opts argument

That is not the case when DPNP_TARGET_CUDA passed as an empty string. So it's still unclear for me.
Per my understanding the string can't be empty due to the check.

You are right that --target-cuda= is checked in build_locally.py.
But if someone bypasses it via --cmake-opts="-DDPNP_TARGET_CUDA=" the empty string is still evaluated as FALSE in if(DPNP_TARGET_CUDA). Thus this condition safely handles both cases.
Using if (NOT "x${DPNP_TARGET_CUDA}" STREQUAL "x") would only check for non-empty strings but still treat values like OFF or 0 as TRUE

doc/quick_start_guide.rst

CMakeLists.txt

github-actions · 2025-06-10T20:06:05Z

Array API standard conformance tests for dpnp=0.19.0dev0=py312h509198e_18 ran successfully.
Passed: 1231
Failed: 0
Skipped: 9

doc/quick_start_guide.rst

CHANGELOG.md

vlad-perevezentsev added 5 commits June 10, 2025 11:12

Add sm_* offload arch support to DPNP_TARGE

b6ed7f6

T_CUDA

Enable CUDA architecture selection via --target-cuda

25bf7b9

Raise RuntimeError if onemkl_interfaces_dir passed

b0bd17c

Clarify --target-cuda help message

e0dae0e

Update CUDA build docs

c670477

vlad-perevezentsev requested review from antonwolfy, AlexanderKalistratov, vtavana and ndgrigorian as code owners June 10, 2025 19:33

antonwolfy reviewed Jun 10, 2025

View reviewed changes

antonwolfy added this to the 0.19.0 release milestone Jun 10, 2025

antonwolfy reviewed Jun 10, 2025

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

vlad-perevezentsev added 4 commits June 11, 2025 03:38

Add CUDA and AMD build subchapters to docs

dbdd077

Clarify SYCL alias target usage

b08c0e5

Apply remarks

117f6a5

Update changelog

2a03eba

antonwolfy reviewed Jun 11, 2025

View reviewed changes

doc/quick_start_guide.rst Outdated Show resolved Hide resolved

doc/quick_start_guide.rst Outdated Show resolved Hide resolved

CHANGELOG.md Outdated Show resolved Hide resolved

Apply remarks

39b62e2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `--target-cuda` argument for selecting CUDA architecture #2478

Add `--target-cuda` argument for selecting CUDA architecture #2478

Uh oh!

vlad-perevezentsev commented Jun 10, 2025

Uh oh!

github-actions bot commented Jun 10, 2025

Uh oh!

Uh oh!

antonwolfy Jun 10, 2025

Uh oh!

vlad-perevezentsev Jun 11, 2025

Uh oh!

antonwolfy Jun 11, 2025

Uh oh!

vlad-perevezentsev Jun 12, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jun 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

	if (DPNP_TARGET_CUDA)
	if (NOT "x${DPNP_TARGET_CUDA}" STREQUAL "x")

Add --target-cuda argument for selecting CUDA architecture #2478

Are you sure you want to change the base?

Add --target-cuda argument for selecting CUDA architecture #2478

Uh oh!

Conversation

vlad-perevezentsev commented Jun 10, 2025

Uh oh!

github-actions bot commented Jun 10, 2025

Uh oh!

Uh oh!

antonwolfy Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

vlad-perevezentsev Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

antonwolfy Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

vlad-perevezentsev Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add `--target-cuda` argument for selecting CUDA architecture #2478

Add `--target-cuda` argument for selecting CUDA architecture #2478

github-actions bot commented Jun 10, 2025 •

edited

Loading