Add Nsight profiling support (nsys/ncu) #244

apaolillo · 2025-06-08T17:43:35Z

Introduced host_to_comm_path() in CommunicationLayer and implemented Docker-specific logic in DockerCommLayer to resolve host/container path mapping (e.g., for file outputs inside mounted volumes).
Added new example: campaign_nsys_ncu.py showcasing how to run GPU benchmarks with nsys and ncu wrappers, including post-run hooks for metrics extraction.
Created NsysWrap and NcuWrap. NsysWrap runs nsys profile, then extracts memory usage stats from report_cuda_gpu_mem_size_sum.csv. NcuWrap runs ncu in CSV mode and parses per-kernel metrics from the log.
Refactored AddVecBench into its own reusable file under examples/gpus/kit/addvec.py.
Updated gpus.py to install Nsight Systems and libsmctrl by default in the Docker image.
Added a realistic CUDA benchmark simplesleep.cu to simulate kernel workloads with artificial delay and multiple phases.

These changes improve support for automated GPU profiling and set the foundation for deeper performance analysis using NVIDIA's Nsight tooling inside Docker-based benchkit platforms.

- Introduced `host_to_comm_path()` in `CommunicationLayer` and implemented Docker-specific logic in `DockerCommLayer` to resolve host/container path mapping (e.g., for file outputs inside mounted volumes). - Added new example: `campaign_nsys_ncu.py` showcasing how to run GPU benchmarks with `nsys` and `ncu` wrappers, including post-run hooks for metrics extraction. - Created `NsysWrap` and `NcuWrap`. `NsysWrap` runs `nsys profile`, then extracts memory usage stats from `report_cuda_gpu_mem_size_sum.csv`. `NcuWrap` runs `ncu` in CSV mode and parses per-kernel metrics from the log. - Refactored `AddVecBench` into its own reusable file under `examples/gpus/kit/addvec.py`. - Updated `gpus.py` to install Nsight Systems and libsmctrl by default in the Docker image. - Added a realistic CUDA benchmark `simplesleep.cu` to simulate kernel workloads with artificial delay and multiple phases. These changes improve support for automated GPU profiling and set the foundation for deeper performance analysis using NVIDIA's Nsight tooling inside Docker-based benchkit platforms. Signed-off-by: Antonio Paolillo <[email protected]>

Signed-off-by: Antonio Paolillo <[email protected]>

apaolillo requested a review from aaronbog June 8, 2025 17:43

apaolillo self-assigned this Jun 8, 2025

apaolillo added 2 commits June 8, 2025 19:50

Fix import

4458f52

Signed-off-by: Antonio Paolillo <[email protected]>

Add the other way for comm<->host files

e56c9aa

Signed-off-by: Antonio Paolillo <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Nsight profiling support (nsys/ncu) #244

Add Nsight profiling support (nsys/ncu) #244

Uh oh!

apaolillo commented Jun 8, 2025

Uh oh!

Uh oh!

Add Nsight profiling support (nsys/ncu) #244

Are you sure you want to change the base?

Add Nsight profiling support (nsys/ncu) #244

Uh oh!

Conversation

apaolillo commented Jun 8, 2025

Uh oh!

Uh oh!