Fix ggml-cuda using a driver symbol in NO_VMM mode #11188

Open · wants to merge 1 commit into master
Conversation

milot-mirdita
Copy link

I have integrated the ProstT5 protein language model into Foldseek. Thanks a lot for the great library! I am upstreaming a few fixes for issues I found in ggml during the integration. I hope it's okay to push the changes here and that they get synced to the main ggml repo at some point.

This fix should be simple: cuGetErrorString is part of the CUDA driver API, so it should not be referenced when GGML_CUDA_NO_VMM is set.

@github-actions bot added the labels "Nvidia GPU" (Issues specific to Nvidia GPUs) and "ggml" (changes relating to the ggml tensor library for machine learning) on Jan 11, 2025
@JohannesGaessler
Collaborator

I don't follow your logic. How are these two things related?

@milot-mirdita
Author

Would you prefer to rename GGML_CUDA_NO_VMM to GGML_CUDA_NO_DRIVER?

As far as I can tell, GGML_CUDA_NO_VMM disables the only subsystem that relies on libcuda.so; everything else relies only on the CUDA runtime.

@JohannesGaessler
Collaborator

Can you explain the software and hardware setup where such an option would be beneficial?

@milot-mirdita
Author

cudart is available as a static library. The CUDA driver exists only as a shared library (.so/.dll). Without the driver dependency I can build a (nearly) static binary that depends only on libc and can be executed on, e.g., a CPU-only system that doesn't have CUDA installed. If I have to link against libcuda.so, the binary will not run on such a system.

If you run ldd on the Foldseek binary (https://mmseqs.com/foldseek/foldseek-linux-gpu.tar.gz), you will not see any CUDA dependencies, even though it fully supports CUDA.

@slaren
Collaborator

slaren commented Jan 11, 2025

You can also avoid a hard dependency on CUDA by loading the backend dynamically. This wouldn't require disabling any features.
