Compile without cublas dlls? #10988

vladfaust · 2024-12-26T18:50:09Z


Is it possible to compile a llama binary that doesn't require cublas64_12.dll and cublasLt64_12.dll at runtime? cudart64_12.dll is tiny, but cublas is around half a gig! I don't want to ship it with my app, nor do I want to make users install the CUDA Toolkit (cublas isn't included with the regular Nvidia drivers).

I tried setting -DGGML_CUDA_FORCE_MMQ=ON, but the binary still crashes at runtime because it can't find cublas64_12.dll.

vladfaust · 2025-01-12T13:21:38Z


FYI: https://siboehm.com/articles/22/CUDA-MMM.
Just saying that it's theoretically possible to get rid of cublas in favor of custom kernels!
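
To make "custom kernels" a bit more concrete: what cublas mainly provides here is matrix multiplication (GEMM), and a hand-written kernel would need to compute the same C = A * B. Below is a minimal CPU-side C++ sketch of a tiled GEMM. It is not CUDA, it is not llama.cpp code, and `tiled_sgemm` is a hypothetical name made up for illustration. It only shows the tile decomposition that a GPU kernel would typically map onto thread blocks and shared memory.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration only (not part of llama.cpp or cublas).
// Computes C = A * B for row-major matrices:
//   A is M x K, B is K x N, the returned C is M x N.
// The loops are split into TILE x TILE blocks; on a GPU each (i0, j0) block
// would roughly correspond to one thread block working on one tile of C.
std::vector<float> tiled_sgemm(const std::vector<float>& A,
                               const std::vector<float>& B,
                               std::size_t M, std::size_t N, std::size_t K,
                               std::size_t TILE = 32) {
    assert(TILE > 0);
    assert(A.size() == M * K && B.size() == K * N);

    std::vector<float> C(M * N, 0.0f);
    for (std::size_t i0 = 0; i0 < M; i0 += TILE) {
        for (std::size_t j0 = 0; j0 < N; j0 += TILE) {
            for (std::size_t k0 = 0; k0 < K; k0 += TILE) {
                // Clamp tile bounds so sizes need not be multiples of TILE.
                const std::size_t i1 = std::min(i0 + TILE, M);
                const std::size_t j1 = std::min(j0 + TILE, N);
                const std::size_t k1 = std::min(k0 + TILE, K);
                // Accumulate the partial product of one A tile and one B tile.
                for (std::size_t i = i0; i < i1; ++i) {
                    for (std::size_t k = k0; k < k1; ++k) {
                        const float a = A[i * K + k];
                        for (std::size_t j = j0; j < j1; ++j) {
                            C[i * N + j] += a * B[k * N + j];
                        }
                    }
                }
            }
        }
    }
    return C;
}
```

Calling `tiled_sgemm(A, B, M, N, K)` with row-major `std::vector<float>` inputs returns the product as a new vector. Getting anywhere near cublas speed on a GPU takes considerably more work than this, which is what the linked article walks through.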
