Replies: 1 comment
-
FYI: https://siboehm.com/articles/22/CUDA-MMM
-
Is it possible to compile a llama binary that does not require cublas64_12.dll and cublasLt64_12.dll at runtime? cudart64_12.dll is tiny, but cublas is around half a gig! I don't want to ship it with my app, nor do I want to make users install the CUDA toolkit (cublas is not installed with the usual Nvidia drivers).

I tried setting -DGGML_CUDA_FORCE_MMQ=ON, but it still crashes because it can't find cublas64_12.dll at runtime.