Commit 29be596
fix: undefined symbol cudaGetDriverEntryPointByVersion with CUDA >= 12.5 (flashinfer-ai#928)
## Problem:
When flashinfer is ① built with CUDA >= 12.5 (using the system-wide CUDA
toolkit under `/usr/local/cuda`) and ② run with CUDA < 12.5 (using the
`libcudart.so` from the Python environment,
`/usr/local/lib/python3.10/dist-packages/nvidia/cuda_runtime/lib/libcudart.so.12`),
it fails with an undefined symbol error for
`cudaGetDriverEntryPointByVersion`, which was introduced in CUDA 12.5.
<img width="824" alt="image"
src="https://github.com/user-attachments/assets/30322352-2cdc-45b5-adc3-2eb82fbac45e"
/>
This issue has been reported and fixed in other projects:
- cutlass: NVIDIA/cutlass#2086
- sglang: sgl-project/sglang#3372
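One way to confirm the mismatch described above is to ask a given `libcudart` whether it exports the symbol at all. The following is a minimal sketch, not part of this commit; the helper names `exports_symbol` and `check_runtime` are hypothetical, and it relies only on `ctypes` raising `AttributeError` when a symbol lookup fails.

```python
import ctypes

# Symbol named in the error message; per the commit, introduced in CUDA 12.5.
SYMBOL = "cudaGetDriverEntryPointByVersion"


def exports_symbol(lib, symbol=SYMBOL):
    """Return True if the loaded library handle `lib` exposes `symbol`."""
    try:
        getattr(lib, symbol)  # ctypes resolves the symbol on attribute access
        return True
    except AttributeError:
        return False


def check_runtime(path, symbol=SYMBOL):
    """Load the shared library at `path` and report whether it has `symbol`.

    Returns None when the library cannot be loaded at all.
    """
    try:
        lib = ctypes.CDLL(path)
    except OSError:
        return None
    return exports_symbol(lib, symbol)
```

Calling `check_runtime` on the pip-installed `libcudart.so.12` path quoted above and on the system-wide copy would be expected to show which of the two lacks the symbol, assuming both files exist on the machine.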
## Fix
This fix is a workaround that forces flashinfer to use the
system-wide CUDA toolkit, following the fix in
[sglang](sgl-project/sglang#3372). cc @zhyncs.

1 parent 1e2515e commit 29be596
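The diff body is not reproduced here, so the following is only a sketch of how such a workaround is commonly written, not the code from this commit. It assumes the system runtime lives at `/usr/local/cuda/lib64/libcudart.so.12` (an assumed path) and uses a hypothetical helper, `preload_system_cudart`, that loads it with `ctypes.RTLD_GLOBAL` before any compiled extension is imported, so that the dynamic loader reuses the already-loaded runtime.

```python
import ctypes
import os

# Assumed location of the system-wide runtime; not taken from the commit diff.
SYSTEM_CUDART = "/usr/local/cuda/lib64/libcudart.so.12"


def preload_system_cudart(path=SYSTEM_CUDART):
    """Load libcudart from `path` with globally visible symbols, if present.

    Returns the ctypes handle, or None when the file is missing or
    cannot be loaded.
    """
    if not os.path.exists(path):
        return None
    try:
        # RTLD_GLOBAL makes the library's symbols available to libraries
        # loaded afterwards, such as compiled extension modules.
        return ctypes.CDLL(path, mode=ctypes.RTLD_GLOBAL)
    except OSError:
        return None


# Would need to run before importing any extension linked against libcudart.
preload_system_cudart()
```

The helper deliberately does nothing when the file is absent, so environments without a system-wide toolkit fall back to whatever runtime the Python environment provides.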
1 file changed, +8 -0 lines changed (added lines 55-62 of the new file)