
Don't detect local GPU if $DS_IGNORE_CUDA_DETECTION is set#7896

Open
Flamefire wants to merge 2 commits into deepspeedai:master from Flamefire:DS_IGNORE_CUDA_DETECTION-fix

Conversation

@Flamefire
Contributor

Make this consistent across all ops:
When cross-compiling, we should not check the local GPU.

This also matters when a system has several different GPUs, because checking only GPU 0 could give the wrong result.

It looks like the checks on the CUDA version should be moved out of this code, and the checks of compute capability against CUDA version moved into filter_ccs instead.

CUDAOpBuilder.is_compatible could then call compute_capability_args to see whether the list of CCs comes back empty after filtering.
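
The proposed flow could be sketched roughly as below. This is a standalone illustration, not DeepSpeed's code: `SketchOpBuilder`, `candidate_ccs`, `MIN_CC`, and the rule for reading `DS_IGNORE_CUDA_DETECTION` are all assumptions made for the example, and only the method names `filter_ccs` and `is_compatible` echo the ones mentioned above.

```python
import os


def ignore_cuda_detection(env=None):
    """Return True if local GPU detection should be skipped.

    Assumption: any value other than unset, empty, or "0" counts as set.
    The real parsing rule in DeepSpeed may differ.
    """
    env = os.environ if env is None else env
    return env.get("DS_IGNORE_CUDA_DETECTION", "0") not in ("", "0")


class SketchOpBuilder:
    """Hypothetical stand-in for an op builder; not the DeepSpeed class."""

    # Hypothetical minimum compute capability required by this op.
    MIN_CC = (7, 0)

    def __init__(self, env, local_ccs, requested_ccs):
        self.env = env
        # Compute capabilities of every local GPU, not only device 0.
        self.local_ccs = list(local_ccs)
        # Compute capabilities requested for cross-compilation.
        self.requested_ccs = list(requested_ccs)

    def candidate_ccs(self):
        # When detection is disabled, never look at the local GPUs.
        if ignore_cuda_detection(self.env):
            return list(self.requested_ccs)
        return list(self.local_ccs)

    def filter_ccs(self, ccs):
        # Keep only the compute capabilities this op can be built for.
        return [cc for cc in ccs if cc >= self.MIN_CC]

    def is_compatible(self):
        # Compatible if at least one compute capability survives filtering.
        return len(self.filter_ccs(self.candidate_ccs())) > 0
```

With this shape, a build host with an old GPU can still report the op as compatible when cross-compiling for a newer target, and a mixed-GPU system is judged on all of its devices rather than on GPU 0 alone.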

2 participants