Here is a PR: #13403
This looks better suited for an issue than a discussion. Can we close this in favor of #13422 (even though this one was started first)?
I ran into this issue when trying to run a program that does not use collective operations. I have an older NCCL version that does not have the function `ncclCommInitRankConfig`. This function appears in NCCL version 2.14.3. When loading the CUDA driver, I get an error because the symbol cannot be loaded.

In the code, if the library is not found, it is not treated as an error, and things fail only later, on channel creation. Wouldn't it be more correct to silence a symbol load failure here as well? If lazy-like loading is intended, not having the right NCCL version is the same as not having NCCL at all.
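To illustrate what I mean, here is a minimal sketch of that policy. It is written in Python with `ctypes` purely for illustration and is not the project's actual loader. The helper name `load_optional` and the library file name `libnccl.so.2` are my assumptions. The idea is that a missing library and a missing symbol both yield "not available", without raising at load time.

```python
import ctypes
from typing import Iterable, Optional


def load_optional(lib_name: Optional[str],
                  required_symbols: Iterable[str]) -> Optional[ctypes.CDLL]:
    """Hypothetical helper: return a library handle, or None if the library
    or any required symbol is missing. Neither case raises."""
    try:
        lib = ctypes.CDLL(lib_name)
    except OSError:
        # Library not found: treated as "not available", not as an error.
        return None
    for sym in required_symbols:
        try:
            # Attribute access on a CDLL triggers the symbol lookup.
            getattr(lib, sym)
        except AttributeError:
            # Symbol not found (e.g. library too old): same outcome as above.
            return None
    return lib


if __name__ == "__main__":
    # "libnccl.so.2" is an assumed file name; adjust for the local install.
    nccl = load_optional("libnccl.so.2", ["ncclCommInitRankConfig"])
    if nccl is None:
        print("NCCL unavailable or too old; fail later only if it is needed")
    else:
        print("NCCL usable")
```

With this approach a program that never uses collectives would start normally with an old NCCL, and the failure would surface only if something later actually needs the library.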