[build] Blackwell Support LLVM20 + ARM Support #8735
base: master
…TX handling, and implement new pass manager setup
…d missing include in window_base.cpp
… environment variable settings in llvm.py
…ity; preserve bool handling and adjust metadata usage
…bility; streamline bool handling and update metadata usage
…caching; switch to nvvm_ldu instructions for better compatibility
…with invariant metadata; remove nvvm.ldg intrinsics for improved compatibility
…ldg intrinsics with standard loads; update metadata usage for invariant loads
@feisuzhu @yuanming-hu you can copy my files from here to merge for Blackwell.
Hello, thank you for your work!
Yes, it is. I have it running on Jetson and GH200.
Oh, so cool! I have seen that I may have to replace slim_libdevice.10.bc with my 12.9 version, but I have struggled to find it.
This shows CUDA 10.
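For anyone else trying to locate the toolkit's libdevice file, here is a minimal sketch, assuming a standard CUDA Toolkit layout where the bitcode sits under `nvvm/libdevice/`. The helper name `find_libdevice` and the `/usr/local/cuda` fallback are illustrative assumptions, not part of this PR. As far as I know, recent toolkits ship the file as `libdevice.10.bc` whatever the toolkit version, so the `10` in the name does not by itself mean CUDA 10. Whether that file can simply replace `slim_libdevice.10.bc` is not verified here.

```python
import os
from pathlib import Path


def find_libdevice(cuda_home=None):
    """Return libdevice bitcode files found under a CUDA Toolkit root.

    Assumption: the toolkit keeps them in <root>/nvvm/libdevice/.
    The root comes from the argument, then CUDA_HOME, then CUDA_PATH,
    then /usr/local/cuda (a hypothetical default for Linux installs).
    """
    root = Path(
        cuda_home
        or os.environ.get("CUDA_HOME")
        or os.environ.get("CUDA_PATH")
        or "/usr/local/cuda"
    )
    libdevice_dir = root / "nvvm" / "libdevice"
    if not libdevice_dir.is_dir():
        return []  # toolkit not found at this root
    return sorted(libdevice_dir.glob("libdevice*.bc"))


if __name__ == "__main__":
    # Print every candidate so the path can be checked by hand.
    for path in find_libdevice():
        print(path)
```

Running the script with `CUDA_HOME` pointing at the 12.9 install should list the candidate file or files. An empty result means the root is wrong or the layout differs from what is assumed here.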
And every time I tried to compile with the TI_WITH_CUDA_TOOLKIT flag ON and with CUDA Toolkit 12.9, I got an error. The error trace:
I opened this PR in case anyone wants to use my fork.
Thanks to the sonicflux fork as well.