Skip to content

[BUG]cant install with torch-rocm #1473

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
yggdrasil75 opened this issue Mar 20, 2025 · 7 comments
Open

[BUG]cant install with torch-rocm #1473

yggdrasil75 opened this issue Mar 20, 2025 · 7 comments
Labels
bug Something isn't working

Comments

@yggdrasil75
Copy link

Describe the bug

A clear and concise description of what the bug is.

GPU Info

3 MI60s

Software Info

python 3.12.3
ubuntu 24.04
https://gist.github.com/yggdrasil75/3e593b9fcec1161ad9ed4266f487fd73

I saw another issue that said do it twice, I have done it 4 times and it still hasnt installed.

@yggdrasil75 yggdrasil75 added the bug Something isn't working label Mar 20, 2025
@Qubitium
Copy link
Collaborator

We only validate rocm for 6.3.3. Please upgrade to rocm 6.3.3.

@yggdrasil75
Copy link
Author

I have switched from 6.2.2 to 6.3.3. its still failing to build. here is just the end:
`
Successfully preprocessed all matching files.
Total number of unsupported CUDA function calls: 0

Total number of replaced kernel launches: 4
/dataset/tool/taggui/venv/lib/python3.11/site-packages/setuptools/dist.py:333: InformationOnly: Normalizing '2.1.1-dev' to '2.1.1.dev0'
self.metadata.version = self._normalize_version(self.metadata.version)
/dataset/tool/taggui/venv/lib/python3.11/site-packages/setuptools/dist.py:760: SetuptoolsDeprecationWarning: License classifiers are deprecated.
!!

      ********************************************************************************
      Please consider removing the following classifiers in favor of a SPDX license expression:

      License :: OSI Approved :: Apache Software License

      See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details.
      ********************************************************************************

!!
self._finalize_license_expression()
running clean
removing 'build/temp.linux-x86_64-cpython-311' (and everything under it)
removing 'build/lib.linux-x86_64-cpython-311' (and everything under it)
'build/bdist.linux-x86_64' does not exist -- can't clean it
'build/scripts-3.11' does not exist -- can't clean it
removing 'build'
Failed to build gptqmodel
ERROR: Could not build wheels for gptqmodel, which is required to install pyproject.toml-based projects`
I tried switching to python3.11 while posting the original message as well. is there anything else that might help resolve this?

more info about the system its being compiled on:
xeon w5-2455 is the cpu, kernel is 6.8.0-55. amd-smi version returns AMDSMI Tool: 24.7.1+8dc45db | AMDSMI Library version: 25.1.0.0 | ROCm version: 6.3.3 after the upgrade and a reboot.

@Qubitium
Copy link
Collaborator

Qubitium commented Mar 22, 2025

@yggdrasil75 Try this. I suspect you are running into bad setuptools or pip bug

pip install -U pip setuptools
pip install -e . --no-build-isolation -v

@yggdrasil75
Copy link
Author

https://gist.github.com/yggdrasil75/701ce4a32d69b8e9a7ec627dcf91702c
I updated pip and setuptools. setuptools was already updated beforehand in previous attempts.

@Qubitium
Copy link
Collaborator

@yggdrasil75 Can you make sure you have cuda-toolkit-12.4 or 12.6 installed? The error is due to cuda vs hip compat issue. I honestly don't know why it is throwing those errors. Please make sure you have a relatively new cuda (12.4, 12.6, 12.8) installed.

@yggdrasil75
Copy link
Author

I dont have an nvidia card in this system.

@yggdrasil75
Copy link
Author

I uninstalled cuda toolkit (dont know why it was there at all) and all other nvidia-cuda packages (dev, gdb, and toolkit-doc were the only other ones installed) and still same error.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants