-
Notifications
You must be signed in to change notification settings - Fork 687
Error encountered while using huggingface-cli to download datasets #2936
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @GuocunWang sorry you've encountered this issue. Can you post a full stacktrace of this error? I'm particularly interested in getting a request ID (that is logged in the error message) to investigate on our side. |
@Wauplin
However, I was unable to find the request ID in the logs, which I understand is crucial for investigating the issue further. Here is the full error stack trace I received:
Unfortunately, no request ID is provided in the error message. Could you please assist me in resolving this issue or advise if there is any other information I can provide? Thank you for your support, and I look forward to your guidance on how to proceed. |
@GuocunWang thanks for providing this! Very helpful! It turns out that you are using hf-mirror.com mirror site maintained by @padeoe. Unfortunately there is nothing much we can do on our side as we don't manage this CDN. @padeoe any idea what can be done? |
Hi @GuocunWang @Wauplin ,It appears that the To address this issue, I have added an exception in our system for the specific path you mentioned (https://cdn-lfs.hf-mirror.com/repos/d3/36/d336e691db6b26b29482553af06f12dcdf6030d98dd09e339e447f224906b13e/*). This should help mitigate the rate limiting issue and allow you to download the files more smoothly. If you continue to experience issues, please let us know. |
Describe the bug
When I download the HSSD dataset, I use the following command:
huggingface-cli download --repo-type dataset --resume-download hssd/hssd-hab --local-dir . /data/hssd-hab
The following error was encountered: huggingface_hub.errors.HfHubHTTPError: 429 Client Error: Too Many Requests for url.
This should be due to a large number of small files in the dataset resulting in too many download requests. Is there a proper way to download it please?
Reproduction
No response
Logs
System info
The text was updated successfully, but these errors were encountered: