Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retry downloading a shard when file sizes are zero. #261

Open
alxmrs opened this issue Nov 23, 2022 · 3 comments
Open

Retry downloading a shard when file sizes are zero. #261

alxmrs opened this issue Nov 23, 2022 · 3 comments

Comments

@alxmrs
Copy link
Collaborator

alxmrs commented Nov 23, 2022

Sometimes, files downloaded by weather-dl are created in GCS, but contain zero bytes. To address this, we could:

  • Add a check after fetch is complete that checks the file size in the bucket. If zero, retry the DL.
  • When we check if the file exists or needs to be skipped, we should also check the file size.

Workarounds: Users can make use of gsutil or equivalent tools to find all empty files. Then, they can delete these and re-download (with a separate invocation).

@nagavenkateshgavini
Copy link

@alxmrs I would like to work on this, could you please assign this to me

@alxmrs
Copy link
Collaborator Author

alxmrs commented Mar 3, 2024

I think @fredzyda can help you with the assignment. Though, I don’t think any one else is working on it right now, feel free to take a crack at the implementation. :)

Before you do, check with @mahrsee to see if this is still an issue — it may have been fixed in wdl2.

@mahrsee1997
Copy link
Collaborator

mahrsee1997 commented Mar 8, 2024

We haven't observed this issue recently, and it's not implemented in WDL and WDLv2. However, implementing this would serve as a great safety check, so we should proceed with it.

@nagavenkateshgavini,

Pointers for WDL:

Pointers for WDLv2:

  • We can incorporate validation logic in main().

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants