[1pt] PR: Ripple updates inc multiproc for Ripple FIM 100 data downloads #1590
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Updated FIM 30 scripts to hand much larger ripple FIM 100 data volume.
The focus of the upgrades are
Also change the outputs to now have three output files as part of the hecras_processing.ipynb file. It takes in all of the data created by the other three scripts, then creates:
mip
,ble
orras2fim
. The "selected_source" column is the one of the three sources with the most number of models.Note: While this is made for ripple FIM 100, I made it more generic for further ripple runs or even for other similar data sources.
Additions
Changes
data/ripple
get_s3_folder.sh
: Updated from earlier version. Downloads from ripple source, calcs some metrics and re-uploads it to our FiM S3 buckets. One MC (model collection) folder at a time.get_s3_folders_from_list.sh
: A wrapper to get_s3_folder to download in bulk. This now has multi-processing capacity to speed it up significantly. It is now only limited by network speeds.hecras_processing.ipynb
: Upgraded to make the three output files. Note: Renamed from hecras_boundaries.ipynbRenaming
hecras_boundaries.ipynb
, nowhecras_processing.ipynb
Testing
Deployment Plan (For developer use)
How does the changes affect the product?
(not applicable to the FIM product and its outputs)
Issuer Checklist (For developer use)
You may update this checklist before and/or after creating the PR. If you're unsure about any of them, please ask, we're here to help! These items are what we are going to look for before merging your code.
[_pt] PR: <description>
dev
branch (the default branch), you have a descriptive Feature Branch name using the format:dev-<description-of-change>
(e.g.dev-revise-levee-masking
)dev
branchpre-commit
hooks were run locally4.x.x.x
Merge Checklist (For Technical Lead use only)