-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
columns expected but not found: ['structural', 'contents', 'nonstructural', 'asset_id'] #126
Comments
Is this issue still relevant? |
Apparently still relevant, for better or for wosre. Will (@wkhchow) ran into the same error (with Docker Desktop on Windows) on August 8, 2022 while testing the updates_july2022 branch at commit 141a5455b80ff3152159aad3ef634ed9bc12ba73:
Will started another run of the stack build to see if the error persists or not. |
Good news! @wkhchow reported at today's scrum that the re-run of the stack build proceeded smoothly past the DSRA import stage into the PostGIS-to-Elasticsearch export stage. So, it would appear that
Seeing how this happened twice both with DSRA_outputs2postgres_lfs.py from the OpenDRR/model-factory repo, and how DSRA_outputs2postgres_lfs.py uses the Python Moving this issue to the OpenDRR/model-factory repo for further investigation of DSRA_outputs2postgres_lfs.py (and maybe other similar scripts too). |
2022-08-09 Update: @wkhchow encountered this error too, but could not reproduce it the second time. Suspect to be an occasional network error that caused incomplete download. Moved this issue from OpenDRR/opendrr-api to OpenDRR/model-factory. Perhaps adding a fail-safe mechanism in DSRA_outputs2postgres_lfs.py (verify checksum, retry download, etc.) would mitigate the issue?
2021-06-07 Update: I wasn't able to reproduce this in my June 4 to 5 local run (using the
pipeline-optimization
branch at commit 15c2d1889c8b28a04671fbc10a5a0436ba071289). To be investigated.On 2021-05-28, Joost encountered
ValueError: Usecols do not match columns, columns expected but not found: ['structural', 'contents', 'nonstructural', 'asset_id']
during[2021-06-07 Update] Joost was using commit 6ae277be4189bfee6af52c82afde89cfdc2baabf, which is the tip branch that I renamed to
pipeline-optimization_experimental_with_bubble-merged_gen-pygeoapi-config
).It does look like something that #53 intends to fix, i.e. flexible CSV header data loading.
The same routine that Anthony ran on 2021-05-22 was successful though, and there did not seem to be any recent change to the upstream repos.
Hypothesis 1: only have PSRA data enabled in the ENV?
In Joost's
.env
file, onlyloadPsraModels
is set totrue
only; all the other load* variables are set tofalse
.... but on closer look, routines for
loadPsraModels
are run very first,And the DSRA_outputs2postgres_lfs.py comes before all of the above, so that's probably not it.
Hypothesis 2: Result of keeping volumes from previous run?
Nope, Joost nuked the volumes.
Hypothesis 3: Upstream repos changed?
Not at first glance, but maybe I missed something.
More info
Refer to the Slack DM log between Joost and me on 2021-05-21.
Failure log
The text was updated successfully, but these errors were encountered: