Open
Labels
enhancement (New feature or request)
Description
Currently, the Excel tables are converted to NDJSON and concatenated to the existing cache. If an Excel table has new columns, the curate rule will fail because augur curate has built-in validation that checks that all records have the same fields. I knew this could be an issue when initially setting up the ingest workflow, but the GISAID metadata Excel table had been very stable over the last couple of years, so I didn't add any guardrails. However, this came up when I was testing a new download that includes the new Publishing_Embargo_Until column.
TODOs
- check if concatenated NDJSONs have matching fields
- emit loud error to fix before cache gets uploaded
- add mechanism to rename/add fields to cache to match new Excel tables
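A rough sketch of what the first and third TODOs could look like, not a proposal for the final implementation. The function names (`check_matching_fields`, `add_missing_fields`), the `FieldMismatchError` class, and the empty-string fill value are all hypothetical, and this only covers adding fields, not renaming them. It assumes one JSON object per line and has not been wired into the workflow.

```python
import json


class FieldMismatchError(Exception):
    """Raised when NDJSON records do not all share the same set of fields."""


def check_matching_fields(lines):
    """Return the sorted shared field names, or raise if a record differs from the first."""
    expected = None
    for line_number, line in enumerate(lines, start=1):
        if not line.strip():
            continue  # skip blank lines
        fields = set(json.loads(line))
        if expected is None:
            expected = fields
        elif fields != expected:
            missing = sorted(expected - fields)
            extra = sorted(fields - expected)
            raise FieldMismatchError(
                f"Record on line {line_number} does not match the first record: "
                f"missing fields {missing}, unexpected fields {extra}. "
                "Fix the cache before it gets uploaded."
            )
    return sorted(expected or [])


def add_missing_fields(lines, fields, fill_value=""):
    """Yield NDJSON lines with any absent field added (hypothetical default: empty string)."""
    for line in lines:
        if not line.strip():
            continue
        record = json.loads(line)
        for field in fields:
            record.setdefault(field, fill_value)
        yield json.dumps(record)
```

The check could run right after concatenation so that a mismatch stops the workflow before the upload step, and `add_missing_fields` could then be applied to the cached records once the new column list is known.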