Rework Workflow #4
valentinedwv started this conversation in Ideas
I'm thinking that we might consider reworking the workflow.
Right now we apply a few corrections, such as fixing the context, when we put the harvested JSON-LD documents in the summoned s3 path.
Instead, we might put the uncorrected documents in the summoned path, then have a processed/enhanced s3 path where the documents are corrected and perhaps formatted into a single style (compact or flattened). Then have a standardized path that holds further-processed documents, ready to be loaded.
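To make the "context correction" step concrete, here is a minimal sketch of what moving a document from summoned to processed might involve. It is only an illustration: the function name `correct_context`, the choice of `https://schema.org/` as the canonical form, and the list of variants are all assumptions, not what gleaner currently does.

```python
# Assumed canonical context; the discussion does not say which form is preferred.
CANONICAL_SCHEMA_ORG = "https://schema.org/"

# Variants treated as equivalent for this sketch (illustrative, not exhaustive).
SCHEMA_ORG_VARIANTS = {
    "http://schema.org",
    "http://schema.org/",
    "https://schema.org",
    "https://schema.org/",
}


def _fix_value(value):
    """Replace a schema.org variant with the canonical form; pass others through."""
    if isinstance(value, str) and value.strip() in SCHEMA_ORG_VARIANTS:
        return CANONICAL_SCHEMA_ORG
    return value


def correct_context(doc: dict) -> dict:
    """Return a shallow copy of doc with its @context normalized.

    Handles the three shapes a JSON-LD @context can take: string, object, array.
    The input document is left unchanged, mirroring the idea that the
    uncorrected copy stays in the summoned path.
    """
    fixed = dict(doc)
    ctx = doc.get("@context")
    if isinstance(ctx, str):
        fixed["@context"] = _fix_value(ctx)
    elif isinstance(ctx, dict):
        fixed["@context"] = {k: _fix_value(v) for k, v in ctx.items()}
    elif isinstance(ctx, list):
        fixed["@context"] = [
            {k: _fix_value(v) for k, v in c.items()} if isinstance(c, dict) else _fix_value(c)
            for c in ctx
        ]
    return fixed
```

Because the function returns a copy, the original harvested document can be stored as-is in summoned while the corrected one is written to processed.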
summoned > Context and other processing > processed
-- this would be a valid document that can be directly converted to RDF reliably.
processed > standardization processing > standardized
standardized > transform > other metadata formats

Standardizing:
Standardization is for our own systems. It might even be accomplished with a set of s3 events and serverless lambda functions, which would be a 'tool' in gleaner.
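As a rough sketch of the s3 event idea, the handler below reads an S3 object-created notification and works out where the next stage's copy would go. The key layout (`summoned/<source>/<file>`), the `NEXT_STAGE` table, and the `process` callback are hypothetical names introduced for illustration; only the event shape (`Records[].s3.bucket.name`, `Records[].s3.object.key`, with URL-encoded keys) follows the S3 notification format.

```python
from urllib.parse import unquote_plus

# Hypothetical stage order, taken from the path names used in this discussion.
NEXT_STAGE = {"summoned": "processed", "processed": "standardized"}


def next_stage_key(key: str):
    """Map e.g. 'summoned/source/doc.jsonld' to 'processed/source/doc.jsonld'.

    Returns None when the key is not under a stage that has a successor.
    """
    stage, sep, rest = key.partition("/")
    if not sep or stage not in NEXT_STAGE:
        return None
    return f"{NEXT_STAGE[stage]}/{rest}"


def handler(event, context=None, process=None):
    """Lambda-style entry point for S3 object-created notifications.

    `process(bucket, src_key, dst_key)` is a hypothetical callback that would
    read the source object, transform it, and write the result.
    Returns the list of (bucket, src_key, dst_key) moves that were planned.
    """
    planned = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        src_key = unquote_plus(record["s3"]["object"]["key"])
        dst_key = next_stage_key(src_key)
        if dst_key is None:
            continue  # not in a stage we advance from
        if process is not None:
            process(bucket, src_key, dst_key)
        planned.append((bucket, src_key, dst_key))
    return planned
```

Keeping the actual read/transform/write behind a callback means the same routing logic could be driven by a lambda or by a command-line tool in gleaner.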
@fils @nein09 any thoughts?