Row-level test failure documentation: pointblank extension package #638
petrbouchal started this conversation in Show and tell
I have been working through a use case with the following characteristics:
This turns out to be quite difficult with the current interrogation flow: the post-interrogation agent is quite large, and one needs full-sample failure extracts (which contain the whole row, in our case with hundreds of columns) and has to compile them manually, in memory. This soon becomes unworkable in terms of memory, speed, and code maintainability.
The size of the input data can be dealt with via databases; the main problem is the size and manipulation of the failed rows.
To solve this, I created an extension package that accommodates this use case: it extracts per-row failure logs directly into a database, file, or R object without creating a large post-interrogation agent.
See https://petrbouchal.xyz/pointblankops/
The implementation is: create a lightweight agent (called an operative), then, instead of interrogating, debrief the operative. Debriefing is a lightweight version of interrogation that creates a row-level failure log instead of an agent with the report.
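To make the idea of a row-level failure log concrete, here is a minimal, language-agnostic sketch. It is written in Python with only the standard library so that it runs anywhere. It is not the pointblankops API and not how the package is implemented: the `CHECKS` list, the `log_failures` function, and the `failure_log` table layout are all hypothetical names chosen for illustration. The point is only that each failed (row, check) pair becomes one narrow log record written straight to a database, so the full failing rows are never accumulated in memory.

```python
import sqlite3

# Hypothetical checks: (check name, column, predicate returning True when the value passes).
CHECKS = [
    ("age_non_negative", "age", lambda v: v is not None and v >= 0),
    ("name_present", "name", lambda v: v is not None and v != ""),
]


def log_failures(rows, checks, conn):
    """Stream rows through the checks and write one log record per failed (row, check) pair.

    Only the row id, check name, and column name are stored, not the whole row.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS failure_log "
        "(row_id INTEGER, check_name TEXT, column_name TEXT)"
    )
    n_failures = 0
    for row in rows:
        failed = [
            (row["id"], name, column)
            for name, column, passes in checks
            if not passes(row.get(column))
        ]
        conn.executemany("INSERT INTO failure_log VALUES (?, ?, ?)", failed)
        n_failures += len(failed)
    conn.commit()
    return n_failures


# Toy input; in the real use case this would be a large table read lazily.
rows = [
    {"id": 1, "age": 34, "name": "Ana"},
    {"id": 2, "age": -5, "name": "Bo"},
    {"id": 3, "age": 20, "name": ""},
    {"id": 4, "age": -1, "name": None},
]

conn = sqlite3.connect(":memory:")
n_failures = log_failures(iter(rows), CHECKS, conn)
failure_log = conn.execute(
    "SELECT row_id, check_name FROM failure_log ORDER BY row_id, check_name"
).fetchall()
```

Because the log holds only identifiers, it stays small even when the validated table has hundreds of columns, and the full failing rows can be recovered later by joining the log back to the source table on the row id.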
If this worked as a new "validation workflow" in pointblank, I would be happy for it to be incorporated, but the code as it is now has not been tested extensively for the various pointblank scenarios.