You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a specific data transformation in mind: to skip the details, the input is a line-delimited JSON while the output is a list of partitioned files that follow an application's custom binary format.
With those specifications in mind, an Arrow dataset feels like an almost perfect fit... if I could implement my own FileFormat, that is. Would that even be possible?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I have a specific data transformation in mind: to skip the details, the input is a line-delimited JSON while the output is a list of partitioned files that follow an application's custom binary format.
With those specifications in mind, an Arrow dataset feels like an almost perfect fit... if I could implement my own
FileFormat, that is. Would that even be possible?Beta Was this translation helpful? Give feedback.
All reactions