Add python tests for Parquet DELTA_BINARY_PACKED encoder #14316
Original file line number | Diff line number | Diff line change | ||
---|---|---|---|---|
|
@@ -6370,6 +6370,8 @@ def to_parquet( | |||
max_page_size_rows=None, | ||||
storage_options=None, | ||||
return_metadata=False, | ||||
use_dictionary=True, | ||||
Can we document these two parameters here: cudf/python/cudf/cudf/utils/ioutils.py, line 222 in 16051a7
Done.
Thanks @galipremsagar. I would have never thought to look there for the docstring 😅
First time hearing about it as well 🤷♂️ (just don't git blame |
||||
header_version="1.0", | ||||
*args, | ||||
**kwargs, | ||||
): | ||||
|
@@ -6394,6 +6396,8 @@ def to_parquet( | |||
max_page_size_rows=max_page_size_rows, | ||||
storage_options=storage_options, | ||||
return_metadata=return_metadata, | ||||
use_dictionary=use_dictionary, | ||||
header_version=header_version, | ||||
*args, | ||||
**kwargs, | ||||
) | ||||
|
I realize it isn't the fault of this current PR, but one does wish `enable_write_v2_headers` were named `set_write_v2_headers`.
We use enable_ for bool options, so this should be consistent (for better or for worse, apparently).
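For readers skimming the diff, the change amounts to accepting two new keyword arguments in `to_parquet` and passing them through unchanged. The following is only a minimal sketch of that pass-through pattern in plain Python. `Frame` and `write_parquet` are hypothetical stand-ins, not cudf APIs, and the check that `header_version` is either "1.0" or "2.0" is an assumption made for illustration.

```python
# Hypothetical stand-ins; none of these names are taken from cudf itself.
_VALID_HEADER_VERSIONS = ("1.0", "2.0")  # assumption: only these values are accepted


def write_parquet(path, *, use_dictionary=True, header_version="1.0", **kwargs):
    """Stand-in for a lower-level writer; returns the options it received."""
    if header_version not in _VALID_HEADER_VERSIONS:
        raise ValueError(f"unsupported header_version: {header_version!r}")
    return {
        "path": path,
        "use_dictionary": use_dictionary,
        "header_version": header_version,
        **kwargs,
    }


class Frame:
    """Toy frame whose to_parquet forwards the two new keywords unchanged."""

    def to_parquet(self, path, use_dictionary=True, header_version="1.0", **kwargs):
        return write_parquet(
            path,
            use_dictionary=use_dictionary,
            header_version=header_version,
            **kwargs,
        )


# Usage: override both defaults and inspect what reached the writer.
opts = Frame().to_parquet("out.parquet", use_dictionary=False, header_version="2.0")
defaults = Frame().to_parquet("out.parquet")
```

Keeping the defaults (`use_dictionary=True`, `header_version="1.0"`) identical in the wrapper and the writer means existing callers see no change in behavior, which matches the defaults shown in the diff.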