Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
✨ support Dataframe interchange protocol (#1509)
* 🐛 handle offset for categories * Draft for the Dataframe interchange protocol * Adding test for virtual column plus typo. * Roundtrip test change plus some corrections in functions parameters * Apply suggestions from code review * Dtype for arrow dict plus use of arrow dict in convert_categorical_column * Add missing value handling * Added chunk handling and tests * Corrected usage of metadata for categories * Applying changes from general dataframe protocol * Delete copy error * Change sentinel value handling in convert_categorical_column * Add select_columns() and test * Update to _get_data_buffer() for Arrow Dictionary * Minor commenting changes * Correct typo error * Add _VaexBuffer test * Add tests and correction for _VaexColumn * Added tests for _VaexDataFrame * Added more tests and one correction for format_str * format to LF and black * support passing in allow_copy * correct descibe_null for arrow and numpy * correct _get_validity_buffer to match describe_null * correct describe_null, convert_categorical_column and test_categorical_ordinal for categorical dtypes * Apply suggestions from code review * correct get_chunks for _VaexDataFrame * Replace return with yield in get_chunks * Check for LF and run black with -l 220 * Black with line length 220 * Add string dtype support * Add Arrow Dict check to describe_categorical * avoid copying data for strings * small fix * also test sliced dataframe * test that we do not copy data * Apply string no-mem copy suggestions * fix and test get_chunks * use future ordinal encoding feature * make test work with dict encoded Co-authored-by: Maarten A. Breddels <[email protected]> Co-authored-by: Alenka Frim <[email protected]>
- Loading branch information