Fix type of empty `Index` and raise warning in `Series` constructor #14116

galipremsagar · 2023-09-18T02:05:17Z

Description

Fixes: #14091
This PR fixes empty inputs dtype in Index to default to str instead of float64. Another change is there is a deprecation warning for Series constructor to match pandas.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

python/cudf/cudf/tests/test_dataframe.py

bdice · 2023-09-18T13:29:22Z

python/cudf/cudf/tests/test_dataframe.py

@@ -2840,7 +2841,7 @@ def test_series_all_null(num_elements, null_type):
 @pytest.mark.parametrize("num_elements", [0, 2, 10, 100])
 def test_series_all_valid_nan(num_elements):
    data = [np.nan] * num_elements
-    sr = cudf.Series(data, nan_as_null=False)
+    sr = _create_cudf_series(data, nan_as_null=False)


Why is this change needed?

For num_elements == 0, the null count should still be zero even if it interprets the dtype as string/object (so the test is still valid, though it might warn?).

For num_elements > 0, the data should be non-empty and thus be correctly interpreted as a floating type.

For num_elements == 0, the null count should still be zero even if it interprets the dtype as string/object (so the test is still valid, though it might warn?).

Yup, this is the reason for this change.

bdice · 2023-09-18T13:31:54Z

python/cudf/cudf/tests/test_dataframe.py

@@ -4073,28 +4074,28 @@ def test_empty_dataframe_describe():


 def test_as_column_types():
-    col = column.as_column(cudf.Series([]))
+    col = column.as_column(_create_cudf_series([]))


Same here and below -- I'm not sure if I understand why these helper functions are needed. It seems like the test construction might need subtle changes but I don't think this casting helper function is the right answer. It's not announcing what it is actually doing -- the differences between cudf.Series and _create_cudf_series should be in the name, if this is needed (like _create_cudf_series_float64_default).

Agreed, reverted the change here and passed explicit dtypes. Renamed the helper to _create_cudf_series_float64_default and _create_pandas_series_float64_default. These will be removed in pandas-2.0 upgrade.

bdice

Approving -- but let's make sure to remove these testing helper functions early in the pandas 2.0 migration.

galipremsagar · 2023-09-20T15:49:02Z

/merge

Calling `cudf.Index([])` results in `str` dtype `Index`. This PR fixes an issue with a pytorch related pytest by explicitly passing a `float64` dtype. xref: #14116 Authors: - GALI PREM SAGAR (https://github.com/galipremsagar) Approvers: - https://github.com/brandon-b-miller URL: #14198

galipremsagar added 2 commits September 17, 2023 19:00

Fix empty column type

3ce402d

Merge

24ebad8

galipremsagar added Python Affects Python cuDF API. 4 - Needs cuDF (Python) Reviewer improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Sep 18, 2023

galipremsagar self-assigned this Sep 18, 2023

galipremsagar requested a review from a team as a code owner September 18, 2023 02:05

galipremsagar requested review from shwina and charlesbluca September 18, 2023 02:05

isort

266c503

bdice requested changes Sep 18, 2023

View reviewed changes

galipremsagar added 2 commits September 18, 2023 08:38

Address reviews

e53168d

Merge

c84e594

galipremsagar requested a review from bdice September 18, 2023 15:45

bdice approved these changes Sep 20, 2023

View reviewed changes

rapids-bot bot merged commit f7ca051 into rapidsai:branch-23.10 Sep 20, 2023
54 checks passed

galipremsagar added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 4 - Needs cuDF (Python) Reviewer labels Sep 20, 2023

galipremsagar mentioned this pull request Sep 26, 2023

Fix pytorch related pytest #14198

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix type of empty `Index` and raise warning in `Series` constructor #14116

Fix type of empty `Index` and raise warning in `Series` constructor #14116

galipremsagar commented Sep 18, 2023

bdice Sep 18, 2023

galipremsagar Sep 18, 2023

bdice Sep 18, 2023

galipremsagar Sep 18, 2023

bdice left a comment •

edited

Loading

galipremsagar commented Sep 20, 2023

Fix type of empty Index and raise warning in Series constructor #14116

Fix type of empty Index and raise warning in Series constructor #14116

Conversation

galipremsagar commented Sep 18, 2023

Description

Checklist

bdice Sep 18, 2023

Choose a reason for hiding this comment

galipremsagar Sep 18, 2023

Choose a reason for hiding this comment

bdice Sep 18, 2023

Choose a reason for hiding this comment

galipremsagar Sep 18, 2023

Choose a reason for hiding this comment

bdice left a comment • edited Loading

Choose a reason for hiding this comment

galipremsagar commented Sep 20, 2023

Fix type of empty `Index` and raise warning in `Series` constructor #14116

Fix type of empty `Index` and raise warning in `Series` constructor #14116

bdice left a comment •

edited

Loading