Skip to content

Use same defaults as pandas.DataFrame.to_csv in kedro_datasets.pandas.csv_dataset.CSVDataset. #1073

Open
@galenseilisnh

Description

@galenseilisnh

Description

I ran a pipeline and saved the results only to find out that the index, which contained required information, was missing!

Context

Users are expecting the pandas.CSVDataset to be a shallow wrapper around pandas.DataFrame.to_csv. It essentially is, but along with that shallow wrapper expectation is the expectation that the defaults will be the same. I found it surprising that the defaults were not preserved. It isn't a huge deal, but it becomes a "gotcha" for anyone onboarding Kedro + pandas CSV.

Possible Alternatives

Naturally, users 'can' use the code in the current state. This isn't a huge problem. Just a footgun for new users.

Metadata

Metadata

Assignees

No one assigned

    Labels

    CommunityIssue/PR opened by the open-source communitygood first issueGood for newcomers

    Type

    No type

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions