Skip to content

[FR] map should reuse unchanged columns from the previous dataset to avoid disk usage #6013

Open
@NightMachinery

Description

@NightMachinery

Feature request

Currently adding a new column with map will cause all the data in the dataset to be duplicated and stored/cached on the disk again. It should reuse unchanged columns.

Motivation

This allows having datasets with different columns but sharing some basic columns. Currently, these datasets would become too expensive to store and one would need some kind of on-the-fly join; which also doesn't seem implemented.

Your contribution

_

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestgood second issueIssues a bit more difficult than "Good First" issues

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions