Skip to content

[ENH] re-implement the logic of data loading to ensure good performance #1860

@workhours

Description

@workhours

the current implementation performance is poor. dataload process consumed 99% performance time while fitting.
please don't use traditional way of handling large group of time series, which organize multi-dimensions data in a 2-dimensions pd.DataFrame and manupulate it. use tensor please!
use DataFrame as input data type only, and manupulate it always by tensor or np.array, restore every usefull dimension from df columns. if DataFrame manupulation is inevitable, please use AI to find a vectorized way
never pre-process data on time series wise, thounds and thounds times of time cost there.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions