Performance Issue with Combiner Function on Large Dataset (2M Rows, 56 Columns)

I am currently using Toad version 0.1.4 and encountered a performance issue when using the combiner function on a dataset with 2 million rows and 56 columns. The execution time is taking approximately 3 days to complete, which is significantly longer than expected.

Parameters used:
          combiner.fit(X=x_train,
                       y=x_train['label'],
                       method='chi',
                       min_samples = 0.05,
                       exclude=exclude_list)
  
I wanted to inquire if:
- This performance issue has been addressed in a later version of Toad.
- There are any recommended methods or optimizations to speed up the combiner function for large datasets.
- If there are any specific configurations or parameters I can adjust to improve performance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Performance Issue with Combiner Function on Large Dataset (2M Rows, 56 Columns) #153

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Performance Issue with Combiner Function on Large Dataset (2M Rows, 56 Columns) #153

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions