I am currently using Toad version 0.1.4 and encountered a performance issue when using the combiner function on a dataset with 2 million rows and 56 columns. The execution time is taking approximately 3 days to complete, which is significantly longer than expected.
Parameters used:
combiner.fit(X=x_train,
y=x_train['label'],
method='chi',
min_samples = 0.05,
exclude=exclude_list)
I wanted to inquire if:
- This performance issue has been addressed in a later version of Toad.
- There are any recommended methods or optimizations to speed up the combiner function for large datasets.
- If there are any specific configurations or parameters I can adjust to improve performance.