Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Adding logs & the ability to specify categorical data * Fixes categorical features * Changing getsizeof to nbytes * Delete _clean_extra_spaces * Adding more logs, adding OptimisedFeature storage, refactoring fitting BinaryCategoricalPreprocessor, fix bugs, adding reduce memory size, delete clean_extra_spaces * @Lopa10ko requested changes * Fix bug with nbytes * Fix bug with cat_features_names if there aren't exists features_names * Adding reduce_memory_size to pipeline._preprocess * Return to Pandas for nan_matrix * Change logic of _into_categorical_features_transformation_for_fit * Adding convert to np.array * Update ImputationImplementation * Fix bug in BinaryCategorical * Fix bug with test_data_from_csv_load_correctly * Fix bug with test_api_fit_predict_with_pseudo_large_dataset_with_label_correct * Fix bug with test_pipeline_preprocessing_through_api_correctly * Fix bug with test_default_forecast (add new TODO for ts_forecasting) * Fix bug with test_cv_multiple_metrics_evaluated_correct by adding copy method to OptimisedFeature * Fix bug with test_regression_pipeline_with_data_operation_fit_predict_correct by adding check for target * Fix bug in test_default_train_test_simple with nbytes * Fix bugs with str* types in features * Fix bug with test_inf_and_nan_absence_after_imputation_implementation_fit_transform by adding cat and num idx in get_dataset func * Fix bug with test_pipeline_objective_evaluate_with_different_metrics by switching Xgboost to Catboost, due to "Experimental support for categorical data is not implemented for current tree method yet." for XgBoost and checking feat ids with size * Fix bug with test_order_by_data_flow_len_correct * Fix bug with test_pipeline_with_imputer (finally) * Fix bug with test_correct_api_dataset_with_text_preprocessing by update col_type regex rule for str* * Update for OneHotImplementation * Update for subset_features and post_init * Update data_has_categorical_features * Adding bool to numerical * Update for ImputationImplementation * Fix data for tests * Fix test with adding new types * Update test with deleting extra spaces * Update test with adding extra types_encountered * Fixes different tests * Update expected_values for test_metrics test * pep8 fixes * Adding preprocessing copying to predefined models * Adding docstring to reduce memory and optimisedfeatures * Automated autopep8 fixes * Fix bug with unhashable np * Temp update * Fix tests * Fix test_regression_data_operations with inf data after poly_features * Fix bug in tests with IndexError * Adding take by indecies method and to_numpy() in OptimisedFeatures * Update train_test_split for OptimisedFeatures * Transform target to numpy array during memory_reduce * PR#1318 migration * Fixing for test_metrics with py3.10 * Fix test_from_ ... with broadcast * Hide preprocessing messages under debug logging (2) * Fix TypeError with float16, rejection from this type * Refactoring OptimisedFeatures - _columns: np.ndarray -> _columns: pd.DataFrame * Revert changes with features property * Fixes various tests * Global refactoring - Rejection from separate class * Fix pep8, wrong code correction & test * Fixes bug with memory_usage & test * Fixes bug with invalid slice * pep8 fix * test fixes * pep8 fix * fix bug with memory_usage * reduce_memory_usage in utils, fix test with operations * fix tests * fix tests in main api * fix: fix ambiguous value in integration test * fix: fix typing error * fix: fix arrays used as indices must be of integer * fix: fix NoneType object isn't subscriptable error * fix: copy input_data to prevent modification * fix: fix fedot input_data transform to h2o_frame for regression * fix: update the type of ids attributes to np.ndarray * Automated autopep8 fixes * chore: change the logging levels of new messages * chore: fix pep8 style problems * Automated autopep8 fixes * fix: cannot concatenate ndarray * fix: preserve single ndarray type for num_features --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: DRMPN <[email protected]>
- Loading branch information