Skip to content

Releases: daisybio/drevalpy

v1.3.5

01 Jul 08:02
dfd351f

Choose a tag to compare

Changes

  • --curve_curator arg is replaced with --no_refitting (default False)
  • Changed GPL-3 license to MIT license
  • _split_early_stopping_data -> no longer private function because it is also used in the pipeline
  • Updated which functions are used in the pipeline

Bug fixes

  • Meaningful error message when visualization does not work because of an empty dataframe

New features

  • CLI interface is now drevalpy --help and drevalpy-report --helpaccessible also without cloning. Removed run_suite.py and create_report.py: Command Line Interfaces (#243) @PascalIversen
  • We added a demo notebook for an easy quickstart :)

Maintenance

  • New package versions

v1.3.4

23 Jun 10:57

Choose a tag to compare

Changes

  • Updated versions but no support for scipy 1.16.0 yet because of statsmodels/statsmodels#9584 (comment)
  • Dataset.py: FeatureDataset from_csv now extracts meta info, to_csv saves it
  • Removed hard-coded cell line/drug identifiers in visualization/utils.py

🐛 Bug Fixes

  • various fixes (#235) @PascalIversen:
    • Fixed inverse transform in cross_study_prediction
    • Inverse transforming in train_and_predict for train and early stopping as well now
    • Removed nested final_model directory
    • Copying drug features before supplying it to train/predict functions, too, now. Forgot that previously

v.1.3.3

16 Jun 13:58
1d26871

Choose a tag to compare

Changes

  • Bumped Dockerfile to Python 3.13, too, and added unzip for Nextflow

v1.3.2

16 Jun 13:14
3618669

Choose a tag to compare

Preprint is out! Linking it in the README

Changes

  • [DEPENDABOT]: Bump codecov/codecov-action from 5.4.2 to 5.4.3 (#218) @dependabot[bot]
  • Transformation on the features like scaling, variance feature selection, … are now implemented as sklearn transformer so that they can be fitted in train and applied for predicting. Because of that, cell line/drug features are now copied in experiment.py
  • Added SingleDrugProteomicsRandomForest, ProteomicsElasticNet
  • Made Naive Models nicer/less redundant
  • Forcing NaiveMeanEffectsPredictor to be always run

🚀 Features

  • Optional final production model fitting procedure after CV (#221) @PascalIversen. Flag final_model_on_full data: if True, a final/production model is saved in the results directory. If hyperparameter_tuning is true, the final model is produced according to the hyperparameter tuning procedure, which was evaluated in the nested cross-validation. Accordingly, all models now have save() and load() functions.
  • Memes on the documentation page!
  • Added no_hyperparameter_tuning flag

🐛 Bug Fixes

  • Fix Feature Scaling (#220) @PascalIversen
  • CurveCurator: name the new data columns like the constants CELL_LINE_IDENTIFIER (=cell_line_name), DRUG_IDENTIFIER (=pubchem_id) instead of "cell_line_id", "drug_id"
  • response from_csv: look for measure, cell line identifier, and drug identifier column; cast drug identifier to str; same for to_dataframe.
  • feature from_csv: forgot metainfo, some fixes
  • load_custom, evaluate_file: forgot dataset_name

🧰 Maintenance

  • Bump to Python 3.13 (now supporting 3.11, 3.12, 3.13)
  • New versions, separate pre-commit action

v1.3.1

09 May 08:34
4fe8bd1

Choose a tag to compare

Small Changes

  • Defaults for visualization result and data directory
  • Fixed bug that CD diagram is overwritten
  • Updated documentation
  • New package versions

v1.3.0

05 May 10:22

Choose a tag to compare

Changes

🐛 Bug Fixes

  • fix LTO for low tissue validations sets (#202) @PascalIversen
  • Lots of restructuring in visualization, kick out partial correlation, instead normalized metrics (#81) @JudithBernett
  • fix measure being not picked for datasets (#186) @PascalIversen
  • Multiomics input fix for MultiOmicsNN (#201) @JudithBernett
  • Fixed an error for early splitting for when there are less than 4 groups in validation

🧰 Maintenance

v1.2.7

04 Apr 15:05
7c18391

Choose a tag to compare

Changes

🐛 Bug Fixes

v1.2.6

21 Mar 17:17
e7c697f

Choose a tag to compare

Changes

  • forced NaiveMeanEffectsPredictor (#169) @PascalIversen
  • updated requirements
  • test restructuring: now baseline tests, single drug model tests, neural network tests for non-single drug models, adapted tests for the bugs
  • MultiOmicsNN: removed unused self.methylation_features

🐛 Bug Fixes

  • Minor bug fixes for DIPK, MOLIR, and SuperFELTR (#161) @JudithBernett
    • DIPK fix for batch size 1
    • MOLIR/SuperFELTR: removed VarianceThreshold selection. Now always selecting min(1000, n_features) most variable features from the dataset
    • MOLIR/SuperFELTR fix for if self.model is None
    • MOLIR/SuperFELTR: moved duplicated code to utils

v1.2.5

07 Mar 19:14
5b1ce9e

Choose a tag to compare

Changes

  • New package versions

🐛 Bug Fixes

  • Metrics fix (#159) @JudithBernett : If there were NaNs in the predicted values, the metrics could not be computed -> also adapted test

v1.2.4

06 Mar 16:13
5d4876a

Choose a tag to compare

Many things were going wrong for cross study prediction -> fixed

Changes

  • Instead of Toy_Data, we now have 2 datasets: TOYv1 (subsetted from CTRPv2) and TOYv2 (subsetted from GDSC2) for testing cross study
  • We needed lists that contain the intersection of all OMICs features across datasets
  • rename Toy_Data and add TOYv2 for testing cross study predictions (#153) @PascalIversen
  • load_and_reduce_gene_features -> load_and_select_gene_features
  • New versions of packages
  • [DEPENDABOT]: Bump actions/cache from 4.2.1 to 4.2.2 (#146) @dependabot[bot]
  • [DEPENDABOT]: Bump codecov/codecov-action from 5.3.1 to 5.4.0 (#145) @dependabot[bot]

🐛 Bug Fixes

  • Bug for pipeline: consolidate single drug model predictions was receiving cross study datasets instead of the dataset names: Consolidation fix (#156) @JudithBernett
  • The features have to have the same order for all datasets -> now happens in load_and_select_gene_features: fix the order of genes across datasets when selecting a gene/methylation island subset of features (#154) @PascalIversen
  • Bug for MOLIR, SuperFELTR: meta-info of feature dataset was not subsetted for variancethreshold filtering
  • Bug for MOLIR, SuperFELTR: features used in training have to be remembered such that the cross-study prediction dataset can be subsetted
  • Bug for MultiOMICs RF: methylation features have to occur in both datasets -> now uses list
  • DIPK fix: gene expression data has to be encoded again for cross study prediction
    updated tests