NCI-DOE-Collab-Pilot1-Semi-Supervised-Feature-Learning-with-Center-Loss

Description

This software implements a semi-supervised, autoencoder-based machine learning procedure that learns a smaller set of gene expression features that are robust to batch effects, using background information on the tumor type of each cell line or tissue. We implemented this reduced feature representation and show that the new feature space clusters strongly by tumor type. The experiment spans multiple studies: CCLE, CTRP, gCSI, GDSC, NCI60, and patient-derived tumors. We hypothesize that using a batch-effect-resistant feature set across studies will improve prediction performance.
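One way to quantify how strongly a feature space clusters by tumor type rather than by source study is the silhouette score. The following is a minimal, dependency-free sketch of that metric, not the repository's evaluation code. The four 2-D points and the tumor type labels (`lung`, `skin`) are made-up toy values chosen for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all samples (needs at least 2 clusters)."""
    clusters = {}
    for i, y in enumerate(labels):
        clusters.setdefault(y, []).append(i)

    scores = []
    for i, y in enumerate(labels):
        own = [j for j in clusters[y] if j != i]
        if not own:
            # Singleton cluster: the silhouette is defined as 0.
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the sample's own cluster.
        a = sum(euclidean(points[i], points[j]) for j in own) / len(own)
        # b: mean distance to the members of the nearest other cluster.
        b = min(
            sum(euclidean(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != y
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Toy latent features (hypothetical): two tight groups far apart.
points = [[0.0, 0.0], [0.0, 1.0], [10.0, 0.0], [10.0, 1.0]]
tumor_type = ["lung", "lung", "skin", "skin"]   # hypothetical labels
study = ["CCLE", "GDSC", "CCLE", "GDSC"]        # each group mixes both studies

type_score = silhouette_score(points, tumor_type)
study_score = silhouette_score(points, study)
print(f"by tumor type: {type_score:.3f}, by study: {study_score:.3f}")
```

In this toy layout the score is high when samples are grouped by tumor type and negative when they are grouped by study, which is the pattern a batch-effect-resistant feature space is meant to show. For real data, scikit-learn's `sklearn.metrics` module provides `silhouette_score`, `calinski_harabasz_score`, and `davies_bouldin_score`.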

User Community

Researchers interested in the following topics:

Primary: Cancer biology data modeling
Secondary: Machine learning; bioinformatics; computational biology

Usability

The current code can be used by a data scientist experienced in Python and the domain.

Uniqueness

The new cost function balances reconstruction performance against classification and ‘center loss’ performance. Reconstruction performance ensures that the ‘pinch’ layer retains information about the original gene expression, while classification performance shapes the space so that tumors of the same type lie close together regardless of the source study. Using the ‘pinch’ layer as the new feature set reduces the number of features from 17,000 genes to approximately 1,000 features, or to as few as 20. We compare the new features from our ‘center loss’ autoencoder with those produced by ComBat using the Silhouette score, the Calinski-Harabasz index, and the Davies-Bouldin index. All three metrics show that the proposed ‘center loss’ autoencoder features provide a latent space with better clusters than applying ComBat.
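The balance between the three terms can be sketched as a weighted sum. The following is a minimal, dependency-free illustration under stated assumptions, not the repository's training code: the weights `w_cls` and `w_center`, the use of mean squared error for reconstruction, the use of cross-entropy for classification, and the toy numbers are all hypothetical choices made for this example. The center loss term follows the common definition of half the squared distance between each latent vector and the center of its class, averaged here over the batch.

```python
import math


def mse(x, x_hat):
    """Mean squared reconstruction error between inputs and decoder outputs."""
    n = sum(len(row) for row in x)
    return sum(
        (a - b) ** 2 for row, row_hat in zip(x, x_hat) for a, b in zip(row, row_hat)
    ) / n


def cross_entropy(probs, labels):
    """Mean negative log-probability assigned to the true class."""
    return -sum(math.log(p[y]) for p, y in zip(probs, labels)) / len(labels)


def center_loss(z, labels, centers):
    """Half the mean squared distance from each latent vector to its class center."""
    total = 0.0
    for vec, y in zip(z, labels):
        total += sum((a - c) ** 2 for a, c in zip(vec, centers[y]))
    return 0.5 * total / len(z)


def total_loss(x, x_hat, probs, z, labels, centers, w_cls=1.0, w_center=0.1):
    """Weighted sum of the three terms; the weights are assumed hyperparameters."""
    return (
        mse(x, x_hat)
        + w_cls * cross_entropy(probs, labels)
        + w_center * center_loss(z, labels, centers)
    )


# Toy batch of two samples (all values hypothetical).
x = [[1.0, 2.0], [3.0, 4.0]]            # input expression values
x_hat = [[1.0, 2.0], [3.0, 2.0]]        # decoder reconstruction
probs = [[0.5, 0.5], [0.25, 0.75]]      # classifier output per tumor type
z = [[0.0, 0.0], [2.0, 0.0]]            # 'pinch' layer activations
labels = [0, 1]                         # tumor type index per sample
centers = {0: [0.0, 0.0], 1: [1.0, 0.0]}  # one latent center per tumor type

loss = total_loss(x, x_hat, probs, z, labels, centers)
print(f"total loss: {loss:.4f}")
```

Raising `w_center` pulls samples of the same tumor type toward a shared center in the latent space, while the reconstruction term keeps the ‘pinch’ layer informative about the original expression values.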

Components

This capability provides the following components:

Scripts to download and process RNAseq expression and cell line data
Script to train the autoencoder model
The trained model
Scripts to encode the RNAseq expression and visualize the reduced-dimension results

Technical Details

Refer to this README.
