Skip to content

Understand BioQC analysis and evaluate whether we could use it in hermes #9

Open
@cicdguy

Description

@cicdguy

BioQC is a method to detect tissue heterogeneity in gene expression data (Zhang et al., BMC Genomics, 2017). Enrichment for unexpected tissues can be a consequence of mislabeled samples, technical reasons (e.g. imperfect dissection) or biological reasons (e.g. immune infiltration).

In biokitr:

  • 378 tissue-specific gene signatures derived from public and in-house data are considered
  • Signatures are tested for enrichment in highly expressed genes for each sample using the Wilcoxon-Mann-Whitney test
  • Enrichment scores Q are defined based on the -log10(p-value)
  • Signatures with Q > 2 in at least one sample are shown in the heatmap (the top 40 if there are more)

Internal information:
See https://drive.google.com/file/d/18U_ShdB0ATSWSwdXohUvlA3aC4ZnRmWE/view minute 17
BEDA/biokitr/blob/ab4c130410539f51fbfe680b7e5bc93f5a5163e7/R/gse_bioqc.R#L24
BEDA/biokitr/blob/ab4c130410539f51fbfe680b7e5bc93f5a5163e7/R/gse_bioqc.R#L67

To do: just design.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions