Commit 3874f3d

📝 update docs and config
1 parent 4cecd9e commit 3874f3d

2 files changed, +8 -3 lines changed
COLLABORATIONS/openTARGETS/README.md

+7 -2
@@ -20,6 +20,7 @@ fusion-dgd.tsv.gz
 fusion-putative-oncogenic.tsv
 gene-expression-rsem-tpm-collapsed.rds
 tcga_gene-expression-rsem-tpm-collapsed.rds
+gtex_gene-expression-rsem-tpm-collapsed.rds
 snv-consensus-plus-hotspots.maf.tsv.gz
 snv-mutect2-tumor-only-plus-hotspots.maf.tsv.gz
 ```
@@ -92,7 +93,11 @@ TCGA data are kept in a seprate matrix from everything else. We need to merge th
 ```sh
 Rscript COLLABORATIONS/openTARGETS/merge_rsem_rds.R --first_file gene-expression-rsem-tpm-collapsed.rds --second_file tcga-gene-expression-rsem-tpm-collapsed.rds --output_fn gene_tcga_expression_common_merge.rds
 ```
-
+UPDATE: GTEx is also in a seprate matrix, so run again currently to make the "final" merge before conversion
+```sh
+Rscript COLLABORATIONS/openTARGETS/merge_rsem_rds.R --first_file gene_tcga_expression_common_merge.rds --second_file gtex_gene-expression-rsem-tpm-collapsed.rds --output_fn gene_tcga_gtex_expression_common_merge.rds
+```
+```

 ### File Transformation
 It's recommended to put datasheets in a dir called `datasheets`, downloaded files in it's own dir (in v12 it's `GF_INPUTS`) and the rest of the processed outputs into it's own dir (`study_build` for v12) to keep things sane and also be able to leverage existing study build script in `scripts/organize_upload_packages.py`
@@ -195,7 +200,7 @@ Options:
 Show this help message and exit
 ```
 Example run:
-`Rscript COLLABORATIONS/openTARGETS/rename_export_rsem.R --rna_rds gene_tcga_expression_common_merge.rds --map_id bs_id_sample_map.txt --type openpedcan_v11 --computeZscore R 2> rna_convert.errs`
+`Rscript COLLABORATIONS/openTARGETS/rename_export_rsem.R --rna_rds gene_tcga_gtex_expression_common_merge.rds --map_id bs_id_sample_map.txt --type openpedcan_v15 --computeZscore R 2> rna_convert.errs`

 #### 5. scripts/convert_fusion_as_sv.py

COLLABORATIONS/openTARGETS/openpedcan_v15_case_meta_config.json

+1 -1
@@ -119,7 +119,7 @@
 "groups": "PUBLIC",
 "cancer_study_identifier": "openpedcan_v15",
 "reference_genome": "hg38",
-"display_name": "Open Pediatric Cancer (OpenPedCan) Project v14",
+"display_name": "Open Pediatric Cancer (OpenPedCan) Project v15",
 "type_of_cancer": "mixed",
 "short_name": "openpedcan_v15"
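Note on the README change: the TCGA merge and the newly added GTEx merge use the same script, with the output of the first run feeding the second. A minimal sketch of chaining them is shown here. The `run` and `merge` helpers and the `DRY_RUN` variable are hypothetical and not part of the repository; only the `Rscript` command line and its three flags come from the README. The README's file list spells the TCGA matrix `tcga_gene-expression-rsem-tpm-collapsed.rds` while its command uses `tcga-gene-expression-rsem-tpm-collapsed.rds`; the sketch keeps the command's spelling, so adjust it to match the file actually on disk.

```sh
#!/bin/sh
# Sketch only: chains the two merge runs described in the README diff.
# DRY_RUN, run and merge are illustrative names, not part of the repo.
set -eu

SCRIPTS=COLLABORATIONS/openTARGETS
DRY_RUN="${DRY_RUN:-1}"   # default: print the commands instead of running them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# merge FIRST SECOND OUTPUT: one call to merge_rsem_rds.R.
merge() {
    run Rscript "$SCRIPTS/merge_rsem_rds.R" \
        --first_file "$1" --second_file "$2" --output_fn "$3"
}

# Step 1: main matrix + TCGA (file names as written in the README command).
merge gene-expression-rsem-tpm-collapsed.rds \
      tcga-gene-expression-rsem-tpm-collapsed.rds \
      gene_tcga_expression_common_merge.rds

# Step 2: result of step 1 + GTEx, giving the input for rename_export_rsem.R.
merge gene_tcga_expression_common_merge.rds \
      gtex_gene-expression-rsem-tpm-collapsed.rds \
      gene_tcga_gtex_expression_common_merge.rds
```

With the default `DRY_RUN=1` the script only prints the two `Rscript` commands, which makes it easy to check the file names before running with `DRY_RUN=0`.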
