You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: COLLABORATIONS/openTARGETS/README.md
+11-11
Original file line number
Diff line number
Diff line change
@@ -35,7 +35,7 @@ To create the histologies file, recommended method is to:
35
35
library("tidyr")
36
36
```
37
37
38
-
1.PulltheOpenPedCan repo (warning, it's 12GB ): https://github.com/PediatricOpenTargets/OpenPedCan-analysis, or just download the script from `analyses/pedcbio-sample-name/pedcbio_sample_name_col.R`
38
+
1.PulltheOpenPedCan repo (warning, it's 12GB ): https://github.com/d3b-center/OpenPedCan-analysis, or just download the script from `analyses/pedcbio-sample-name/pedcbio_sample_name_col.R`
39
39
1. Export from D3b Warehouse the latest existing cBio IDs to use for population. Ensure that the output is csv double-quoted. Currently that can be obtained using the sql command:
40
40
```sql
41
41
@@ -73,7 +73,7 @@ To create the histologies file, recommended method is to:
73
73
1. Get a blacklist from D3b Warehouse, exporting table `bix_workflows.cbio_hide_reasons
74
74
75
75
### Run as standalone
76
-
1. Download from https://github.com/PediatricOpenTargets/OpenPedCan-analysis the `analyses/pedcbio-sample-name/pedcbio_sample_name_col.R` or run from repo if you have it
76
+
1. Download from https://github.com/d3b-center/OpenPedCan-analysis the `analyses/pedcbio-sample-name/pedcbio_sample_name_col.R` or run from repo if you have it
It'srecommendedtoputdatasheetsinadircalled`datasheets`, downloadedfilesinit's own dir (in v12 it's`GF_INPUTS`) andtherestoftheprocessedoutputsintoit's own dir (`study_build` for v12) to keep things sane and also be able to leverage existing study build script in `scripts/organize_upload_packages.py`
92
+
It'srecommendedtoputdatasheetsinadircalled`datasheets`, downloadedfilesinit's own dir (in v12 it's`DOWNLOADED_INPUTS`) andtherestoftheprocessedoutputsintoit's own dir (`study_build` for v12) to keep things sane and also be able to leverage existing study build script in `scripts/organize_upload_packages.py`
Outputs a `data_clinical_sample.txt` and `data_clinical_patient.txt` for the cBio package, and a `bs_id_sample_map.txt` mapping file to link BS IDs to gnerated cBioPortal IDs based on the rules for creating a proper somatic event using column `parent_aliquot_id`
_NOTE_ for v11 input, I ran the following command `zcat snv-dgd.maf.tsv.gz | perl -e '$skip=<>; $skip=<>; while(<>){print$_;}' | gzip -c >> snv-consensus-plus-hotspots.maf.tsv.gz` to add DGD data
142
142
143
-
_NOTE_ for v12 input,I would have following command `python3 ~/tools/kf-cbioportal-etl/COLLABORATIONS/openTARGETS/add_dgd_maf_to_openpedcan.py -i /home/ubuntu/tools/kf-cbioportal-etl/COLLABORATIONS/openTARGETS/maf_openpedcan_v12_header.txt -c openpedcan_v12.maf -t ../bs_id_sample_map.txt -m ../GF_INPUTS/snv-dgd.maf.tsv.gz` to add DGD data, which is more robust - however, there are data issues with DGD, so it was left out
143
+
_NOTE_ for v12 input,I would have following command `python3 ~/tools/kf-cbioportal-etl/COLLABORATIONS/openTARGETS/add_dgd_maf_to_openpedcan.py -i /home/ubuntu/tools/kf-cbioportal-etl/COLLABORATIONS/openTARGETS/maf_openpedcan_v12_header.txt -c openpedcan_v12.maf -t ../bs_id_sample_map.txt -m ../DOWNLOADED_INPUTS/snv-dgd.maf.tsv.gz` to add DGD data, which is more robust - however, there are data issues with DGD, so it was left out
Note, I merged the tcga into the main rds. I also needed an instance with _64GB ram_ in order to calc z scores. Update: Can also achieve by setting up 32GB swap space
Copy file name to clipboardexpand all lines: COLLABORATIONS/openTARGETS/openpedcan_v12_case_meta_config.json
+1-1
Original file line number
Diff line number
Diff line change
@@ -115,7 +115,7 @@
115
115
},
116
116
"study": {
117
117
"_comment": "see https://docs.cbioportal.org/5.1-data-loading/data-loading/file-formats#cancer-study for detailed specifics",
118
-
"description": "<a href=\"https://github.com/PediatricOpenTargets/OpenPedCan-analysis\">OpenPedCan</a> is a collaborative project between the National Cancer Institute and the Children's Hospital of Philadelphia as part of the NCI's Childhood Cancer Data Initiative. Here, we harmonize pan-cancer data using <a href=\"https://kidsfirstdrc.org/\">KidsFirst Data Resource Center</a> workflows and harness <a href=\"https://github.com/AlexsLemonade/OpenPBTA-analysis/\">OpenPBTA analytics</a> workflows to scale and add modules across pediatric cancer datasets. This data has been integrated into the pediatric open targets platform to assist in development and query of the FDA's Relevant Pediatric Molecular Targets List (PMTL) to identify new therapeutics for children with cancer. This is the v12 release of this effort, for v10 please see <a href=\"https://pedcbioportal.kidsfirstdrc.org/study/summary?id=ped_opentargets_2021\">OpenPedCan v10</a>. For study release details, please see <a href=\"https://tinyurl.com/55cxz9am\">Release Notes</a>",
118
+
"description": "<a href=\"https://github.com/d3b-center/OpenPedCan-analysis\">OpenPedCan</a> is a collaborative project between the National Cancer Institute and the Children's Hospital of Philadelphia as part of the NCI's Childhood Cancer Data Initiative. Here, we harmonize pan-cancer data using <a href=\"https://kidsfirstdrc.org\">KidsFirst Data Resource Center</a> workflows and harness <a href=\"https://github.com/AlexsLemonade/OpenPBTA-analysis\">OpenPBTA analytics</a> workflows to scale and add modules across pediatric cancer datasets. This data has been integrated into the pediatric open targets platform to assist in development and query of the FDA's Relevant Pediatric Molecular Targets List (PMTL) to identify new therapeutics for children with cancer. This is the v12 release of this effort underlying the <a href=\"https://moleculartargets.ccdi.cancer.gov\">NCI's molecular targets platform</a>. For study release details, please see <a href=\"https://tinyurl.com/55cxz9am\">Release Notes</a>",
0 commit comments