@@ -64,7 +64,18 @@ To fetch DITTO source code, change in to directory of your choice and run:
6464git clone https://github.com/uab-cgds-worthey/DITTO.git
6565```
6666
67- #### Setup OpenCravat (only one-time installation)
67+ #### Run DITTO pipeline on UAB Cheaha
68+
69+ To run on UAB cheaha, please update the ` model.job ` and ` .test_data/file_list.txt ` files with complete file paths for all
70+ necessary files and tools and submit a slurm job using the command below
71+
72+ ``` sh
73+ sbatch model.job
74+ ```
75+
76+ #### Run DITTO pipeline outside of UAB Cheaha
77+
78+ *** Setup OpenCravat (only one-time installation)***
6879
6980Please follow the steps mentioned in [ install_openCravat.md] ( docs/install_openCravat.md ) .
7081
@@ -75,7 +86,7 @@ Please follow the steps mentioned in [install_openCravat.md](docs/install_openCr
7586 <!-- markdown-link-check-enable -->
7687> These will be ignored when running the pipeline.
7788
78- #### Run DITTO pipeline
89+ *** Setup Nextflow ***
7990
8091Create an environment via conda. Below is an example to install ` nextflow ` .
8192
@@ -91,7 +102,8 @@ conda activate ditto-env
91102conda install bioconda::nextflow
92103```
93104
94- Please make a samplesheet with VCF files (incl. path). Please make sure to edit the directory paths as needed and run
105+ Please make a samplesheet ` .test_data/file_list.txt ` with VCF files (incl. path).
106+ Please make sure to edit the directory paths as needed and run
95107the pipeline as shown below.
96108
97109``` sh
@@ -103,24 +115,18 @@ nextflow run pipeline.nf \
103115 --sample_sheet .test_data/file_list
104116```
105117
106- To run on UAB cheaha, please update the ` model.job ` file and submit a slurm job using the command below
107-
108- ``` sh
109- sbatch model.job
110- ```
111-
112118## Reproducing the DITTO model
113119
114120Detailed instructions on reproducing the model is explained in [ build_DITTO.md] ( docs/build_DITTO.md )
115121
116122## Download DITTO DB (Precomputed scores)
117123
118- Precomputed scores for all possible SNVs and known Indels from gnomADv3.0 in main chromosomes in hg38 reference genome
124+ Precomputed scores for all possible SNVs and known Indels from gnomADv3.0 in main chromosomes in hg38 reference genome
119125are available to download here - < https://s3.lts.rc.uab.edu/cgds-public/dittodb/dittodb.html >
120126
121127## How to cite?
122128<!-- markdown-link-check-disable -->
123- Mamidi, T.K.K.; Wilk, B.M.; Gajapathy, M.; Worthey, E.A. DITTO: An Explainable Machine-Learning Model for
129+ Mamidi, T.K.K.; Wilk, B.M.; Gajapathy, M.; Worthey, E.A. DITTO: An Explainable Machine-Learning Model for
124130Transcript-Specific Variant Pathogenicity Prediction. Preprints 2024, 2024040837. < https://doi.org/10.20944/preprints202404.0837.v1 >
125131<!-- markdown-link-check-enable -->
126132## Contact information
0 commit comments