You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+33-36Lines changed: 33 additions & 36 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,20 +6,24 @@ The germline variant annotator (*gvanno*) is a simple, Docker-based software pac
6
6
7
7
*gvanno* accepts query files encoded in the VCF format, and can analyze both SNVs and short InDels. The workflow relies heavily upon [Ensembl’s Variant Effect Predictor (VEP)](http://www.ensembl.org/info/docs/tools/vep/index.html), and [vcfanno](https://github.com/brentp/vcfanno). It produces an annotated VCF file and a file of tab-separated values (.tsv), the latter listing all annotations pr. variant record.
8
8
9
-
#### Annotation resources included in _gvanno_ - 0.8.0
9
+
#### Annotation resources included in _gvanno_ - 0.9.0
10
10
11
-
*[VEP v95](http://www.ensembl.org/info/docs/tools/vep/index.html) - Variant Effect Predictor (GENCODE v29/v19 as the gene reference dataset)
12
-
*[dBNSFP v4.0](https://sites.google.com/site/jpopgen/dbNSFP) - Database of non-synonymous functional predictions (February 2019)
@@ -95,42 +99,35 @@ We __strongly__ recommend that the input VCF is compressed and indexed using [bg
95
99
96
100
A few elements of the workflow can be figured using the *gvanno* configuration file (i.e. **gvanno.toml**), encoded in [TOML](https://github.com/toml-lang/toml) (an easy to read file format).
97
101
98
-
* The initial step of the workflow performs [VCF validation](https://github.com/EBIvariation/vcf-validator) on the input VCF file. This procedure is very strict, and often causes the workflow to return an error due to various violations of the VCF specification. If the user trusts that the most critical parts of the input VCF is properly encoded, a setting in the configuration file (`vcf_validation = false`) can be used to turn off VCF validation.
99
-
100
102
* Prediction of loss-of-function variants using VEP's LOFTEE plugin can be turned on in the configuration file (`lof_prediction = true`). Do note that this frequently increases the run time for VEP significantly.
101
103
102
104
#### STEP 5: Run example
103
105
104
106
Run the workflow with **gvanno.py**, which takes the following arguments and options:
sample_id Sample identifier - prefix for output files
121
119
122
-
optional arguments:
123
-
-h, --help show this help message and exit
124
-
--force_overwrite The script will fail with an error if the output file
125
-
already exists. Force the overwrite of existing result
126
-
files by using this flag (default: False)
127
-
--version show program's version number and exit
120
+
optional arguments:
121
+
-h, --help show this help message and exit
122
+
--force_overwrite The script will fail with an error if the output file already exists. Force the overwrite of existing result files by using this flag
123
+
--version show program's version number and exit
124
+
--no_vcf_validate Skip validation of input VCF with Ensembl's vcf-validator
128
125
129
126
130
127
The _examples_ folder contains an example VCF file. Analysis of the example VCF can be performed by the following command:
Copy file name to clipboardExpand all lines: gvanno.py
+34-14Lines changed: 34 additions & 14 deletions
Original file line number
Diff line number
Diff line change
@@ -10,20 +10,23 @@
10
10
importgetpass
11
11
importplatform
12
12
importtoml
13
+
fromargparseimportRawTextHelpFormatter
13
14
14
15
15
-
gvanno_version='0.8.0'
16
-
db_version='GVANNO_DB_VERSION = 20190320'
17
-
vep_version='95'
16
+
17
+
gvanno_version='0.9.0'
18
+
db_version='GVANNO_DB_VERSION = 20190521'
19
+
vep_version='96'
18
20
globalvep_assembly
19
21
20
22
def__main__():
21
23
22
-
parser=argparse.ArgumentParser(description='Germline variant annotation (gvanno) workflow for clinical and functional interpretation of germline nucleotide variants',formatter_class=argparse.ArgumentDefaultsHelpFormatter)
24
+
parser=argparse.ArgumentParser(description='Germline variant annotation (gvanno) workflow for clinical and functional interpretation of germline nucleotide variants',formatter_class=RawTextHelpFormatter, usage="%(prog)s [options] <QUERY_VCF> <GVANNO_DIR> <OUTPUT_DIR> <GENOME_ASSEMBLY> <CONFIG_FILE> <SAMPLE_ID>")
23
25
parser.add_argument('--force_overwrite', action="store_true", help='The script will fail with an error if the output file already exists. Force the overwrite of existing result files by using this flag')
0 commit comments