Skip to content

Commit 7ff8091

Browse files
committed
Merge branch 'release/v5.2.0'
2 parents 14fa7b6 + 694a781 commit 7ff8091

File tree

277 files changed

+424
-28032
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

277 files changed

+424
-28032
lines changed

README.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
[![pylint](imgs/pylint.svg)](https://github.com/acenglish/truvari/actions/workflows/pylint.yml)
33
[![FuncTests](https://github.com/acenglish/truvari/actions/workflows/func_tests.yml/badge.svg?branch=develop&event=push)](https://github.com/acenglish/truvari/actions/workflows/func_tests.yml)
44
[![coverage](imgs/coverage.svg)](https://github.com/acenglish/truvari/actions/workflows/func_tests.yml)
5-
[![develop](https://img.shields.io/github/commits-since/acenglish/truvari/v5.0.0)](https://github.com/ACEnglish/truvari/compare/v5.0.0...develop)
5+
[![develop](https://img.shields.io/github/commits-since/acenglish/truvari/v5.1.1)](https://github.com/ACEnglish/truvari/compare/v5.1.1...develop)
66
[![Downloads](https://static.pepy.tech/badge/truvari)](https://pepy.tech/project/truvari)
77

88
![Logo](https://raw.githubusercontent.com/ACEnglish/truvari/develop/imgs/BoxScale1_DarkBG.png)
@@ -26,12 +26,12 @@ Each sub-command contains help documentation. Start with `truvari -h` to see ava
2626

2727
The current most common Truvari use case is for structural variation benchmarking:
2828
```
29-
truvari bench -b base.vcf.gz -c comp.vcf.gz -o output_dir/
29+
truvari bench -b base.vcf.gz -c comp.vcf.gz -f reference.fa -o output_dir/
3030
```
3131

3232
Find more matches by harmonizing phased varians using refine:
3333
```
34-
truvari refine -R -U -r reference.fa --regions output_dir/candidate.refine.bed output_dir/
34+
truvari refine output_dir/
3535
```
3636

3737
Use Truvari's comparison engine to consolidate redundant variants in a merged multi-sample VCF:

imgs/coverage.svg

+2-2
Loading
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
0 Bytes
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.

repo_utils/answer_key/help.txt

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
usage: truvari [-h] CMD ...
22

3-
Truvari v5.1.1 Structural Variant Benchmarking and Annotation
3+
Truvari v5.2.0 Structural Variant Benchmarking and Annotation
44

55
Available commands:
66
bench Performance metrics from comparison of two VCFs

repo_utils/answer_key/refine/refine_output_one/log.txt

+8-8
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
1-
2025-02-02 06:13:45,675 [INFO] Truvari v5.0.1.dev0+aef4e1a.uc
2-
2025-02-02 06:13:45,675 [INFO] Command /data/truvari/__main__.py bench -b repo_utils/test_files/refine_data/hg002_base.vcf.gz -c repo_utils/test_files/refine_data/hg002_comp.vcf.gz --includebed repo_utils/test_files/refine_data/h1_hc_tr_hg002.bed -s 5 -o test_results/refine_output_one
3-
2025-02-02 06:13:45,676 [INFO] Params:
1+
2025-02-16 01:48:20,256 [INFO] Truvari v5.1.1
2+
2025-02-16 01:48:20,257 [INFO] Command /data/truvari/__main__.py bench -b repo_utils/test_files/refine_data/hg002_base.vcf.gz -c repo_utils/test_files/refine_data/hg002_comp.vcf.gz --includebed repo_utils/test_files/refine_data/h1_hc_tr_hg002.bed -s 5 -o test_results/refine_output_one
3+
2025-02-16 01:48:20,258 [INFO] Params:
44
{
55
"base": "/data/repo_utils/test_files/refine_data/hg002_base.vcf.gz",
66
"comp": "/data/repo_utils/test_files/refine_data/hg002_comp.vcf.gz",
@@ -36,10 +36,10 @@
3636
"skip_gt": false,
3737
"max_resolve": 25000
3838
}
39-
2025-02-02 06:13:45,720 [INFO] Including 225 bed regions
40-
2025-02-02 06:13:49,217 [INFO] Zipped 7157 variants Counter({'comp': 5302, 'base': 1855})
41-
2025-02-02 06:13:49,218 [INFO] 211 chunks of 7157 variants Counter({'__filtered': 6137, 'base': 569, 'comp': 451})
42-
2025-02-02 06:13:49,324 [INFO] Stats: {
39+
2025-02-16 01:48:20,308 [INFO] Including 225 bed regions
40+
2025-02-16 01:48:24,069 [INFO] Zipped 7157 variants Counter({'comp': 5302, 'base': 1855})
41+
2025-02-16 01:48:24,070 [INFO] 211 chunks of 7157 variants Counter({'__filtered': 6137, 'base': 569, 'comp': 451})
42+
2025-02-16 01:48:24,192 [INFO] Stats: {
4343
"TP-base": 387,
4444
"TP-comp": 387,
4545
"FP": 64,
@@ -71,4 +71,4 @@
7171
}
7272
}
7373
}
74-
2025-02-02 06:13:49,324 [INFO] Finished bench
74+
2025-02-16 01:48:24,193 [INFO] Finished bench
Binary file not shown.
Binary file not shown.
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
chr20 5040466 5040487
2-
chr20 5041931 5042443
2+
chr20 5041952 5042292
33
chr20 16257844 16259455
44
chr20 32723034 32723055
55
chr20 51953809 51953830
Binary file not shown.
Binary file not shown.
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
{"base": "test_results/refine_output_one/phab.output.vcf.gz", "comp": "test_results/refine_output_one/phab.output.vcf.gz", "output": "test_results/refine_output_one/phab_bench", "includebed": "/tmp/xn9atc8u.bed", "extend": 0, "debug": false, "reference": null, "refdist": 500, "pctseq": 0.7, "pctsize": 0.7, "pctovl": 0.0, "typeignore": false, "no_roll": true, "chunksize": 1000, "bSample": "syndip", "cSample": "p:HG002", "dup_to_ins": false, "bnddist": 100, "sizemin": 5, "sizefilt": 5, "sizemax": -1, "passonly": false, "no_ref": "a", "pick": "single", "ignore_monref": true, "check_multi": true, "check_monref": true, "no_single_bnd": true, "write_resolved": false, "decompose": true, "short_circuit": true, "skip_gt": false, "max_resolve": 25000}
1+
{"base": "test_results/refine_output_one/phab.output.vcf.gz", "comp": "test_results/refine_output_one/phab.output.vcf.gz", "output": "test_results/refine_output_one/phab_bench", "includebed": "/tmp/_0inhoz1.bed", "extend": 0, "debug": false, "reference": null, "refdist": 500, "pctseq": 0.7, "pctsize": 0.7, "pctovl": 0.0, "typeignore": false, "no_roll": true, "chunksize": 1000, "bSample": "syndip", "cSample": "p:HG002", "dup_to_ins": false, "bnddist": 100, "sizemin": 5, "sizefilt": 5, "sizemax": -1, "passonly": false, "no_ref": "a", "pick": "single", "ignore_monref": true, "check_multi": true, "check_monref": true, "no_single_bnd": true, "write_resolved": false, "decompose": true, "short_circuit": true, "skip_gt": false, "max_resolve": 25000}
Original file line numberDiff line numberDiff line change
@@ -1,27 +1,27 @@
11
{
2-
"TP-base": 361,
3-
"TP-comp": 361,
4-
"FP": 5,
2+
"TP-base": 320,
3+
"TP-comp": 320,
4+
"FP": 9,
55
"FN": 2,
6-
"precision": 0.9863387978142076,
7-
"recall": 0.9944903581267218,
8-
"f1": 0.9903978052126201,
9-
"base cnt": 363,
10-
"comp cnt": 366,
11-
"TP-comp_TP-gt": 361,
6+
"precision": 0.9726443768996961,
7+
"recall": 0.9937888198757764,
8+
"f1": 0.9831029185867896,
9+
"base cnt": 322,
10+
"comp cnt": 329,
11+
"TP-comp_TP-gt": 320,
1212
"TP-comp_FP-gt": 0,
13-
"TP-base_TP-gt": 361,
13+
"TP-base_TP-gt": 320,
1414
"TP-base_FP-gt": 0,
1515
"gt_concordance": 1.0,
1616
"gt_matrix": {
1717
"(0, 1)": {
18-
"(1, 0)": 136
18+
"(1, 0)": 125
1919
},
2020
"(1, 0)": {
21-
"(0, 1)": 178
21+
"(0, 1)": 162
2222
},
2323
"(1, 1)": {
24-
"(1, 1)": 47
24+
"(1, 1)": 33
2525
}
2626
}
2727
}
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
1-
2025-02-02 06:13:50,924 [INFO] Truvari v5.0.1.dev0+aef4e1a.uc
2-
2025-02-02 06:13:50,924 [INFO] Command /data/truvari/__main__.py refine --write-phab --coords O --buffer 100 --use-original-vcfs -f repo_utils/test_files/refine_data/chr20.fa.gz test_results/refine_output_one
3-
2025-02-02 06:13:50,925 [INFO] Params:
1+
2025-02-16 01:48:25,954 [INFO] Truvari v5.1.1
2+
2025-02-16 01:48:25,954 [INFO] Command /data/truvari/__main__.py refine --write-phab --coords O --use-original-vcfs -f repo_utils/test_files/refine_data/chr20.fa.gz test_results/refine_output_one
3+
2025-02-16 01:48:25,956 [INFO] Params:
44
{
55
"benchdir": "test_results/refine_output_one",
6-
"align": "mafft",
6+
"align": "poa",
77
"use_original_vcfs": true,
88
"write_phab": true,
99
"threads": 4,
@@ -15,19 +15,30 @@
1515
"mafft_params": "--auto --thread 1",
1616
"debug": false
1717
}
18-
2025-02-02 06:13:50,925 [INFO] Setting up regions
19-
2025-02-02 06:13:50,941 [INFO] Evaluating 225 regions
20-
2025-02-02 06:13:50,942 [INFO] Extending the regions by 100 bases
21-
2025-02-02 06:13:50,979 [INFO] Found 1 chromosomes with overlapping regions
22-
2025-02-02 06:13:51,987 [INFO] 75 regions to be refined
23-
2025-02-02 06:13:51,997 [WARNING] Harmonizing variants ≥50kbp is not recommended
24-
2025-02-02 06:13:51,998 [INFO] Preparing regions
25-
2025-02-02 06:13:52,006 [INFO] Extracting haplotypes
26-
2025-02-02 06:13:53,095 [WARNING] /usr/local/lib/python3.10/dist-packages/coverage/control.py:888: CoverageWarning:No data was collected. (no-data-collected)
27-
2025-02-02 06:13:53,099 [WARNING] /usr/local/lib/python3.10/dist-packages/coverage/control.py:888: CoverageWarning:No data was collected. (no-data-collected)
28-
2025-02-02 06:13:53,110 [INFO] Harmonizing variants
29-
2025-02-02 06:13:55,109 [INFO] Running bench
30-
2025-02-02 06:13:55,134 [INFO] Including 75 bed regions
31-
2025-02-02 06:13:57,321 [INFO] Zipped 4134 variants Counter({'base': 2067, 'comp': 2067})
32-
2025-02-02 06:13:57,322 [INFO] 76 chunks of 4134 variants Counter({'__filtered': 3405, 'comp': 366, 'base': 363})
33-
2025-02-02 06:13:58,488 [INFO] Finished refine
18+
2025-02-16 01:48:25,956 [INFO] Setting up regions
19+
2025-02-16 01:48:25,970 [INFO] Evaluating 225 regions
20+
2025-02-16 01:48:25,971 [INFO] Extending the regions by 100 bases
21+
2025-02-16 01:48:26,011 [INFO] Found 1 chromosomes with overlapping regions
22+
2025-02-16 01:48:27,023 [INFO] 75 regions to be refined
23+
2025-02-16 01:48:27,034 [WARNING] Harmonizing variants ≥50kbp is not recommended
24+
2025-02-16 01:48:27,035 [INFO] Preparing regions
25+
2025-02-16 01:48:27,043 [INFO] Extracting haplotypes
26+
2025-02-16 01:48:28,113 [WARNING] /usr/local/lib/python3.10/dist-packages/coverage/control.py:892: CoverageWarning:No data was collected. (no-data-collected)
27+
2025-02-16 01:48:28,116 [WARNING] /usr/local/lib/python3.10/dist-packages/coverage/control.py:892: CoverageWarning:No data was collected. (no-data-collected)
28+
2025-02-16 01:48:28,130 [INFO] Harmonizing variants
29+
2025-02-16 01:48:30,259 [INFO] Running bench
30+
2025-02-16 01:48:30,288 [INFO] Including 75 bed regions
31+
2025-02-16 01:48:32,225 [INFO] Zipped 4226 variants Counter({'base': 2113, 'comp': 2113})
32+
2025-02-16 01:48:32,226 [INFO] 77 chunks of 4226 variants Counter({'__filtered': 3575, 'comp': 329, 'base': 322})
33+
2025-02-16 01:48:33,331 [INFO] Stats: {
34+
"TP-base": 555,
35+
"TP-comp": 555,
36+
"FP": 20,
37+
"FN": 2,
38+
"precision": 0.9652173913043478,
39+
"recall": 0.9964093357271095,
40+
"f1": 0.980565371024735,
41+
"base cnt": 557,
42+
"comp cnt": 575
43+
}
44+
2025-02-16 01:48:33,332 [INFO] Finished refine

0 commit comments

Comments
 (0)