Skip to content

Commit e910feb

Browse files
committed
Update README for ShoRAH2
1 parent 368dd38 commit e910feb

File tree

1 file changed

+50
-32
lines changed

1 file changed

+50
-32
lines changed

README.md

+50-32
Original file line numberDiff line numberDiff line change
@@ -20,36 +20,42 @@ several programs, the most imporant of which are:
2020

2121
| Tool | What it does |
2222
| -------------- | ------------------------------------------------------------------- |
23-
| `amplian.py` | amplicon based analysis |
24-
| `dec.py` | local error correction based on diri_sampler |
23+
| `shorah` | wrapper for everything |
24+
|`shorah amplicon`| amplicon based analysis |
25+
|`shorah shotgun`| shotgun sequencing analysis |
26+
| `shorah snv` | detects single nucleotide variants, taking strand bias into account |
27+
| `b2w` | splitting shotgun sequencing .BAM into multiple overlapping windows |
2528
| `diri_sampler` | Gibbs sampling for error correction via Dirichlet process mixture |
26-
| `contain` | removal of redundant reads |
27-
| `mm.py` | maximum matching haplotype construction |
28-
| `freqEst` | EM algorithm for haplotype frequency |
29-
| `snv.py` | detects single nucleotide variants, taking strand bias into account |
30-
| `shorah.py` | wrapper for everything |
29+
| `fil` | strand bias test |
3130

3231
## Citation
3332
If you use shorah, please cite the application note paper _Zagordi et al._ on
3433
[BMC Bioinformatics](http://www.biomedcentral.com/1471-2105/12/119).
3534

3635
## General usage
36+
### Pre-built packages
37+
ShoRAH and its dependencies are all
38+
[available in bioconda](https://bioconda.github.io/recipes/shorah/README.html).
39+
We strongly advise you to install this package for a hassle-free experience.
40+
41+
conda install shorah
42+
3743
### Dependencies
3844
shorah requires the following pieces of software:
3945

40-
1. **Python 2 or Python 3**, backward compatibility is provided as some current Linux distributions and OS X systems are still using 2.x as default. The required dependencies are:
46+
1. **Python 3** The required dependencies are:
4147

4248
a) **Biopython**, and
4349
b) **NumPy**.
44-
These packages can be downloaded using pip or anaconda
50+
These packages can be downloaded using pip or conda
4551

46-
2. **Perl**, for some scripts
52+
2. **HTSlib** which is used to access bam/cram/sam and fasta files.
4753

48-
3. **zlib**, which is used by the bundled samtools for compressing bam files
54+
3. **zlib**, which is used by HTSlib for compressing bam files
4955

50-
4. **pkg-config**, for discovering dependencies, which most Unix-like systems include
56+
3. **pkg-config**, for discovering dependencies, which most Unix-like systems include
5157

52-
5. **GNU scientific library**, for random number generation
58+
4. **Boost C++ library**, for random number generation
5359

5460
In addition, if you want to bootstrap the git version of shorah instead of using the provided tarballs,
5561
you will need the GNU Autotools:
@@ -95,44 +101,49 @@ where `4` specifies the number of compilation threads to use. Finally, after com
95101

96102
All the programs should now be located in `/usr/local/shorah/bin`.
97103

98-
### Boostrapping from git
104+
#### Boostrapping from git
99105
If you opted to clone the git repository instead of downloading a prepared tarball, you will need to bootstrap
100106
the configure script:
101107

102-
autoreconf -vif
108+
autoreconf -vif -I m4
103109

104110
After this, you can run the `configure` script as described previously.
105111

106112

107113
#### Windows users
108-
You can install and run `shorah` with [Cygwin](http://www.cygwin.com).
114+
Since Windows 10, Microsoft provides the
115+
[Windows Subsystem for Linux](https://docs.microsoft.com/en-us/windows/wsl/about)
116+
which enables a linux environment under Windows.
117+
118+
Users of older versions of Windows can can install and run `shorah` with [Cygwin](http://www.cygwin.com).
109119
Please see the relevant paragraph on the
110120
[documentation page](http://cbg-ethz.github.io/shorah/).
111121

112122
### Run
113-
The input is a sorted bam file. Analysis can be performed in local or global
114-
mode.
123+
The input is a sorted bam file. Analysis can be performed in local mode.
124+
125+
**Note:** Currently in ShoRAH2 global haplotype reconstruction is disabled.
126+
The last version of ShoRAH1 with global mode is
127+
[v1.1.3](https://github.com/cbg-ethz/shorah/tree/v1.1.3)
128+
and can easily be obtained from
129+
[bioconda](https://bioconda.github.io/recipes/shorah/README.html).
130+
If you wish to perform global reconstruction, we suggest that you consider the
131+
[global haplotype options](https://github.com/cbg-ethz/V-pipe/wiki/options#haplotype_reconstruction)
132+
available in [V-pipe](https://cbg-ethz.github.io/V-pipe/)
115133

116134
#### Local analysis
117-
The local analysis alone can be run invoking `dec.py` or `amplian.py` (program
135+
The local analysis alone can be run invoking `shorah shotgun` or `shorah amplicon` (program
118136
for the amplicon mode). They work by cutting window from the multiple sequence
119-
alignment, invoking `diri_sampler` on the windows and calling `snv.py` for the
137+
alignment, invoking `diri_sampler` on the windows and calling `shorah snv` for the
120138
SNV calling. See the
139+
[`README`](https://github.com/cbg-ethz/shorah/blob/master/examples/shotgun_test/README.md)
140+
file in directory
141+
[`shotgun_test`](https://github.com/cbg-ethz/shorah/blob/master/examples/shotgun_test/)
142+
and the
121143
[`README`](https://github.com/cbg-ethz/shorah/blob/master/examples/amplicon_test/README.md)
122144
file in directory
123145
[`amplicon_test`](https://github.com/cbg-ethz/shorah/blob/master/examples/amplicon_test/).
124146

125-
#### Global analysis
126-
The whole global reconstruction consists of the following steps:
127-
128-
1. error correction (*i.e.* local haplotype reconstruction);
129-
2. SNV calling;
130-
3. removal of redundant reads;
131-
4. global haplotype reconstruction;
132-
5. frequency estimation.
133-
134-
These can be run one after the other, or one can invoke `shorah.py`, that runs
135-
the whole process from bam file to frequency estimation and SNV calling.
136147

137148
## Coding style
138149
All changes to the C++ code in `src/cpp` should always be formatted according to the included `.clang-format` style by doing
@@ -141,4 +152,11 @@ All changes to the C++ code in `src/cpp` should always be formatted according to
141152

142153
in the root of the repository.
143154

144-
All changes to the python code in `src/shorah` should always be formatted conforming to the [PEP 8](https://www.python.org/dev/peps/pep-0008/) style guide. To this end, we advise to use [autopep8](https://pypi.python.org/pypi/autopep8).
155+
All changes to the python code in `src/shorah` should always be formatted conforming to the [PEP 8](https://www.python.org/dev/peps/pep-0008/) style guide. To this end, we advise to use [autopep8](https://pypi.python.org/pypi/autopep8).
156+
157+
## Contact
158+
159+
ShoRAH is maintained as part of the [V-pipe virus NGS pipeline](https://cbg-ethz.github.io/V-pipe/)
160+
and you can [easily reach out the developers on its website](https://cbg-ethz.github.io/V-pipe/contact).
161+
162+
You can also report your problems in the [issue tracker on GitHub](https://github.com/cbg-ethz/shorah/issues).

0 commit comments

Comments
 (0)