<tool id="cshl_fastq_to_fasta" name="FASTQ to FASTA" version="@VERSION@">
    <description>converter from FASTX-toolkit</description>
    <expand macro="requirements" />
    <macros>
        <import>fastx_macros.xml</import>
    </macros>
    <command>gunzip -cf '$input' | fastq_to_fasta $SKIPN $RENAMESEQ -o '$output' -v
## Sanger-encoded input uses a Phred+33 quality offset
#if $input.ext == "fastqsanger":
    -Q 33
#end if
    </command>
    <inputs>
        <param format="fastqsanger,fastqsolexa,fastqillumina" name="input" type="data" label="FASTQ Library to convert" />
        <param name="SKIPN" type="select" label="Discard sequences with unknown (N) bases">
            <option value="">yes</option>
            <option value="-n">no</option>
        </param>
        <param name="RENAMESEQ" type="select" label="Rename sequence names in output file (reduces file size)">
            <option value="-r">yes</option>
            <option value="">no</option>
        </param>
    </inputs>
    <tests>
        <test>
            <!-- FASTQ-To-FASTA, discard N (default), don't rename -->
            <param name="input" value="fastq_to_fasta1.fastq" ftype="fastqsolexa" />
            <param name="SKIPN" value=""/>
            <param name="RENAMESEQ" value=""/>
            <output name="output" file="fastq_to_fasta1a.out" />
        </test>
        <test>
            <!-- FASTQ-To-FASTA, keep N (-n), rename (-r) -->
            <param name="input" value="fastq_to_fasta1.fastq" ftype="fastqsolexa" />
            <param name="SKIPN" value="no"/>
            <param name="RENAMESEQ" value="yes"/>
            <output name="output" file="fastq_to_fasta1b.out" />
        </test>
    </tests>
    <outputs>
        <data format="fasta" name="output" metadata_source="input" />
    </outputs>
    <help>
**What it does**

This tool converts FASTQ data (Sanger, Solexa, or Illumina quality encoding) to FASTA format, discarding the quality scores (scroll down for an example).

--------

**Example**

The following data in Solexa-FASTQ format::

    @CSHL_4_FC042GAMMII_2_1_517_596
    GGTCAATGATGAGTTGGCACTGTAGGCACCATCAAT
    +CSHL_4_FC042GAMMII_2_1_517_596
    40 40 40 40 40 40 40 40 40 40 38 40 40 40 40 40 14 40 40 40 40 40 36 40 13 14 24 24 9 24 9 40 10 10 15 40

Will be converted to FASTA (with 'rename sequence names' = NO)::

    >CSHL_4_FC042GAMMII_2_1_517_596
    GGTCAATGATGAGTTGGCACTGTAGGCACCATCAAT

Will be converted to FASTA (with 'rename sequence names' = YES)::

    >1
    GGTCAATGATGAGTTGGCACTGTAGGCACCATCAAT
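
For orientation, the options above map onto the underlying command line roughly as follows. This is a sketch rather than the exact command Galaxy runs: ``reads.fastq`` and ``reads.fasta`` are hypothetical file names, and ``-Q 33`` is assumed to be needed only for Sanger-encoded input::

    fastq_to_fasta -Q 33 -n -r -v -i reads.fastq -o reads.fasta

Here ``-n`` keeps sequences that contain unknown (N) bases, which are discarded by default, and ``-r`` renames the sequence identifiers to numbers.
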

------

This tool is based on `FASTX-toolkit`__ by Assaf Gordon.

.. __: http://hannonlab.cshl.edu/fastx_toolkit/
    </help>
    <!-- FASTQ-to-FASTA is part of the FASTX-toolkit, by A. Gordon -->
</tool>