Encounter `error: "Fastq record is too long"` when parsing Nanopore sequence data #13

sagrudd · 2023-01-17T09:29:21Z

What are the limits of the maximum sequence length within a record? I am imagining a workflow that should be regularly accommodating of reads over 100kb in length (and with recent ultra-long updates should occasionally expect multi Mb sequence reads.

What would be the most sustainable approach to working through this hurdle? Updating the buffer usize (and forking the project), reverting to bio::io::fastq? As a new to rust developer I'd welcome any comments as to e.g. how performance is going to suffer.

Would welcome some thoughts here - thanks!

iskandr · 2023-06-28T19:51:01Z

I have the same problem with PacBio reads.

Here's the culprit line that sets the buffer size to 68k: https://docs.rs/fastq/latest/src/fastq/lib.rs.html#133

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encounter `error: "Fastq record is too long"` when parsing Nanopore sequence data #13

Encounter `error: "Fastq record is too long"` when parsing Nanopore sequence data #13

sagrudd commented Jan 17, 2023

iskandr commented Jun 28, 2023

Encounter error: "Fastq record is too long" when parsing Nanopore sequence data #13

Encounter error: "Fastq record is too long" when parsing Nanopore sequence data #13

Comments

sagrudd commented Jan 17, 2023

iskandr commented Jun 28, 2023

Encounter `error: "Fastq record is too long"` when parsing Nanopore sequence data #13

Encounter `error: "Fastq record is too long"` when parsing Nanopore sequence data #13