Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Encounter error: "Fastq record is too long" when parsing Nanopore sequence data #13

Open
sagrudd opened this issue Jan 17, 2023 · 1 comment

Comments

@sagrudd
Copy link

sagrudd commented Jan 17, 2023

What are the limits of the maximum sequence length within a record? I am imagining a workflow that should be regularly accommodating of reads over 100kb in length (and with recent ultra-long updates should occasionally expect multi Mb sequence reads.

What would be the most sustainable approach to working through this hurdle? Updating the buffer usize (and forking the project), reverting to bio::io::fastq? As a new to rust developer I'd welcome any comments as to e.g. how performance is going to suffer.

Would welcome some thoughts here - thanks!

@iskandr
Copy link

iskandr commented Jun 28, 2023

I have the same problem with PacBio reads.

Here's the culprit line that sets the buffer size to 68k: https://docs.rs/fastq/latest/src/fastq/lib.rs.html#133

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants