loading signal_pa from pod5 is very slow for some files #171

@lpryszcz

Hi, with some files (typically ones from the last few hours of a sequencing run) I'm experiencing very slow access to signal_pa.

For example, it takes over 1 minute to load the signal from a 2.3 GB file with 1.7k reads. I tried this on both an HDD and an SSD, and the loading time is similar.
By contrast, for other pod5 files it takes only 10-20 seconds to load the signal from 100k reads (file size 1.6 GB)!

import pod5

reader = pod5.DatasetReader(fn)  # fn: path to the pod5 file
for ri, read in enumerate(reader, 1):
    sig = read.signal_pa  # loads the full signal of the read, in pA

I'm using the latest version of pod5, v0.3.15.

I'm interested in the pA signal from the beginning of the read. Is there a way to access it faster?
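To make the question concrete, here is a minimal sketch of what "only the beginning of the read" could look like: it walks a read's stored signal chunks in order and stops once enough samples are collected. The helper name `leading_signal_chunks` is hypothetical, and the sketch assumes the read record exposes `signal_rows` and `signal_pa_for_chunk(index)`, which should be checked against the installed pod5 version. Whether this is actually faster than `signal_pa` for the slow files is untested.

```python
def leading_signal_chunks(read, n_samples):
    """Collect per-chunk pA arrays covering the first n_samples of a read.

    ASSUMPTION: `read` behaves like a pod5 read record that exposes
    `signal_rows` (one entry per stored signal chunk) and
    `signal_pa_for_chunk(index)`. Verify both against your pod5 version.
    """
    chunks = []
    remaining = n_samples
    for index in range(len(read.signal_rows)):
        if remaining <= 0:
            # Enough samples collected; later chunks are never loaded.
            break
        # Load one chunk and trim it to the samples still needed.
        chunk = read.signal_pa_for_chunk(index)[:remaining]
        chunks.append(chunk)
        remaining -= len(chunk)
    return chunks
```

Inside the loop above, `sig = np.concatenate(leading_signal_chunks(read, 4000))` would then give a single array (with `numpy` imported as `np`); 4000 is an arbitrary example value, not a recommendation.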
