
Conversation

@h-mayorquin (Contributor) commented Oct 8, 2025

Fixes #1770.

The testing data is merged at https://gin.g-node.org/NeuralEnsemble/ephy_testing_data/pulls/167 and is used in the tests of this PR.

This PR fixes #1770 using the methodology discussed in #1773. To do this, it makes the following changes:

  1. We separate the parsing of the data blocks from the segmentation of the data.
  2. I also continue the refactor started in Blackrock: improve nev data reading #1772 and Blackrock: improve nev header reading #1771 to remove the dynamically loaded functions per spec. The data types of the headers and the data blocks are now global variables defined separately from the reading code.
  3. The gaps report (see Blackrock add summary of automatic data segmentation #1769) was only working for the PTP format. All the formats with timestamps now report gaps.
  4. Following the discussion in General API for handling sample gaps on rawio #1773, this PR introduces a new gap_tolerance_ms argument to give users control over the size of the gaps that should create segments. By default it is None and an error is raised if gaps are found; users can then opt in to loading the data by specifying a gap size (see the sketch after this list).
  5. I use a buffered version of the memmaps so we don't need to create a memmap per data block. This reduces the number of memmaps created by the reader, since operating systems limit how many memory maps a process can hold.
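
As a rough sketch of how the new argument might be used, assuming it is exposed on the BlackrockRawIO constructor (the exact signature may differ from the merged code):

from neo.rawio import BlackrockRawIO

# Default behaviour: gap_tolerance_ms is None, so parse_header() raises an
# error if gaps are found in the timestamps instead of silently segmenting.
reader = BlackrockRawIO(filename="my_recording")
reader.parse_header()  # raises if the file contains gaps

# Opt-in: gaps larger than 1 ms are allowed and become segment boundaries.
reader = BlackrockRawIO(filename="my_recording", gap_tolerance_ms=1.0)
reader.parse_header()
print(reader.segment_count(block_index=0))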

This is a large PR, but I think @samuelgarcia prefers them that way rather than sausage-sliced PRs, and he is the person who might end up reviewing it. I am happy to break it apart if whoever is going to review this prefers it another way.

Tagging @cboulay here as requested, in case he has time to check it.

@h-mayorquin h-mayorquin self-assigned this Oct 8, 2025
@h-mayorquin h-mayorquin marked this pull request as ready for review October 8, 2025 13:30
data_offset = current_offset + header.dtype.itemsize
timestamp = header["timestamp"]

# Create data view into memmap for this block
@h-mayorquin (Contributor, Author) commented:

Here, we use views on a linear memory-mapped buffer to avoid creating a memmap per data block. This is good because operating systems usually limit the number of memmaps a process can create.
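
As a minimal sketch of this technique, with an illustrative header dtype rather than the actual Blackrock spec dtypes:

import numpy as np

# A single memory map over the whole file instead of one memmap per block.
raw_buffer = np.memmap("data.nev", dtype=np.uint8, mode="r")

# Illustrative packet-header layout; the reader uses the dtypes from the spec.
header_dtype = np.dtype([("timestamp", "uint64"), ("num_data_points", "uint32")])

current_offset = 0
header = np.frombuffer(raw_buffer, dtype=header_dtype, count=1, offset=current_offset)[0]
data_offset = current_offset + header.dtype.itemsize
timestamp = header["timestamp"]

# A view into the shared buffer: no extra memmap is created for this block.
block_data = np.frombuffer(
    raw_buffer, dtype=np.int16, count=int(header["num_data_points"]), offset=data_offset
)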

# Remove if raw loading becomes possible
# raise IOError("For loading Blackrock file version 2.1 .nev files are required!")

# This requires nsX to be parsed already
@h-mayorquin (Contributor, Author) commented Oct 8, 2025:

Changed here so that this is done after segmenting.

self.nsx_datas[nsx_nb] = _data_reader_fun(nsx_nb)
data_spec = spec_version

# Parse data blocks (creates memmap, extracts data+timestamps)
@h-mayorquin (Contributor, Author) commented:

This is the core of the PR:

  1. We parse the data blocks (I improved the docstrings there).
  2. We segment the file and report gaps if necessary (see the sketch below).
  3. We transform the result back into the previous data structures to keep the diff small.
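
A rough sketch of the segmentation step, under the simplifying assumption of a regular sampling step and a tolerance expressed in timestamp units (the real code works on the parsed data blocks):

import numpy as np

def segment_by_gaps(timestamps, expected_step, tolerance):
    # A gap is any jump between consecutive timestamps that exceeds the
    # expected sampling step by more than the tolerance.
    diffs = np.diff(timestamps)
    gap_indices = np.nonzero(diffs > expected_step + tolerance)[0]

    # Segment boundaries: start of file, one after each gap, end of file.
    boundaries = np.concatenate(([0], gap_indices + 1, [len(timestamps)]))
    segments = [slice(start, stop) for start, stop in zip(boundaries[:-1], boundaries[1:])]

    if len(gap_indices) > 0:
        print(f"Found {len(gap_indices)} gaps, splitting data into {len(segments)} segments")
    return segments

timestamps = np.array([0, 1, 2, 3, 100, 101, 102])
print(segment_by_gaps(timestamps, expected_step=1, tolerance=5))
# [slice(0, 4, None), slice(4, 7, None)]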

filesize = self._get_file_size(filename)
num_samples = int((filesize - bytes_in_headers) / (2 * channels) - 1)
offset = bytes_in_headers
# Create data view into memmap
@h-mayorquin (Contributor, Author) commented:

Here is the same technique to avoid many memmaps: create one array over the whole buffer (a single memmap) and then create views into it.
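
A minimal sketch of the same idea for the continuous nsX data, assuming int16, channel-interleaved samples (the variable names here are illustrative, not the exact ones in the reader):

import numpy as np

channels = 32
bytes_in_headers = 1024  # illustrative header size

# One memmap over the raw bytes of the whole file.
raw_buffer = np.memmap("data.ns5", dtype=np.uint8, mode="r")

filesize = raw_buffer.size
num_samples = int((filesize - bytes_in_headers) / (2 * channels) - 1)

# A single view into the shared buffer, reshaped to (samples, channels);
# no additional memmap is created.
data = np.frombuffer(
    raw_buffer, dtype=np.int16, count=num_samples * channels, offset=bytes_in_headers
).reshape(num_samples, channels)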

@zm711 zm711 added this to the 0.15.0 milestone Oct 9, 2025
Development

Successfully merging this pull request may close these issues:

BlackrockRawIO is over-segmenting data