Adding MED format, second submission #1241

dancrepeau · 2023-03-26T18:37:04Z

Hello,

A month or so ago, my colleague Matt Stead and I pushed a fork with changes to support the MED format (medformat.org). This new pull request is for a version that's a bit more detailed: it finds discontinuities and creates separate Neo segments, and it groups channels together that have the same sampling frequencies. It also reads Note and Neuralynx type annotations, and has two time basis options, zero-based and original time.

Please let me know if there are questions/comments/criticisms! Thank you,
Dan

JuliaSprenger

Hi @dancrepeau!
Thanks for rewriting the IO! I think the segment & stream concept is not implemented consistently yet. I left some comments in the code. Tell me if you need more details on this. I think a test file with more than one segment (by design or discontinuous data) would be beneficial here to implement this. Based on the tests you wrote I guess the current test file is a single segment only, right?

JuliaSprenger · 2023-03-29T10:09:26Z

neo/io/__init__.py

 from neo.io.klustakwikio import KlustaKwikIO
 from neo.io.kwikio import KwikIO
 from neo.io.mearecio import MEArecIO
+from neo.io.medio import MedIO


This is nitpicking, but this change should be one line further down to be in alphabetical order....

neo/rawio/medrawio.py

JuliaSprenger · 2023-03-29T10:42:17Z

neo/rawio/medrawio.py

+        for seg_idx in range(self._nb_segment):
+            for template in self._stream_templates:


You are creating one stream per segment and stream template. However, streams are shared between all segments. I think you are generating more streams than required and could just use the stream_templates as streams. Or is this a feature of MED that in each segment you could have different streams and channels?

JuliaSprenger · 2023-03-29T10:52:16Z

neo/rawio/medrawio.py

+        spike_channels = np.array(spike_channels, dtype=_spike_channel_dtype)
+
+        # Events
+        # events are not being read by this Neo wrapper


You have implemented methods below to read event data, so you should also create the corresponding event header for those methods to be used.

JuliaSprenger · 2023-03-29T10:54:06Z

neo/rawio/medrawio.py

+        seg_ann = bl_ann['segments'][0]
+        seg_ann['name'] = 'Seg #0 Block #0'


You are implementing a multi-segment IO, so it would make sense provide a (less generic) name for all segments.

neo/rawio/medrawio.py

JuliaSprenger · 2023-03-29T13:01:09Z

neo/test/iotest/test_medio.py

+        for ana in seg.analogsignals:
+            assert isinstance(ana, AnalogSignalProxy)
+            ana = ana.load()
+            assert isinstance(ana, AnalogSignal)
+        for st in seg.spiketrains:
+            assert isinstance(st, SpikeTrainProxy)
+            st = st.load()
+            assert isinstance(st, SpikeTrain)


you are testing basic RawIO functionality here. This should be already covered in other tests in Neo, so I would not duplicate it here.

JuliaSprenger · 2023-03-29T13:01:29Z

neo/test/iotest/test_medio.py

+
+        seg = r.read_segment(lazy=False)
+
+        # There will only be one analogsignals in a MED reading


Suggested change

# There will only be one analogsignals in a MED reading

# There will only be one analogsignal in a MED reading

JuliaSprenger · 2023-03-29T13:03:27Z

neo/test/iotest/test_medio.py

+        bl = r.read_block(lazy=True)
+        self.assertTrue(bl.annotations)
+
+        seg = bl.segments[0]


Depending on your test file the IO should generate more than 1 segment. If there's more segments, could you also check those?

JuliaSprenger · 2023-03-29T13:04:35Z

neo/test/iotest/test_medio.py

+            self.assertNotEqual(st.size, 0)
+
+        # annotations
+        #assert 'seg_extra_info' in seg.annotations


You added MED specific annotations to the block. You could confirm that these are correctly annotated on the block level. (in test_read_block)

dancrepeau · 2023-03-29T23:54:50Z

Hi @JuliaSprenger!

Thank you for reviewing the code. I will go through your comments thoroughly in the coming days. I will comment right away that we are indeed testing against a recording that has discontinuous ranges, specifically "test.medd". Starting on line 76 of test_medrawio.py, we are verifying that it found 3 segments, and there are two unique streams in each segment (6 streams total).

If streams are supposed to be shared across all segments, then I will have to re-work the code. I'm using the convention that a continuous stretch of data is a segment, and within that, a stream is a grouping of channels that have the same sampling frequency. Please correct me if I have those ideas wrong.

My notion of a "template" (which admittedly isn't clear) is just a grouping of channels that have the same sampling frequency. However, there can be lots of discontinuitites within a recording, so I am multiplying the templates by continuous ranges, and assuming those are streams. I am assuming a stream can't have gaps in it.

Thanks again for all the help!
-Dan

JuliaSprenger · 2023-03-30T09:35:06Z

Hi @dancrepeau,
yes, I think the segment & stream design might need to be adjusted to fit the Neo & RawIO structures.
In general a Neo segment is intended to contain data that shares a common clock. So for many IOs multiple segments are generated when either the recording was paused and restarted or the recording system didn't manage to write all of the continuous data, but lost some samples at a certain time point.

In the RawIO design a stream is independently of segments describing different input channels, characterised e.g. for MED by their different sampling rates. When loading a dataset Neo creates one AnalogSignal per Segment per Stream. This also implies that all channels (recording traces) in a stream have to contain the same number of samples within a segment, to be able to represent the data of that AnalogSignal as a numpy array.

I hope that helps restructuring, let me know if you are still missing some information.

dancrepeau · 2023-03-30T19:31:21Z

Hi @JuliaSprenger,

Thank you for the clarifications. The terminology here can become very confusing. In MED, segments and continuous sections are independent concepts, as segments are primarily used to break recordings up into smaller files, but can also be used to indicate a new experiment within a recording.

I might suggest some changes to documentation to make these Neo concepts easier for newcomers to digest (and feel free to disregard me here, as I am jus trying to be helpful). The Neo core documentation (https://neo.readthedocs.io/en/stable/core.html) only shows a higher level view; indeed the word "stream" doesn't appear anywhere on this page. I found the file "baserawio.py" to be helpful in understanding the raw io concepts, however I came to some wrong conclusions. For example, the function:

def get_signal_t_start(self, block_index, seg_index, stream_index=None):
"""
Retrieve the t_start of a single section of the channels in a stream.
:param block_index:
:param seg_index:
:param stream_index:
:return: start time of section
"""
refers to a 'section' of a stream, yet 'section' is not a parameter to the function. I believe the correct interpretation is that 'segment' and 'section' are interchangeable (in this context). I came to the conclusion that 'section' is implied by the stream itself, and therefore there was only one start time to a stream.

Again, I appreciate the clarifications!
-Dan

JuliaSprenger reviewed Mar 29, 2023

View reviewed changes

JuliaSprenger added this to the 0.13.0 milestone Apr 2, 2023

This was referenced Apr 3, 2023

First draft of proposal for documentation rewrite #1178

Merged

Would like permission to push MED format branch #1218

Closed

JuliaSprenger added the New IO Class label Apr 4, 2023

dancrepeau closed this Apr 22, 2023

dancrepeau force-pushed the master branch from 981feff to 2d63e18 Compare April 22, 2023 15:39

PeterNSteinmetz mentioned this pull request Feb 27, 2024

Fix Neuralynx gaps detection #1418

Closed

		for seg_idx in range(self._nb_segment):
		for template in self._stream_templates:

		seg_ann = bl_ann['segments'][0]
		seg_ann['name'] = 'Seg #0 Block #0'


		seg = r.read_segment(lazy=False)

		# There will only be one analogsignals in a MED reading

	# There will only be one analogsignals in a MED reading
	# There will only be one analogsignal in a MED reading

Adding MED format, second submission #1241

Adding MED format, second submission #1241

Uh oh!

Conversation

dancrepeau commented Mar 26, 2023

Uh oh!

JuliaSprenger left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dancrepeau commented Mar 29, 2023

Uh oh!

JuliaSprenger commented Mar 30, 2023

Uh oh!

dancrepeau commented Mar 30, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants