Raymondwchang/streaming init cnmfe attrs #52

raymondwjang · 2025-03-01T00:12:31Z

📝 Description

populate transformers that initialize non-fundamental (i.e. anything besides footprints and fluorescence) matrices - sufficient statistics, residual buffers, etc.

as the way these transformers interact with components varies (some take in footprints and output traces, some take in footprints AND traces and output pixel correlation, etc.), I am adding a decorator class that works as an interface between the transformers and ComponentManager.

a little worried about how to ensure a proper plug-in with ComponentManager, as we keep on adding component measurement traits and different types of transformers.

📌 Related Issue

🔍 Type of Change

✨ New Feature: Introducing new functionality
🧹 Refactor: Code changes that neither fix a bug nor add a feature

🚀 Implementation Details

ManagerInterface works as a decorator interface. it takes care of all interactions with ComponentManager, so that ComponentManager can focus on creating/updating/removing components, while the transformers can be decoupled from how components are structured and solely deal with getting what they need, transforming it, and returning the results.

The current data structure is:

ComponentManager manages all stored data, and works as the interface for all data manipulations
Actual measurement arrays and their attributes are stored in FootprintsManager and TracesManager. (More to be added as we add noise arrays, residual arrays, etc.)
The actual source (the fluorescing thing) of the measurements (footprint and trace) is either a Neuron or a Background object
The source and measurement array are linked by SourceID-ArrayIndex, where IDs are generated by Python's id function.
These links are dynamically managed by the ComponentManager.

The proposed workflow is:

new_frame comes in
it wants to reach a relevant River Transformer that will turn it into usable data
before it gets to a Transformer, it hits ManagerInterface, which wraps around the Transformer.
ManagerInterface grabs both ComponentManager and new_frame
ManagerInterface looks at the Transformer and determines what already-collected data it needs from ComponentManager
ManagerInterface relays only the relevant data to the Transformer (e.g. FootprintInitializer)
This allows the River Transformer to only do the standard learn and transform
the transformed result is returned to ManagerInterface
the ManagerInterface knows what kind of data the Transformer returned, and asks ComponentManager to update its components.
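
A minimal sketch of the workflow above, for illustration only. Every name here (`ToyComponentManager`, `ToyTracesInitializer`, `ToyManagerInterface`, the `requires`/`produces` attributes, and the `step` method) is hypothetical and not taken from the PR; the real ManagerInterface is a decorator and the real transformers follow River's interface, which this toy only loosely imitates.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ToyComponentManager:
    """Hypothetical stand-in for ComponentManager: a named store of data."""

    store: dict = field(default_factory=dict)

    def get(self, name: str) -> Any:
        return self.store.get(name)

    def update(self, name: str, value: Any) -> None:
        self.store[name] = value


class ToyTracesInitializer:
    """Hypothetical transformer: footprints + frame -> one trace value per footprint."""

    requires = ("footprints",)  # what it needs from the manager (assumed attribute)
    produces = "traces"         # what kind of data it returns (assumed attribute)

    def learn_one(self, footprints, frame):
        # Weighted sum of the frame's pixels under each footprint.
        self.traces_ = [sum(w * p for w, p in zip(fp, frame)) for fp in footprints]
        return self

    def transform_one(self):
        return self.traces_


class ToyManagerInterface:
    """Sits between the manager and the transformer, so neither knows the other."""

    def __init__(self, transformer):
        self.transformer = transformer

    def step(self, manager, frame):
        # 1. Pull only the data the transformer declares it needs.
        needed = [manager.get(name) for name in self.transformer.requires]
        # 2. Let the transformer do its plain learn and transform.
        self.transformer.learn_one(*needed, frame)
        result = self.transformer.transform_one()
        # 3. Hand the result back to the manager under the declared name.
        manager.update(self.transformer.produces, result)
        return result


manager = ToyComponentManager({"footprints": [[1.0, 0.0, 0.0], [0.0, 0.5, 0.5]]})
interface = ToyManagerInterface(ToyTracesInitializer())
traces = interface.step(manager, frame=[2.0, 4.0, 6.0])
```

In this sketch the transformer never touches the manager, and the manager never learns which transformer produced the traces; only the interface knows both sides.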

🧪 Testing

Ran unit tests
Ran integration tests
Performed manual testing
Updated existing tests

🛠️ Dependencies

✅ Checklist

My code follows the project's style guidelines
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

🔗 Additional Resources

feat: manager_interface.py now takes care of interfacing with componentmanager.

codecov · 2025-03-01T00:17:23Z

Codecov Report

Attention: Patch coverage is 69.16667% with 37 lines in your changes missing coverage. Please review.

Project coverage is 89.93%. Comparing base (486c0dc) to head (95ee5aa).

Files with missing lines	Patch %	Lines
src/cala/streaming/initialization/pixel_stats.py	0.00%	36 Missing ⚠️
...cala/streaming/initialization/manager_interface.py	98.03%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #52      +/-   ##
==========================================
- Coverage   91.90%   89.93%   -1.97%     
==========================================
  Files          55       57       +2     
  Lines        1420     1510      +90     
==========================================
+ Hits         1305     1358      +53     
- Misses        115      152      +37

Flag	Coverage Δ
unittests	`89.93% <69.16%> (-1.97%)`	⬇️

sneakers-the-rat · 2025-03-03T21:54:20Z

just read the PR description, haven't read the code yet, but one initial reaction:

The source and measurement array are linked by SourceID-ArrayIndex, where IDs are generated by Python's id function.

The id function just produces the memory location that identifies that particular python object instance. this would not be durable e.g. between processes or runs. Content addressing is good, and is what you should use here - you want to make a unique identifier that can be used to reference data in a portable way, a hash is the best answer for this. whether you want to hash the array contents or just the metadata is a design question that depends on 'which are the parts that should be used to identify the data chunk'. let me read
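
To make the contrast concrete, here is a small standard-library sketch of content addressing. The `content_id` helper and its `metadata` parameter are hypothetical, not part of the PR; the choice of SHA-256 and of what gets hashed is an assumption made only for illustration.

```python
import hashlib
from array import array
from typing import Mapping, Optional


def content_id(payload: bytes, metadata: Optional[Mapping[str, str]] = None) -> str:
    """Hypothetical helper: derive an identifier from content, not memory location."""
    digest = hashlib.sha256()
    digest.update(payload)
    # Sort the keys so the same metadata always hashes the same way.
    for key in sorted(metadata or {}):
        digest.update(f"{key}={metadata[key]};".encode("utf-8"))
    return digest.hexdigest()


# Two distinct objects holding identical data.
a = array("f", [0.0, 1.0, 2.0])
b = array("f", [0.0, 1.0, 2.0])

same_content = content_id(a.tobytes()) == content_id(b.tobytes())  # equal hashes
same_object = id(a) == id(b)  # different live objects, so different ids
```

The hash is the same in any process or run that sees the same bytes, whereas `id` only distinguishes live objects within one interpreter session. Whether the metadata, the array contents, or both go into the hash is the design question raised above.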

raymondwjang · 2025-03-03T21:56:09Z

ohh that makes sense. i was wondering how it was coming up with the integer numbers

sneakers-the-rat

I am not exactly sure what i am reviewing for here, this seems fine?

sneakers-the-rat · 2025-03-03T21:57:43Z

src/cala/streaming/initialization/manager_interface.py

+
+
+@dataclass
+class SpatialInitializationResult:


give these a parent class - the main thing you want from Results classes is a consistent interface. so it seems like atm these are mutually exclusive complements: spatial initialization is x/y coords (right?) and temporal is timeseries. what about the other kinds of initializer types? how will you know how to combine them? will each produce a different kind of output, or will some produce overlapping types with the current result types? having a generic result class that has empty values for each of the possible kinds of things would be better than each initializer type having its own result class
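
One possible shape for such a generic result class, as a sketch only. The class name `InitializationResult`, its field names, and the `merge` method are all hypothetical; the PR does not define them.

```python
from dataclasses import dataclass, fields
from typing import Optional, Sequence


@dataclass
class InitializationResult:
    """Hypothetical single result type: each initializer fills only its own slots."""

    footprints: Optional[Sequence] = None
    traces: Optional[Sequence] = None
    pixel_stats: Optional[Sequence] = None

    def merge(self, other: "InitializationResult") -> "InitializationResult":
        """Combine two results; a slot that is set in `other` takes precedence."""
        merged = {}
        for f in fields(self):
            theirs = getattr(other, f.name)
            merged[f.name] = theirs if theirs is not None else getattr(self, f.name)
        return InitializationResult(**merged)


spatial = InitializationResult(footprints=[[1, 0], [0, 1]])
temporal = InitializationResult(traces=[[0.1, 0.2], [0.3, 0.4]])
combined = spatial.merge(temporal)
```

With one class, combining the outputs of different initializers is a field-by-field merge, and a new kind of output is a new optional field rather than a new result type.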

sneakers-the-rat · 2025-03-03T21:59:33Z

src/cala/streaming/initialization/manager_interface.py

+            def learn_one(self, components: ComponentManager, X: xr.DataArray) -> T:
+                """Learn step extracts needed data from manager and passes to transformer."""
+                match initializer_type:
+                    case InitializerType.SPATIAL:


yeah immediately see the problems from above^. if each results class is unique, you need wrappers like this. presumably different kinds of transformers take different types of data, so having one results class and then always passing that one class to the transformers seems less fragile than this

raymondwjang · Mar 4, 2025

i think what you're suggesting might be what i had originally. at first there was no decorator and i was just adding/subtracting components directly inside the transformers, but i didn't like how transformers were not only doing the raw transformations, but also taking on interfacing with the data structure.

sneakers-the-rat · 2025-03-03T22:02:01Z

src/cala/streaming/initialization/manager_interface.py

+    """
+
+    def decorator(transformer_class: Type[T]) -> Type[T]:
+        class ManagerWrappedTransformer(cast(type, transformer_class)):


I think that without generic __getattr__s that forward attribute access on to the wrapped class, you're going to get into a pretty tricky spot pretty quick
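
For reference, a sketch of what generic attribute forwarding could look like for a wrapper that holds the transformer by composition. This is an assumption about the intended pattern, not the PR's code (which subclasses the transformer class); `ForwardingWrapper` and `ToyTransformer` are hypothetical names.

```python
class ToyTransformer:
    """Hypothetical transformer with some state and a method."""

    def __init__(self):
        self.n_components = 3

    def describe(self):
        return f"{self.n_components} components"


class ForwardingWrapper:
    """Hypothetical wrapper that forwards unknown attributes to the wrapped object."""

    def __init__(self, wrapped):
        self._wrapped = wrapped

    def __getattr__(self, name):
        # __getattr__ only runs when normal attribute lookup has already failed.
        if name == "_wrapped":
            # Avoid infinite recursion if the wrapper is not fully initialized.
            raise AttributeError(name)
        return getattr(self._wrapped, name)


wrapped = ForwardingWrapper(ToyTransformer())
count = wrapped.n_components    # forwarded attribute access
summary = wrapped.describe()    # forwarded method call
```

Anything the wrapper defines itself is found first by normal lookup; everything else falls through to the wrapped transformer.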

sneakers-the-rat · 2025-03-03T22:03:38Z

src/cala/streaming/initialization/pixel_stats.py

+    spatial_axes: tuple = ("height", "width")
+    """Spatial axes for pixel statistics"""
+
+    def validate(self):


do this as a __post_init__ or you'll inevitably forget to call it (and needing to call it is a pain in the ass)
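
A sketch of the suggested change. The class name `PixelStatsParams` and the specific checks are hypothetical, since the body of the original `validate` method is not shown here; only the `spatial_axes` field comes from the diff.

```python
from dataclasses import dataclass


@dataclass
class PixelStatsParams:
    """Hypothetical parameter class; validation runs automatically on construction."""

    spatial_axes: tuple = ("height", "width")
    """Spatial axes for pixel statistics"""

    def __post_init__(self):
        # dataclasses call __post_init__ at the end of the generated __init__,
        # so there is no separate validate() call to forget.
        if len(self.spatial_axes) != 2:
            raise ValueError("spatial_axes must name exactly two axes")
        if len(set(self.spatial_axes)) != len(self.spatial_axes):
            raise ValueError("spatial_axes must not repeat an axis name")


params = PixelStatsParams()  # defaults pass validation

try:
    PixelStatsParams(spatial_axes=("height", "height"))
    rejected = False
except ValueError:
    rejected = True
```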

raymondwjang added 2 commits February 28, 2025 15:29

refactor: single responsibility for initializers

6c109f6

feat: manager_interface.py now takes care of interfacing with componentmanager.

feat: init real cnmf-e related objects

95ee5aa

raymondwjang requested review from sneakers-the-rat and daharoni March 1, 2025 00:13

raymondwjang mentioned this pull request Mar 1, 2025

Interface Between Data Structure and Transformers #53

Closed

raymondwjang marked this pull request as ready for review March 1, 2025 00:58

raymondwjang marked this pull request as draft March 1, 2025 00:59

doc: update package docs

5e97cbb

sneakers-the-rat reviewed Mar 3, 2025

raymondwjang added 18 commits March 3, 2025 14:08

docs: update blueprint

a3d45cb

refactor: rename spatial -> footprint, temporal -> traces

a3841d8

debug: axis name handling

ddbce0f

debug: buffered initialization for plugging into a streaming format

0bac7ad

feat: use a runner instead of decorator

7fed9f1

feat: consolidate type decl/def for runners, transformers, components…

cf37214

… into a single file

chore: remove currently unused files

e3eee80

refactor: rename folders

52ea280

feat: centralize and define footprint/s, trace/s types

ee741aa

tests: path changes

88fa059

refactor: rename to storemanager

0160f42

💀: refactoring design complete. missed the commit moment and now all …

0f63378

…tests need to be updated.

feat: make create_many method for batch component generation

f5be35c

refactor: align component type keys with the actual class names

7f13cce

refactor: build collect method for the initialization steps (no updat…

e387848

…es or removals)

debug: remove deprecated type usage

4cc60e3

debug: remove deprecated type usage. fix typos

76af81a

chore: restructure test folder

0333457

raymondwjang added 15 commits March 6, 2025 03:58

test: test_runner_initialization passing

a52a920

tests: test_runner_dependency_resolution passing

478afbb

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_cyclic_dependency_detection passing

1d90fa3

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_runner.py passes

2d66661

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_pipe_config.py passes

639965f

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_types.py passing

4314952

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_footprints.py passing

8543ffa

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_traces.py passes

713a195

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_meta.py passes

59e524b

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_registry.py passing

11699e1

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_footprints.py enhanced

1f0ef17

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_meta.py changed to class based

6e0f3cf

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_types.py changed to class based

278004d

Signed-off-by: Raymond W. Chang <[email protected]>

tests: test_traces.py changed to class based

dd77dca

Signed-off-by: Raymond W. Chang <[email protected]>

tests: prepare to overhaul 4 test modules

e34d9a1

Signed-off-by: Raymond W. Chang <[email protected]>

raymondwjang marked this pull request as ready for review March 7, 2025 01:25

raymondwjang merged commit 8695b60 into main Mar 7, 2025
0 of 2 checks passed

raymondwjang linked an issue Mar 7, 2025 that may be closed by this pull request

Interface Between Data Structure and Transformers #53

Closed

raymondwjang deleted the raymondwchang/streaming-init-cnmfe-attrs branch March 21, 2025 19:07
