Make StereoCombiner configurable in ApplyModels #2724

Hckjs · 2025-03-24T13:27:41Z

Since the loaded joblib-pickled Reconstructors in ApplyModels

ctapipe/src/ctapipe/tools/apply_models.py

Line 152 in e5999e2

r = Reconstructor.read(path, parent=self, subarray=self.loader.subarray)

are already instantiated, one cannot change their traits via the config system, which leads to a not configurable StereoCombiner.

Instantiating a new Reconstructor.from_name(parent=self) and handing over its StereoCombiner would solve this Problem.

Fixes #2720

- Init new reconstructor with parent=self - Not working yet with DispReconstructor bc of different arguments

ctao-dpps-sonarqube · 2025-03-24T13:49:14Z

Analysis Details

0 Issues

0 Bugs
0 Vulnerabilities
0 Code Smells

Coverage and Duplications

100.00% Coverage (94.20% Estimated after merge)
0.00% Duplicated Code (0.70% Estimated after merge)

Project ID: cta-observatory_ctapipe_AY52EYhuvuGcMFidNyUs

View in SonarQube

maxnoe · 2025-03-24T14:00:08Z

src/ctapipe/tools/apply_models.py

@@ -149,7 +149,25 @@ def setup(self):

        self._reconstructors = []
        for path in self.reconstructor_paths:
-            r = Reconstructor.read(path, parent=self, subarray=self.loader.subarray)
+            r = Reconstructor.read(


This seems very hacky... Isn't there a better solution for this?

maxnoe · 2025-03-24T14:01:47Z

I think we should rather re-evaluate how we store the models on disk / read them back.

Pickling the whole Reconstructor instances is somewhat nasty. We could also think about excluding the StereoCombiner explicitly from the pickle or add an explicit configuration parameter for overriding the StereoCombiner per Reconstructor in apply-models.

Hckjs · 2025-03-25T16:16:58Z

I think we should rather re-evaluate how we store the models on disk / read them back.

Pickling the whole Reconstructor instances is somewhat nasty. We could also think about excluding the StereoCombiner explicitly from the pickle or add an explicit configuration parameter for overriding the StereoCombiner per Reconstructor in apply-models.

One could maybe allow_none for the stereo_combiner_cls traits in the Reconstructor subclasses and add a

    def _init_stereo_combiner(self, parent=None, overwrite=False):
        if self.stereo_combiner is None or overwrite:
            self.stereo_combiner = StereoCombiner.from_name(
                self.stereo_combiner_cls,
                prefix=self.prefix,
                property=self.property,
                parent=parent,
            )

which is called on its __init__, making it possible to not store the StereoCombiner but instead init the Combiner with

r._init_stereo_combiner(parent=self, overwrite=self.overwrite_stereo_combiner)

in ApplyModels. This would also allow to overwrite it.

Can't really think of a better solution yet...

kosack · 2025-03-28T10:43:46Z

I never liked that the whole reconstructor was pickled in the first place - it's easy, but also makes it very hard to customize how we store the models in the future, i.e. if we want to use some standard format. Decoupling how we store the models from the Reconstructor would be the best way, but that also requires a bit of thought and a lot of refactoring. E.g. how to know which reconstructor to construct before loading the model? How to store the parameter list?

One way would be to change what is actually serialized in Reconstructor.write() to be just the model and class name rather than the whole thing, and then in load() constructing a new class from the name and setting the model. Perhaps that could be done by using Reconstructor.__get_state__(), Reconstructr.__set_state__() (which is the protocol for setting what gets pickled in a class), but I'm not sure.

LukasBeiske · 2025-04-03T15:14:37Z

I never liked that the whole reconstructor was pickled in the first place - it's easy, but also makes it very hard to customize how we store the models in the future, i.e. if we want to use some standard format. Decoupling how we store the models from the Reconstructor would be the best way, but that also requires a bit of thought and a lot of refactoring. E.g. how to know which reconstructor to construct before loading the model? How to store the parameter list?

One way would be to change what is actually serialized in Reconstructor.write() to be just the model and class name rather than the whole thing, and then in load() constructing a new class from the name and setting the model. Perhaps that could be done by using Reconstructor.__get_state__(), Reconstructr.__set_state__() (which is the protocol for setting what gets pickled in a class), but I'm not sure.

How about, instead of pickling the whole Reconstructor, we pickle a dictionary containing the class name, the model(s), and all the other necessary configuration options.
Since some of these config options (e.g. log_target for regressors) should not be overwritten when applying the model, we would have to add checks for this when "loading" the model, but otherwise constructing a new class with the models and config options from that dictionary should not be a problem, right?
The other option would be to prevent overwriting any config options by default and add explicit config options to ctapipe-apply-models to change e.g. the StereoCombiner, like Max suggested.

I know, that this is not much different from pickling the whole Reconstructor. This also would not completely decouple the storing of the models from the reconstructor, since the content of this dictionary would differ for different subclasses of Reconstructor, but it would be a start and solve this issue for now.
Maybe it could later be replaced by a class defining the output of all the training tools similar to OptimizationResult for the cut optimization tool.

maxnoe · 2025-04-04T09:47:22Z

The dict is good I think, like that we can also attach a meta filed where we put in the reference metadata.

LukasBeiske · 2025-04-04T15:42:06Z

The dict is good I think, like that we can also attach a meta filed where we put in the reference metadata.

Ok, I'll get on this and open a second PR, since this is a bit wider in scope than this PR.

maxnoe · 2025-04-04T16:25:04Z

Maybe it's also time to give https://onnx.ai/sklearn-onnx/ a shot again and not rely on pickle.

Hckjs added 2 commits March 21, 2025 17:12

Overwrite stereo_combiner of loaded reconstrucor

b300837

- Init new reconstructor with parent=self - Not working yet with DispReconstructor bc of different arguments

Make StereoCombiner configurable in ApplyModels

67d91fc

Hckjs requested review from maxnoe, kosack and LukasBeiske March 24, 2025 13:27

Adding changelog

9ae85ca

Hckjs added the bug label Mar 24, 2025

This comment has been minimized.

Sign in to view

maxnoe reviewed Mar 24, 2025

View reviewed changes

LukasBeiske mentioned this pull request Jun 12, 2025

Rework ml model serialization #2773

Draft

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make StereoCombiner configurable in ApplyModels #2724

Make StereoCombiner configurable in ApplyModels #2724

Uh oh!

Hckjs commented Mar 24, 2025

Uh oh!

This comment has been minimized.

ctao-dpps-sonarqube bot commented Mar 24, 2025

Uh oh!

maxnoe Mar 24, 2025

Uh oh!

maxnoe commented Mar 24, 2025

Uh oh!

Hckjs commented Mar 25, 2025

Uh oh!

kosack commented Mar 28, 2025 •

edited

Loading

Uh oh!

LukasBeiske commented Apr 3, 2025 •

edited

Loading

Uh oh!

maxnoe commented Apr 4, 2025

Uh oh!

LukasBeiske commented Apr 4, 2025

Uh oh!

maxnoe commented Apr 4, 2025

Uh oh!

Uh oh!

Make StereoCombiner configurable in ApplyModels #2724

Are you sure you want to change the base?

Make StereoCombiner configurable in ApplyModels #2724

Uh oh!

Conversation

Hckjs commented Mar 24, 2025

Uh oh!

This comment has been minimized.

ctao-dpps-sonarqube bot commented Mar 24, 2025

Analysis Details

0 Issues

Coverage and Duplications

Uh oh!

maxnoe Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

maxnoe commented Mar 24, 2025

Uh oh!

Hckjs commented Mar 25, 2025

Uh oh!

kosack commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LukasBeiske commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maxnoe commented Apr 4, 2025

Uh oh!

LukasBeiske commented Apr 4, 2025

Uh oh!

maxnoe commented Apr 4, 2025

Uh oh!

Uh oh!

kosack commented Mar 28, 2025 •

edited

Loading

LukasBeiske commented Apr 3, 2025 •

edited

Loading