DM-43712: Add configurable buffer for template and refcat preload #204

hsinfang · 2024-09-30T19:39:40Z

No description provided.

kfindeisen

Many thanks for the well-organized changes! However, I'm a bit worried that they're eroding the (already shaky) coherence of MiddlewareInterface, specifically with the region state variable and the splitting of queries across multiple _filter_datasets calls. I tried to offer some specific suggestions, but I'd like to take another look at the final result.

python/activator/middleware_interface.py

tests/test_middleware_interface.py

python/activator/middleware_interface.py

etc/export_comCamSim.yaml

python/activator/middleware_interface.py

.github/workflows/build-service.yml

python/activator/middleware_interface.py

kfindeisen

Looks much better, thanks! But please make sure the code doesn't crash when there's no region.

python/activator/middleware_interface.py

kfindeisen · 2024-10-18T18:08:08Z

tests/test_middleware_interface.py

+    def test_compute_region(self):
+        """Test preload region computation."""
+        region = self.interface._compute_region()
+        self.assertTrue(isinstance(region, lsst.sphgeom.Region))


Since the tests don't include pointing/slew errors, the nextVisit boresight and exposure boresight are identical. Is it possible to test the values based on that?

I thought of that, but was not sure how to get the "truth" that is not from the exactly same code as in _compute_region(). Do you mean a comparison with a directly instantiated polygon?

I was thinking of using the visit-detector records, with matched padding. That should give similar results, shouldn't it?

I'm taking too long to make it work, so I decided to file a new ticket DM-47460 for it instead of delaying this PR.

I moved this test to MiddlewareInterfaceWriteableTest where visit-detector records are available (after ingesting raws and defining visits).

python/activator/middleware_interface.py

kfindeisen · 2024-10-18T19:24:16Z

tests/test_middleware_interface.py

+                src_butler.query_datasets.assert_called_once_with(
+                    "bias", instrument="LSSTComCamSim", explain=False)
+                existing_butler.query_datasets.assert_called_once_with(
+                    "bias", instrument="LSSTComCamSim", explain=False)


The explain=False is an implementation detail. Unfortunately, I don't know how to write a test without it other than inspecting Mock.call_args directly.

This should have been together with 674a4b1 Unlike the old DECam test dataset, the new test LSSTComCamSim dataset does not have crosstalk. So, use another dataset type to test. Also fix a time stamp missed in the previous commit.

The optimal value likely depends on the instrument and the skymap choice.

Instead of determining the instrument's wcsFlipX here, it is more robust to use its formatter, which knows its camera orientation from its obs package.

This changes the APIs of _export_skymap_and_templates and _export_refcats to take a lsst.sphgeom.Region directly. Note that the centroid of the region is not the same of the detector center, but it should not matter. Because htm7 can be too coarse compared to the patch size, using htm7 indices to search for templates may lead to preloading more patches than necessary and wasting time. This feature of using htm7 to search for overlapping templates is also about to be deprecated and replaced by the arbitrary spatial region query in Butler. The usage will be replaced when switching to the new butler query system.

This reduces the number of calib dataset types to loop through.

Querying butler one dataset type at a time is not necessary with the butler.registry.queryDatasets. But this is a preparation step before switching to the new query system which can only queries one dataset type at a time. Currently we can preload more types of calibs/refcats/templates than the actual pipelines really need. It's possible that some types are not preloaded but it's okay.

The new Butler query systems supports spatial-constraint query via lsst.sphgeom.Region directly. With this change, we use it in template and refcat search. This needs stack w_2024_38 or newer. make_export.py uses _filter_datasets so it needs to adjust to the new underlying API too.

Some unit tests were temporarily marked expectedFailure in 674a4b1. Now that we switch to the new query system, make them work again. The test repo was put together using middleware tools, which intrinsically uses butler repo's visit-detector regions with its padding from defineVisits config. That padding config is not the same as the preload region padding in prompt processing. This explains the patch differences in template selection.

kfindeisen requested changes Oct 2, 2024

View reviewed changes

hsinfang force-pushed the tickets/DM-43712 branch 3 times, most recently from 6839a4b to 5653980 Compare October 4, 2024 17:06

hsinfang force-pushed the tickets/DM-43712 branch 14 times, most recently from 319061d to 423db09 Compare October 17, 2024 18:44

kfindeisen approved these changes Oct 18, 2024

View reviewed changes

hsinfang force-pushed the tickets/DM-43712 branch from 423db09 to 2117f98 Compare October 22, 2024 17:55

kfindeisen mentioned this pull request Nov 5, 2024

DM-47387: Prompt Processing can't handle blank filters in nextVisit message #215

Merged

hsinfang force-pushed the tickets/DM-43712 branch 5 times, most recently from 2fb0771 to fca73e8 Compare November 7, 2024 23:23

hsinfang added 4 commits November 12, 2024 11:05

Update unit tests with the test data change

fd5b828

This should have been together with 674a4b1 Unlike the old DECam test dataset, the new test LSSTComCamSim dataset does not have crosstalk. So, use another dataset type to test. Also fix a time stamp missed in the previous commit.

Fix a missing space

c9b5dcd

Pad the detector region in the template search

432ed91

Make the preload padding configurable at the service level

e576636

The optimal value likely depends on the instrument and the skymap choice.

hsinfang added 9 commits November 12, 2024 11:05

Use the instrument's formatter to get the sky wcs

f8c0c3f

Instead of determining the instrument's wcsFlipX here, it is more robust to use its formatter, which knows its camera orientation from its obs package.

Use the actual template coadd types instead of wildcarding

7f3e400

Factor out the region computation

1fd83f7

Warn if preload region padding is smaller than defineVisits padding

59e1ae1

Constrain the calib types to those existing in the collection

58e8b8a

This reduces the number of calib dataset types to loop through.

hsinfang force-pushed the tickets/DM-43712 branch from fca73e8 to 7a4a239 Compare November 12, 2024 19:05

hsinfang merged commit ba0cfd5 into main Nov 13, 2024
6 of 8 checks passed

hsinfang deleted the tickets/DM-43712 branch November 13, 2024 00:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DM-43712: Add configurable buffer for template and refcat preload #204

DM-43712: Add configurable buffer for template and refcat preload #204

hsinfang commented Sep 30, 2024

kfindeisen left a comment •

edited

Loading

kfindeisen left a comment

kfindeisen Oct 18, 2024

hsinfang Oct 21, 2024

kfindeisen Oct 21, 2024

hsinfang Nov 7, 2024

kfindeisen Oct 18, 2024 •

edited

Loading

DM-43712: Add configurable buffer for template and refcat preload #204

DM-43712: Add configurable buffer for template and refcat preload #204

Conversation

hsinfang commented Sep 30, 2024

kfindeisen left a comment • edited Loading

Choose a reason for hiding this comment

kfindeisen left a comment

Choose a reason for hiding this comment

kfindeisen Oct 18, 2024

Choose a reason for hiding this comment

hsinfang Oct 21, 2024

Choose a reason for hiding this comment

kfindeisen Oct 21, 2024

Choose a reason for hiding this comment

hsinfang Nov 7, 2024

Choose a reason for hiding this comment

kfindeisen Oct 18, 2024 • edited Loading

Choose a reason for hiding this comment

kfindeisen left a comment •

edited

Loading

kfindeisen Oct 18, 2024 •

edited

Loading