Optimize volume_montecarlo performance #5124

fvngs · 2025-05-10T19:28:40Z

Improved array operations by pre-allocating arrays and reducing concatenations

Standard information about the request

A standard efficiency upgrade for the volume_montecarlo function.
It doesn't change the output of the function at all.

This change affects: the offline search, PyGRB
This change: follows style guidelines (See e.g. PEP8), has been proposed using the contribution guidelines

Motivation

I saw the function was using a lot of concat calls which could be reduced, as it can be a substantial performance bottleneck.

Testing performed

Size	Original (s)	Optimized (s)	Speedup
100	0.000651	0.000357	1.82x
1000	0.001938	0.000629	3.08x
10000	0.023510	0.007141	3.29x
100000	0.323696	0.057062	5.67x

The author of this pull request confirms they will adhere to the code of conduct

Improved array operations by pre-allocating arrays and reducing concatenations

pannarale · 2025-05-14T06:16:17Z

This does not affect PyGRB, as far as I know.

tdent · 2025-05-14T11:20:08Z

pycbc/sensitivity.py

    if distribution_param == 'distance':
-        found_weights = found_d ** d_power
-        missed_weights = missed_d ** d_power
+        found_weights = numpy.power(found_d, d_power)


I find this in the documentation for numpy power:
The ** operator can be used as a shorthand for np.power on ndarrays

Hence, I do not believe this change to use numpy.power can have any effect on performance, and it reduces readability IMO. Please revert these specific changes unless they are shown to significantly improve performance.

tdent · 2025-05-14T11:21:39Z

pycbc/sensitivity.py

    montecarlo_vtot = (4. / 3.) * numpy.pi * max_distance**3.

-    # arrays of weights for the MC integral
+    # Calculate weights based on distribution parameters


Suggested change

# Calculate weights based on distribution parameters

# Calculate weights for the MC integral

'MC integral' is useful scientific/statistical context to understand what is happening

tdent · 2025-05-14T11:22:15Z

pycbc/sensitivity.py

-    # over injections covering the sphere
-    mc_weight_samples = numpy.concatenate((found_weights, 0 * missed_weights))
-    mc_sum = sum(mc_weight_samples)
+        found_weights = (numpy.power(found_d, d_power) * 


The comment has been removed here, please reinstate it.

tdent · 2025-05-14T11:24:46Z

pycbc/sensitivity.py

+                        numpy.power(found_mchirp, mchirp_power))
+        missed_weights = (numpy.power(missed_d, d_power) * 
+                         numpy.power(missed_mchirp, mchirp_power))
+


The final else statement with NotImplementedError has been removed. I don't see why. We want the code to raise prompt and informative errors ..

tdent · 2025-05-14T11:27:20Z

pycbc/sensitivity.py

+    mc_weight_samples = numpy.zeros(total_size)
+    mc_weight_samples[:len(found_weights)] = found_weights
+
+    # Calculate Monte Carlo statistics


The explanation of the calculation in the previous calculation has been removed and replaced by a generic non-informative comment. Please reinstate the previous comment.

tdent · 2025-05-14T11:29:15Z

pycbc/sensitivity.py


    if limits_param == 'distance':
-        mc_norm = sum(all_weights)
+        mc_norm = sum(numpy.concatenate((found_weights, missed_weights)))


If numpy.concatenate is inefficient, why is it being used here?

tdent · 2025-05-14T11:30:49Z

pycbc/sensitivity.py

+    # Calculate Monte Carlo statistics
+    mc_sum = numpy.sum(mc_weight_samples)
+    mc_sum_squares = numpy.sum(mc_weight_samples ** 2)
+    mc_sample_variance = (mc_sum_squares / len(mc_weight_samples) - 


This calculation does not seem to reproduce the previous calculation using 'Ninj' in all cases. In addition, len(mc_weight_samples) is just the same as total_size so I don't see why the existing variable is not used.

tdent

I do not see what is the overall strategy here. In one place a concatenate operation has been removed but the operation is added in more than one other place. Arbitrary changes from ** to numpy.power which should have no effect on performance are made. In addition several nontrivial comments have been removed.

tdent · 2025-05-14T11:34:59Z

What specific example function call was made to obtain the advertised speedups?

sensitivity: optimize volume_montecarlo performance

2d36531

Improved array operations by pre-allocating arrays and reducing concatenations

ahnitz requested review from tdent and pannarale May 10, 2025 19:32

ahnitz assigned tdent May 10, 2025

tdent reviewed May 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize volume_montecarlo performance #5124

Optimize volume_montecarlo performance #5124

Uh oh!

fvngs commented May 10, 2025 •

edited

Loading

Uh oh!

pannarale commented May 14, 2025

Uh oh!

tdent May 14, 2025

Uh oh!

tdent May 14, 2025

Uh oh!

tdent May 14, 2025

Uh oh!

tdent May 14, 2025

Uh oh!

tdent May 14, 2025

Uh oh!

tdent May 14, 2025

Uh oh!

tdent May 14, 2025 •

edited

Loading

Uh oh!

tdent left a comment

Uh oh!

tdent commented May 14, 2025

Uh oh!

Uh oh!

	# Calculate weights based on distribution parameters
	# Calculate weights for the MC integral

Optimize volume_montecarlo performance #5124

Are you sure you want to change the base?

Optimize volume_montecarlo performance #5124

Uh oh!

Conversation

fvngs commented May 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Standard information about the request

Motivation

Contents

Testing performed

Uh oh!

pannarale commented May 14, 2025

Uh oh!

tdent May 14, 2025

Choose a reason for hiding this comment

Uh oh!

tdent May 14, 2025

Choose a reason for hiding this comment

Uh oh!

tdent May 14, 2025

Choose a reason for hiding this comment

Uh oh!

tdent May 14, 2025

Choose a reason for hiding this comment

Uh oh!

tdent May 14, 2025

Choose a reason for hiding this comment

Uh oh!

tdent May 14, 2025

Choose a reason for hiding this comment

Uh oh!

tdent May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tdent left a comment

Choose a reason for hiding this comment

Uh oh!

tdent commented May 14, 2025

Uh oh!

Uh oh!

fvngs commented May 10, 2025 •

edited

Loading

tdent May 14, 2025 •

edited

Loading