gh-131269: Avoid binding functions in random.py #131270

colesbury · 2025-03-15T01:00:43Z

This speeds up calls like random.randint() by about 10% and also avoids some scaling bottlenecks in the free threading build.

Issue: Avoid binding functions to temporaries in random.py #131269

This speeds up calls like `random.randint()` by about 10%.

picnixz

There are also some other places where we did from os import urandom as _urandom and used _urandom afterwards. Should this also be changed or is this is still a real optimisation? (namely, replace _urandom by os.urandom?)

colesbury · 2025-03-17T18:55:22Z

The from os import urandom as _urandom; _urandom() pattern may still be slightly faster than calls like os.urandom(). The difference is small enough that I would not use that pattern in new code, but I'm not sure it's worth changing existing code.

rhettinger · 2025-03-18T01:08:29Z

When I last looked at this a few months ago, the speed-up was questionable (it varied quite a bit across builds, compilers, and operating systems). The interpreter implementation is in rapid flux. For a long time, the pre-binding was faster. Then the LOAD_ATTR optimizations pulled ahead, but there is no reason think that advantage will persist. (We has a similar situation where LOAD_GLOBAL briefly became as fast as LOAD_LOCAL which seemed to invalidate previous efforts to use local variables). I would rather not "chase the interpreter" into a local minimum. and keep the current logic intact until the interpreter stablizes. Experience PyPy showing that prebinding could maintain its advantage even with JITted code.

Ultimately, I expect that the current code would win out because it hoists some of the lookup logic outside of the loop. Right now, it adds a STORE_FAST opcode but the cost of that will drop to almost free as the impending JIT work moves forward.

Can you elaborate more on the the "free-threading bottlenecks"? When I last looked at this while working on KDE for the statistics module, I found that prebinding wasn't the problem. Instead, what was needed was separate instances of random.Random() so that there was no shared state.

Addenda: IIRC this was also discussed in the forums this year and Guido was against sweeping through and replacing bound methods in existing stable code.

Maintainer's note: I personally find the calls to random() to be more readable than self.random() sprinkled in the middle of complicated formulas. Unless this edit must be made, I greatly prefer the current code some of which I have recently written.

colesbury · 2025-03-18T16:09:59Z

Hi @rhettinger - here's an example that demonstrates the free threading bottleneck, adapted from Paul Moore's program: montecarlo.py. It's nearly twice as fast with this change. It already uses separate instances of random.Random().

The underlying bottleneck is due to two issues:

The expression getrandbits = self.getrandbits creates a new method object that increases the reference count of the shared getrandbits function. Even though the random.Random instances are separate, the function is global. The self.getrandbits() call avoids creating the temporary method object.
The _PyType_LookupRef call in LOAD_ATTR also temporarily increases the reference count of the shared function. The specialized LOAD_ATTR_METHOD avoids this.

I think it's possible to change the implementation to avoid both of these issues, but that will be complicated and require new techniques.

The single threaded improvement is mostly related to avoiding the creation of the temporary method object and the bookkeeping that involves. I've seen an improvement for Python from to 3.9 to 3.14.

I agree that with a sufficiently advanced JIT the existing code may perform the same as the existing code, but I don't think we're there yet. I also agree that we don't want to make sweeping changes across the code base. I think this change is pretty small and targeted and the new code is still idiomatic Python.

rhettinger · 2025-03-18T17:25:24Z

How about making only the edit to randbelow methods (as proposed in the issue) and not sweeping through the rest of the module? I'm really uncomfortable with the latter. Unless there is compelling urgency for making all of these edits, we can discuss those at the sprints.

Also ISTM the monte carlo benchmark is a toy example and we shouldn't tune to it. Benchmark chasing is rarely advisable and even more so in a time where the interpreter implementation undergoing so many changes.

colesbury · 2025-03-19T21:19:09Z

I've limited the changes to _randbelow_with_getrandbits

pythongh-131269: Avoid binding functions in random.py

c92ca1f

This speeds up calls like `random.randint()` by about 10%.

bedevere-app bot mentioned this pull request Mar 15, 2025

Avoid binding functions to temporaries in random.py #131269

Closed

colesbury added the skip news label Mar 15, 2025

colesbury marked this pull request as ready for review March 17, 2025 13:20

colesbury requested a review from rhettinger as a code owner March 17, 2025 13:20

bedevere-app bot added the awaiting core review label Mar 17, 2025

colesbury requested a review from mpage March 17, 2025 13:21

mpage approved these changes Mar 17, 2025

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting core review labels Mar 17, 2025

picnixz approved these changes Mar 17, 2025

View reviewed changes

rhettinger self-assigned this Mar 18, 2025

rhettinger requested a review from tim-one March 18, 2025 01:09

Limit change to _randbelow_with_getrandbits

a0a9026

rhettinger approved these changes Mar 20, 2025

View reviewed changes

rhettinger merged commit 844765b into python:main Mar 20, 2025
38 checks passed

bedevere-app bot removed the awaiting merge label Mar 20, 2025

colesbury deleted the gh-131269-random branch March 27, 2025 20:10

seehwan pushed a commit to seehwan/cpython that referenced this pull request Apr 16, 2025

pythongh-131269: Minor optimization in random.py (python#131270)

40ae14a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-131269: Avoid binding functions in random.py #131270

gh-131269: Avoid binding functions in random.py #131270

Uh oh!

colesbury commented Mar 15, 2025 •

edited

Loading

Uh oh!

picnixz left a comment

Uh oh!

colesbury commented Mar 17, 2025

Uh oh!

rhettinger commented Mar 18, 2025 •

edited

Loading

Uh oh!

colesbury commented Mar 18, 2025

Uh oh!

rhettinger commented Mar 18, 2025 •

edited

Loading

Uh oh!

colesbury commented Mar 19, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gh-131269: Avoid binding functions in random.py #131270

gh-131269: Avoid binding functions in random.py #131270

Uh oh!

Conversation

colesbury commented Mar 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

picnixz left a comment

Choose a reason for hiding this comment

Uh oh!

colesbury commented Mar 17, 2025

Uh oh!

rhettinger commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

colesbury commented Mar 18, 2025

Uh oh!

rhettinger commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

colesbury commented Mar 19, 2025

Uh oh!

Uh oh!

Uh oh!

colesbury commented Mar 15, 2025 •

edited

Loading

rhettinger commented Mar 18, 2025 •

edited

Loading

rhettinger commented Mar 18, 2025 •

edited

Loading