-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix progress bar error when nested CompoundStep
samplers are assigned
#7730
base: main
Are you sure you want to change the base?
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #7730 +/- ##
==========================================
- Coverage 92.82% 92.82% -0.01%
==========================================
Files 107 107
Lines 18324 18322 -2
==========================================
- Hits 17010 17007 -3
- Misses 1314 1315 +1
🚀 New features to boost your workflow:
|
Can we have a smoke test that tries a bunch of step samplers for like 10 tune, and which also acts as a regression test for the issue? |
I added a solution to the bug, but I think it might be over-engineered. Basically the problem is that once we're inside the sampling loop, we don't have information about which step generated which stats. I assumed the stats could just be paired up with step samplers positionally, but that assumption is broken when there are nested CompoundSteps -- the nesting structure requires that we know more about which stats came from where. My solution was to add a Maybe I'm missing a simpler solution? Hoping this one at least starts a convo to get to somewhere better. |
I don't quite get the original problem to know if the solution is reasonable. Sounds to me like the CompoundStep should aggregate the stats for display, so the outer CompoundStep would only see two 2 entries , from NUTS and the inner CompoundStep? |
The update to the displayed statistics is done here. The progress manager has access to the return from Currently, the logic for a CompoundStep is to loop over the steps it contains, and apply each stats update function. So the If the list of steps is "flat", for example there is only one joint sampler (e.g. NUTS) or if every variable has its own independent step (e.g. Metropolis with blocking=False), this logic works, because the list step methods held by the outer-most CompoundStep will be aligned with the flat stats lists. See here for where this happens. If, however, the CompoundStep itself holds another CompoundStep, this looping |
Where does this flattening of stats happen? |
I think it's here, due to the use of |
#7721 reports an error in the presence of nested
CompoundStep
. Here's a prettier version of what pymc gives for the example in that issue:So there are 4 steps, but there's a compound step on the outside and on the inside. At each step, we get a flat list of 4 dictionaries holding statistics for each step. Currently, the logic for updating the progress bars makes the assumption that the list of step statistics returned at each step matches the list of step samplers. It was assumed that, if there is a compound step, there should only be one, so it can zip over the steps. Here is the display stat update for
CompoundStep
:The problem is that if there's a nested structure, one of the
udpate_fns
will do this loop again. Sincestep_stats
does not have the same nested structure, it ends up iterating over dictionary keys and raising the error.Open to suggestions on how to proceed, because the solution isn't obvious. Opened this as a draft PR to address the problem ASAP, since its been reported 3 times now.
Description
Related Issue
get_domain_of_finite_discrete_rv
ofCategorical
pymc-extras#331Checklist
Type of change
📚 Documentation preview 📚: https://pymc--7730.org.readthedocs.build/en/7730/