Merged
…asic reverse diff and manual mat vec product.
eb8680 (Contributor) approved these changes on Jul 6, 2024 and left a comment:
Nice sleuthing! Great to see such big performance gains.
Addresses a memory issue stemming from `vmap` over `torch.func.jvp` in `MonteCarloInfluenceEstimator`. Instead, uses reverse-mode autodiff for the Jacobian of the functional (largely because the parameter dimensionality will typically far exceed the dimensionality of the functional) and then manually right-multiplies `param_eif` (the Fisher matrix × data log probability). The right multiplication is performed agnostically with respect to both pytree structures and tensor shapes (emulating `torch.func.jvp`, with slightly more generality, in fact). Memory use is orders of magnitude lower, to the point of not being noticeable.
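The strategy above (reverse-mode Jacobian once, then a pytree- and shape-agnostic right multiplication against a batch of tangents) can be sketched roughly as follows. The functional, parameter names, and batch here are hypothetical stand-ins, not the actual `MonteCarloInfluenceEstimator` code:

```python
import torch
from torch.func import jacrev

# Hypothetical scalar functional over a dict-structured parameter pytree.
def functional(params):
    return (params["w"] ** 2).sum() + params["b"].sum()

params = {"w": torch.randn(5), "b": torch.randn(3)}

# Reverse-mode Jacobian of the functional: cheap when the output
# dimensionality is far smaller than the parameter dimensionality.
jac = jacrev(functional)(params)  # pytree with the same structure as `params`

# Hypothetical batch of tangents (standing in for rows of `param_eif`),
# sharing the pytree structure of `params` with a leading batch dim.
batch = {"w": torch.randn(4, 5), "b": torch.randn(4, 3)}

def right_multiply(jac_tree, tangent_tree):
    """Contract each Jacobian leaf against the trailing (parameter) dims of
    the matching tangent leaf and sum across leaves, emulating a batched
    torch.func.jvp for a scalar-valued functional."""
    out = 0.0
    for key in jac_tree:
        j = jac_tree[key]          # shape: param_shape
        t = tangent_tree[key]      # shape: (batch, *param_shape)
        dims = list(range(1, t.ndim))
        out = out + (j * t).sum(dim=dims)
    return out                     # shape: (batch,)

jvp_batched = right_multiply(jac, batch)
```

Because the Jacobian is materialized once and the contraction is a plain elementwise-multiply-and-sum, no forward-mode tape or `vmap` state is held per batch element.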
One possible difference (and a possible cause of the original problem): the `vmap` over `jvp` was potentially estimating and computing the Jacobian separately for each batch in `param_eif`. This is very redundant, but it also meant each batch saw different randomness in the Jacobian estimate, thereby propagating some notion of variability in the Jacobian estimate to the user. This implementation estimates/computes the Jacobian only once for all batches in `param_eif`. This may or may not be desirable, but it's important to note that doing so separately for each batch comes at very high computational cost.

Adds tests for the alternative jvp implementation, including a test of memory consumption.