Abandoned; moved to another branch (improvements to TorchForwardSimulator) #479

rileyjmurray · 2024-08-20T13:45:26Z

No description provided.

The creation of COPA layouts relies on a number of specialized circuit structures which require non-trivial time to construct. In the context of iterative GST estimation with nested circuit lists (i.e. the default) this results in unnecessarily repeat construction of these objects. This is an initial implementation of a caching scheme allowing for more efficient re-use of these circuit structures across iterations.

…layout-creation

Cache the expanded SPAM-free circuits to reduce recomputing things unnecessarily.

This updates the implementation of the SeparatePOVMCircuit containter class. The most important change is adding an attribute for the full_effect_labels that avoids uneeded reconstruction. To add protection then, to ensure that this is kept in sync with everything else, the povm_label and effect_labels attributes (which feed into full_effect_labels) have been promoted to properties with setters that ensure the full_effect_labels are kept synced.

Adds a new method to OpModel that allows for doing instrument expansion and povm expansion in bulk, speeding things up be avoiding recomputation of shared quantities. Also adds a pipeline for re-using completed or split circuits (as produced by the related OpModel methods) for more efficient re-use of done work.

Some minor performance oriented tweaks to the init for COPA layouts.

Refactor some of the ordered dictionaries in matrix layout creation into regular ones.

…layout-creation

Start adding infrastructure for caching things used in MDC store creation and for plumbing in stuff from layout creation.

Performance optimization for the method for adding omitted frequencies to incorporate caching of the number of outcomes per circuit (which is somewhat expensive since it goes through the instrument/povm expansion code). Additionally refactor some other parts of this code for improved efficiency. Also makes a few minor tweaks to the method for adding counts to speed that up as well. Can probably make this a bit faster still by merging the two calls to reduce redundancy, but that is a future us problem. Additionally make a few microoptimizations to the dataset code for grabbing counts, and to slicetools adding a function for directly giving a numpy array for a slice (instead of needing to cast from a list). Miscellaneous cleanup of old commented out code that doesn't appear needed any longer.

Fix a bug I introduced in dataset indexing into something that could be None.

Another minor bug caught by testing.

Not sure why this didn't get caught on the circuit update branch, but oh well...

…layout-creation

Fixes minor error in split_circuits.

Improve the performance of __getitem__ when indexing into static circuits by making use of the _copy_init code path.

Implement caching of circuit structures tailored to the map forward simulator's requirements.

This finishes the process of refactoring expand_instruments_and_separate_povm from a circuit method to a method of OpModel.

Refactor expand_instruments_and_separate_povm to use the multi-circuit version under the hood to reduce code duplication.

…layout-creation

Refactor cache creation functions into static methods of the corresponding forward simulator class. Also add an empty base version of this method, and clean up a few miscellaneous things caught by review.

Includes a number of performance improvements and refinements to the implementation of the KM tree partitioning algorithm. Changes include: - More efficient re-use of computed subtree weights and level partitions - A custom copying function that avoids the use of the incredibly slow deepcopy function. - Less copying in general by changing when graph modifications are applied. -Bisection instead of linear search for getting initial KM partition.

Change the default atom count heuristic so that only a single atom is created when there is a single processor and no memory limit.

Cleans up the lindbladerrorgen.py module. -Removes large blocks of old commented out code and debug statements. - Adds new docstrings for methods that were previously missing them. - First pass at bringing the existing docstrings up to date with current implementation.

Update the implementation of the error generator representation update code for the dense rep case. The results are functionally identical, but are measurably faster. (Einsum is ~2-3X faster than tensordot for this particular case, e.g.). We also now do the entire error generator construction in a single shot instead of block-by-block to get additional benefits from vectorization.

The start of composed effect was 300 lines of old commented out implementation. This commit is simply to remove that fluff.

Refactor the single parameter `set_parameter_value` method to call the multi-parameter implementation under the hood. Add additional performance tweaks to the logic for determining when to update an element of the cache. What I came up with was that when the layer rules are the ExplicitLayerRules and we are updating an effect which is known to belong to a POVM which has already been updated then we can skip the cache update for those effects.

Add a few tweaks to the PrefixTable splitting algorithm to support multiple native state preps. Also fixes a bug for __contains__ comparisons between LabelTupTup and LabelStr. Add support for setting model parameter values by their parameter labels.

Can now specify a parameter label in addition to an integer index for model parameter updates.

Fixes and inefficiency in dm_mapfill_probs which was resulting in effect reps being recalculated unnecessarily, which was a big performance penalty, especially for composed error generator type reps.

Slightly more efficient parity implementation (fewer operations), and add the compiler hint to inline the parity function. In profiling this makes a surprisingly big difference.

Adds in pickle handling for circuits to address an issue with hash randomization.

This commit makes changes to the implementation of the new prefix table splitting algorithm in an attempt to make it deterministic. Previously we made direct use of a number of networkx iterators which turned out to be set like, and as such they had nondeterministic order for their returned values. This is fine in the single threaded setting, but with MPI this meant we would end up with different splittings for the different ranks (I expect there to be many degenerate solutions to the splitting problem, so this isn't a crazy thing to see). This resulted in bugs when using MPI. Hopefully this should fix things...

This reverts commit 5f34a9a.

Add checks for the existence of a model parameter interposer when using the circuit parameter dependence code. Currently that option is not supported.

Attempt at updating the default atom count heuristic to favor having the same number of atoms as processors. Should be revisited at some point to confirm it performs as anticipated.

Remove some profiling and debug bindings for cython extensions.

This is my first pass at updating the default evotype behavior for casting so that we prefer dense representations when using a small number of qubits. Right now the threshold is arbitrarily set to 3 qubits, but this should be reevaluated as needed.

Fix the evotype casting I accidentally broke with typo.

Change the default maximum cache size from 0 to None for the map forward simulator.

…uit outcome probabilities in a slightly more vectorized way.

rileyjmurray · 2025-07-24T16:46:46Z

Moved to PR #613

rileyjmurray and others added 30 commits May 22, 2024 17:02

main changes (breaks some calling functions elsewhere)

55da605

check in

f82655a

remove change that wasnt strictly in-scope for the PR

d0e1bde

remove changes that werent strictly necessary

b932571

tests pass

4f47d1f

remove is_normal function

2cd29ab

add a comment and remove unused imports

14c444b

Merge branch 'feature-faster-circuit-primitives' into feature-faster-…

66e7f78

…layout-creation

Add caching for spam-free circuit expansion

9bc47bc

Cache the expanded SPAM-free circuits to reduce recomputing things unnecessarily.

Minor COPA Layout __init__ tweaks

d97f786

Some minor performance oriented tweaks to the init for COPA layouts.

Refactor some OrderedDicts into regular ones

544fb55

Refactor some of the ordered dictionaries in matrix layout creation into regular ones.

Merge branch 'feature-faster-circuit-primitives' into feature-faster-…

1d4e5a0

…layout-creation

Start the process of adding caching to MDC store creation

91d5ebb

Start adding infrastructure for caching things used in MDC store creation and for plumbing in stuff from layout creation.

Fix dataset bug

e8e7004

Fix a bug I introduced in dataset indexing into something that could be None.

Another minor bugfix caught by testing

aa22c3c

Another minor bug caught by testing.

Another minor bugfix caught by testing

be80255

Update test_stdinputparser.py

ff13da6

Not sure why this didn't get caught on the circuit update branch, but oh well...

Merge branch 'feature-faster-circuit-primitives' into feature-faster-…

81bdacb

…layout-creation

Fix indentation error

f8c5840

Fixes minor error in split_circuits.

Faster implementation of __getitem__

0417c20

Improve the performance of __getitem__ when indexing into static circuits by making use of the _copy_init code path.

Implement caching for map layout creation

c39101d

Implement caching of circuit structures tailored to the map forward simulator's requirements.

Fix bugs in new extract_labels implementation

6cc69bc

Finish refactoring expand_instruments_and_separate_povm

1ff8aeb

This finishes the process of refactoring expand_instruments_and_separate_povm from a circuit method to a method of OpModel.

Refactor expand_instruments_and_separate_povm

5db3e59

Refactor expand_instruments_and_separate_povm to use the multi-circuit version under the hood to reduce code duplication.

Merge branch 'feature-faster-circuit-primitives' into feature-faster-…

7f7a08d

…layout-creation

Refactor cache creation functions

53e2da6

Refactor cache creation functions into static methods of the corresponding forward simulator class. Also add an empty base version of this method, and clean up a few miscellaneous things caught by review.

Corey Ostrove and others added 25 commits September 27, 2024 21:09

Change default atom heuristic

b69c44e

Change the default atom count heuristic so that only a single atom is created when there is a single processor and no memory limit.

Spring cleaning

6c076e7

Cleans up the lindbladerrorgen.py module. -Removes large blocks of old commented out code and debug statements. - Adds new docstrings for methods that were previously missing them. - First pass at bringing the existing docstrings up to date with current implementation.

Clean up composedeffect

3f59ec6

The start of composed effect was 300 lines of old commented out implementation. This commit is simply to remove that fluff.

Add option to update parameters by name

d6deda1

Can now specify a parameter label in addition to an integer index for model parameter updates.

Fix an inefficiency in dm_mapfill_probs

5f34a9a

Fixes and inefficiency in dm_mapfill_probs which was resulting in effect reps being recalculated unnecessarily, which was a big performance penalty, especially for composed error generator type reps.

Minor tweak to effectcrep

81ced99

Slightly more efficient parity implementation (fewer operations), and add the compiler hint to inline the parity function. In profiling this makes a surprisingly big difference.

Add pickle handling for circuits

5b0b2f4

Adds in pickle handling for circuits to address an issue with hash randomization.

Revert "Fix an inefficiency in dm_mapfill_probs"

c14b090

This reverts commit 5f34a9a.

Merge branch 'develop' into feature-lazier-model-param-updates

119e968

Add parameter interposer checks

12c9252

Add checks for the existence of a model parameter interposer when using the circuit parameter dependence code. Currently that option is not supported.

Update the default atom heuristic

ce983fc

Attempt at updating the default atom count heuristic to favor having the same number of atoms as processors. Should be revisited at some point to confirm it performs as anticipated.

Remove some debug bindings

383e4bd

Remove some profiling and debug bindings for cython extensions.

Fix cast bug

bdcca50

Fix the evotype casting I accidentally broke with typo.

Change default cache size

ec27fa7

Change the default maximum cache size from 0 to None for the map forward simulator.

tweak stateless_data

6ff5d59

logging

1344608

add trivial __getstate__ and __setstate__ needed for serialization

3ba841d

remove some logging and profiling code (well, just comment out)

001ea1b

fix enable_backward=True handling. Add a function for evaluating circ…

1cf25e3

…uit outcome probabilities in a slightly more vectorized way.

rileyjmurray mentioned this pull request Jul 15, 2025

Forward simulation on the GPU #607

Open

Merge branch 'faster-torch-rebased' into faster-torch

c16213a

rileyjmurray changed the title ~~WIP: improvements to TorchForwardSimulator~~ Abandoned; moved to another branch (improvements to TorchForwardSimulator) Jul 24, 2025

rileyjmurray closed this Jul 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Abandoned; moved to another branch (improvements to TorchForwardSimulator) #479

Abandoned; moved to another branch (improvements to TorchForwardSimulator) #479

Uh oh!

rileyjmurray commented Aug 20, 2024

Uh oh!

rileyjmurray commented Jul 24, 2025

Uh oh!

Uh oh!

Abandoned; moved to another branch (improvements to TorchForwardSimulator) #479

Abandoned; moved to another branch (improvements to TorchForwardSimulator) #479

Uh oh!

Conversation

rileyjmurray commented Aug 20, 2024

Uh oh!

rileyjmurray commented Jul 24, 2025

Uh oh!

Uh oh!