Optimizer data storage fixes #2533

W0lfShAd0w · 2025-09-10T15:47:37Z

Pull Request Description

What issue does this change request address? (Use "#" before the issue to link it, i.e., #42.)

This fixes the data desyncing issues that were observed in single- and multi-objective GA optimization. In addition, this corrects issues in the NSGA-ii survivor selection that were causing results to be incorrectly overwritten, leading to erroneous data being reported. This PR has some overlap/may not be entirely separable from PR #2532.

What are the significant changes in functionality due to this change request?

For Change Control Board: Change Request Review

The following review must be completed by an authorized member of the Change Control Board.

1. Review all computer code.
2. If any changes occur to the input syntax, there must be an accompanying change to the user manual and xsd schema. If the input syntax change deprecates existing input files, a conversion script needs to be added (see Conversion Scripts).
3. Make sure the Python code and commenting standards are respected (camelBack, etc.) - See on the wiki for details.
4. Automated Tests should pass, including run_tests, pylint, manual building and xsd tests. If there are changes to Simulation.py or JobHandler.py the qsub tests must pass.
5. If significant functionality is added, there must be tests added to check this. Tests should cover all possible options. Multiple short tests are preferred over one large test. If new development on the internal JobHandler parallel system is performed, a cluster test must be added setting, in XML block, the node <internalParallel> to True.
6. If the change modifies or adds a requirement or a requirement based test case, the Change Control Board's Chair or designee also needs to approve the change. The requirements and the requirements test shall be in sync.
7. The merge request must reference an issue. If the issue is closed, the issue close checklist shall be done.
8. If an analytic test is changed/added is the the analytic documentation updated/added?
9. If any test used as a basis for documentation examples (currently found in raven/tests/framework/user_guide and raven/docs/workshop) have been changed, the associated documentation must be reviewed and assured the text matches the example.

…ange in GA with new mutation and crossover type

… identified and fixed by Khang where non-objective optimization values were not being returned by the GA.

…s removes the arbitrary restriction requiring DataObjects to include both 'Inputs' and 'Outputs' nodes, despite the situations where that doesn't makes sense.

…e in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values.

…ing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular.

… hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to take a high-entropy seed value from the OS (e.g. the system clock). A new subnode was added to <RunInfo> to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results.

Equilibrium cycle work june25

…o be calculated incorrectly. (#2) Co-authored-by: Rollins <[email protected]>

…on on the output values prior to calculating the fitness. This had to be implemented separately from the standard RAVEN noramlizeData methodology, as we didn't want to normalize the inputs or return the output values to the user in a normalized format; the normalized values are ONLY needed to estimate the fitness when requested.

…en the solution inputs, constraints, and objectives. The culprit was a dict.update() line that was overwriting the correct values with desynced values. This line was necessary because the self._solutionExport() is not being defined correctly. This will be fixed in a subsequent commit.

… the solutions. The desyncing occurs when the 'populationFitness' local variable (used by single-objective optimization) is stored as the 'self.fitness' attribute of the Algorithm by the survivor selection method. NSGA-II can use the populationFitness local variable just fine, so the 'self.fitness' attribute is superfluous anyway.

… support a reduced input format. Penalty scaling factors are now interpreted as a 2d-array of shape (len(objVar),constraintNum). Function docstrings have been updated accordingly.

…e the way the kwargs dict was being provided to the function.

…solutions to make sure each part of the reproduction process was using the correct values and data for grandparents, parents, and children and that these data were being stored appropriately without overwriting. This in turn fixed the fitness value desyncing issue in NSGA-II as well.

…ividuals from GA are correctly added to and printed with the list of final solutions in the _solutionExport.

…ce redundancy and prevent data from being deleted unnecessarily.

… identified and fixed by Khang where non-objective optimization values were not being returned by the GA.

…e in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values.

…ing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular.

…en the solution inputs, constraints, and objectives. The culprit was a dict.update() line that was overwriting the correct values with desynced values. This line was necessary because the self._solutionExport() is not being defined correctly. This will be fixed in a subsequent commit.

… the solutions. The desyncing occurs when the 'populationFitness' local variable (used by single-objective optimization) is stored as the 'self.fitness' attribute of the Algorithm by the survivor selection method. NSGA-II can use the populationFitness local variable just fine, so the 'self.fitness' attribute is superfluous anyway.

…solutions to make sure each part of the reproduction process was using the correct values and data for grandparents, parents, and children and that these data were being stored appropriately without overwriting. This in turn fixed the fitness value desyncing issue in NSGA-II as well.

…ifferent branch. This needs to be re-added in a future merge.

Jimmy-INL · 2025-10-15T16:38:24Z

@W0lfShAd0w
Almost all the GA tests failed.

Jimmy-INL · 2025-10-15T16:52:23Z

@W0lfShAd0w, I have found the issue, but I want you to find it too. This will help you navigate our regression testing system.

Jimmy-INL · 2025-10-15T17:22:49Z

ravenframework/Optimizers/GeneticAlgorithm.py


-    objectiveVal = []
+    currentPop_objvals = []
    for i in range(len(self._objectiveVar)):


You are mixing Camel Case with with Snake Case (currentPopInputs vs current_pop_inputs) In raven we do adopt Camel case we never use '_'. Please modify all variable names to match this notion.

Jimmy-INL · 2025-10-15T17:33:49Z

ravenframework/Optimizers/GeneticAlgorithm.py

+        currentPop_fitsbysoln = datasetToDataArray(currentPop_fitness, self._objectiveVar).data.tolist()
+    ## 5. Compute the rank of current population
+        currentPop_ranks = frontUtils.rankNonDominatedFrontiers(np.array(currentPop_fitsbysoln), isFitness=True)
+        currentPop_ranks = xr.DataArray(currentPop_ranks,


This is only defined if self._isMultiObjective is true. But it is used even in multiobjective and hence it will error out because it is used before assigned to a value. How did you test this? it will never run.

…EN. Also, fixed a bug where unneeded multiobjective variables were expected but not initialized in single objective GA.

* The outdated behavior of having randomUtils initialize the RNG with a hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to take a high-entropy seed value from the OS (e.g. the system clock). A new subnode was added to <RunInfo> to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results. * globalSeed parameter of RunInfo now supports 'None' as a valid input. * Added a print statement for when no GlobalSeed is provided. * Minor changes made to address comments in PR idaholab#2534. Several tests were updated to have the proper expliciting seeding of the RNG. The unseeded test in testRandomUtils.py was modified to check 5 random floats for any repeats, which could indicate the RNG is failing. * global seed added to more tests to ensure consistency with golds. * Minor change to clarify output messages from globalSeed check. * Deprecate Rattlesnake and Mammoth (idaholab#2519) * remove Rattlesnake, Mammoth, and Instant tests * remove Rattlesnake and Mammoth codeinterfaces * removing from CodeInterface factory * removing Mammoth and Rattlesnake references in the docs * Added globalSeed parameter to test input file for backwards compatability. * Modified the tolerance on the multiYearDWT test to account for uncertainties in the fitted coefficients due to the small amount of training data. * Increased tolerances further for multiYearDWT test to get around fitting inconsistencies on the Linux OS test machines. This will be raised and corrected in an issue. * Attempt at addressing the possible intermittent test error on Fedora machine. --------- Co-authored-by: Rollins <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: Gabriel J. Soto Gonzalez <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: rollnk <[email protected]>

khnguy22 and others added 29 commits June 11, 2025 20:35

Adding modification for equilibrium cycle optimzation samplers and ch…

b984f8f

…ange in GA with new mutation and crossover type

update and add PRLO module

0613cad

Minor bug fixes and formatting improvements. Fixed the bug previously…

83067a3

… identified and fixed by Khang where non-objective optimization values were not being returned by the GA.

Fixed plot name printed in error messages.

0f5b01e

Added logic to allow 'Inputs', 'Outputs', and 'Index' to be None. Thi…

8307e5f

…s removes the arbitrary restriction requiring DataObjects to include both 'Inputs' and 'Outputs' nodes, despite the situations where that doesn't makes sense.

Applied hotfix from Khang Nguyen that corrects a synchronization issu…

05b2414

…e in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values.

Further improvements from Khang and Mohammad Abdo to improve the sync…

54b6af7

…ing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular.

Merge pull request #1 from W0lfShAd0w/EquilibriumCycleWork_June25

f10a52a

Equilibrium cycle work june25

fixed a bug frontUtils.py that was causing nonfitness pareto fronts t…

3b4706f

…o be calculated incorrectly. (#2) Co-authored-by: Rollins <[email protected]>

globalSeed parameter of RunInfo now supports 'None' as a valid input.

7baf9fc

Changed the utility of the scaling factors in fitness.py so that they…

144c65d

… support a reduced input format. Penalty scaling factors are now interpreted as a 2d-array of shape (len(objVar),constraintNum). Function docstrings have been updated accordingly.

Fixed the way default scaling factors were being applied to accomodat…

ba2943f

…e the way the kwargs dict was being provided to the function.

Adopted some lines from Josh Cogliati to ensure that the most fit ind…

0471b64

…ividuals from GA are correctly added to and printed with the list of final solutions in the _solutionExport.

Added a print statement for when no GlobalSeed is provided.

bdd5ed6

Improved the data storing used in the sampler._solutionExport to redu…

4500020

…ce redundancy and prevent data from being deleted unnecessarily.

Minor bug fixes and formatting improvements. Fixed the bug previously…

148e8f9

… identified and fixed by Khang where non-objective optimization values were not being returned by the GA.

Applied hotfix from Khang Nguyen that corrects a synchronization issu…

a343d95

…e in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values.

Further improvements from Khang and Mohammad Abdo to improve the sync…

c455ae3

…ing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular.

removed tab character.

4416d28

removed trailing whitespace from comment line.

ce2d41e

Commented out a line that is an artifact from cherry-picking from a d…

4088540

…ifferent branch. This needs to be re-added in a future merge.

Jimmy-INL self-requested a review October 15, 2025 16:31

Jimmy-INL reviewed Oct 15, 2025

View reviewed changes

rollnk and others added 3 commits October 22, 2025 11:55

Fixed new variable name to match the camelCase convention used in RAV…

c6f5be5

…EN. Also, fixed a bug where unneeded multiobjective variables were expected but not initialized in single objective GA.

Merge branch 'devel' into optimizer_data_storage_fixes

8baec95

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimizer data storage fixes #2533

Optimizer data storage fixes #2533

Uh oh!

W0lfShAd0w commented Sep 10, 2025

Uh oh!

Jimmy-INL commented Oct 15, 2025 •

edited

Loading

Uh oh!

Jimmy-INL commented Oct 15, 2025

Uh oh!

Jimmy-INL Oct 15, 2025

Uh oh!

Jimmy-INL Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimizer data storage fixes #2533

Are you sure you want to change the base?

Optimizer data storage fixes #2533

Uh oh!

Conversation

W0lfShAd0w commented Sep 10, 2025

Pull Request Description

What issue does this change request address? (Use "#" before the issue to link it, i.e., #42.)

What are the significant changes in functionality due to this change request?

For Change Control Board: Change Request Review

Uh oh!

Jimmy-INL commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Jimmy-INL commented Oct 15, 2025

Uh oh!

Jimmy-INL Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Jimmy-INL Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Jimmy-INL commented Oct 15, 2025 •

edited

Loading