Rng improvements #2534

W0lfShAd0w · 2025-09-10T15:50:05Z

Pull Request Description

What issue does this change request address? (Use "#" before the issue to link it, i.e., #42.)

The outdated behavior of having randomUtils initialize the RNG with a hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to use high-entropy initial state. A new input subnode was added to to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results.

What are the significant changes in functionality due to this change request?

For Change Control Board: Change Request Review

The following review must be completed by an authorized member of the Change Control Board.

1. Review all computer code.
2. If any changes occur to the input syntax, there must be an accompanying change to the user manual and xsd schema. If the input syntax change deprecates existing input files, a conversion script needs to be added (see Conversion Scripts).
3. Make sure the Python code and commenting standards are respected (camelBack, etc.) - See on the wiki for details.
4. Automated Tests should pass, including run_tests, pylint, manual building and xsd tests. If there are changes to Simulation.py or JobHandler.py the qsub tests must pass.
5. If significant functionality is added, there must be tests added to check this. Tests should cover all possible options. Multiple short tests are preferred over one large test. If new development on the internal JobHandler parallel system is performed, a cluster test must be added setting, in XML block, the node <internalParallel> to True.
6. If the change modifies or adds a requirement or a requirement based test case, the Change Control Board's Chair or designee also needs to approve the change. The requirements and the requirements test shall be in sync.
7. The merge request must reference an issue. If the issue is closed, the issue close checklist shall be done.
8. If an analytic test is changed/added is the the analytic documentation updated/added?
9. If any test used as a basis for documentation examples (currently found in raven/tests/framework/user_guide and raven/docs/workshop) have been changed, the associated documentation must be reviewed and assured the text matches the example.

…ange in GA with new mutation and crossover type

… identified and fixed by Khang where non-objective optimization values were not being returned by the GA.

…s removes the arbitrary restriction requiring DataObjects to include both 'Inputs' and 'Outputs' nodes, despite the situations where that doesn't makes sense.

…e in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values.

…ing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular.

… hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to take a high-entropy seed value from the OS (e.g. the system clock). A new subnode was added to <RunInfo> to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results.

Equilibrium cycle work june25

…o be calculated incorrectly. (#2) Co-authored-by: Rollins <[email protected]>

…on on the output values prior to calculating the fitness. This had to be implemented separately from the standard RAVEN noramlizeData methodology, as we didn't want to normalize the inputs or return the output values to the user in a normalized format; the normalized values are ONLY needed to estimate the fitness when requested.

…en the solution inputs, constraints, and objectives. The culprit was a dict.update() line that was overwriting the correct values with desynced values. This line was necessary because the self._solutionExport() is not being defined correctly. This will be fixed in a subsequent commit.

… the solutions. The desyncing occurs when the 'populationFitness' local variable (used by single-objective optimization) is stored as the 'self.fitness' attribute of the Algorithm by the survivor selection method. NSGA-II can use the populationFitness local variable just fine, so the 'self.fitness' attribute is superfluous anyway.

… support a reduced input format. Penalty scaling factors are now interpreted as a 2d-array of shape (len(objVar),constraintNum). Function docstrings have been updated accordingly.

…e the way the kwargs dict was being provided to the function.

…solutions to make sure each part of the reproduction process was using the correct values and data for grandparents, parents, and children and that these data were being stored appropriately without overwriting. This in turn fixed the fitness value desyncing issue in NSGA-II as well.

…ividuals from GA are correctly added to and printed with the list of final solutions in the _solutionExport.

…ce redundancy and prevent data from being deleted unnecessarily.

… hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to take a high-entropy seed value from the OS (e.g. the system clock). A new subnode was added to <RunInfo> to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results.

j-bryan

The proposed change is much needed for improved reproducibility and random sampling. Only a few minor changes are requested to keep code within the RAVEN code standards and to make the provided global seed conversion script suitable to be run on any machine. Once all tests are passing, this pull request can be approved.

ravenframework/utils/randomUtils.py

scripts/conversionScripts/convert_globalseed.py

…ests were updated to have the proper expliciting seeding of the RNG. The unseeded test in testRandomUtils.py was modified to check 5 random floats for any repeats, which could indicate the RNG is failing.

j-bryan

One more small comment

j-bryan · 2025-09-16T16:29:40Z

.gitignore

 tests/framework/ROM/TimeSeries/SyntheticHistory/LogARMA/
 tests/framework/ROM/TimeSeries/SyntheticHistory/VARMA/
 tests/framework/ROM/TimeSeries/SyntheticHistory/ZeroFilterDiscontinuous/
+*.xml.bak


Make sure you don't commit any *.xml.bak files, but we shouldn't need to have this in the .gitignore file.

moosebuild · 2025-09-16T16:37:46Z

Job Test mac on 26fc55f : invalidated by @j-bryan

failed in Set python environment

…lity.

moosebuild · 2025-09-23T17:00:43Z

Job Test mac on a30e2d3 : invalidated by @j-bryan

failed in Set python environment

…ainties in the fitted coefficients due to the small amount of training data.

…ing inconsistencies on the Linux OS test machines. This will be raised and corrected in an issue.

moosebuild · 2025-10-08T20:17:16Z

Job Mingw Test on 392a784 : invalidated by @joshua-cogliati-inl

restarted civet

…machine.

* The outdated behavior of having randomUtils initialize the RNG with a hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to take a high-entropy seed value from the OS (e.g. the system clock). A new subnode was added to <RunInfo> to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results. * globalSeed parameter of RunInfo now supports 'None' as a valid input. * Added a print statement for when no GlobalSeed is provided. * Minor changes made to address comments in PR idaholab#2534. Several tests were updated to have the proper expliciting seeding of the RNG. The unseeded test in testRandomUtils.py was modified to check 5 random floats for any repeats, which could indicate the RNG is failing. * global seed added to more tests to ensure consistency with golds. * Minor change to clarify output messages from globalSeed check. * Deprecate Rattlesnake and Mammoth (idaholab#2519) * remove Rattlesnake, Mammoth, and Instant tests * remove Rattlesnake and Mammoth codeinterfaces * removing from CodeInterface factory * removing Mammoth and Rattlesnake references in the docs * Added globalSeed parameter to test input file for backwards compatability. * Modified the tolerance on the multiYearDWT test to account for uncertainties in the fitted coefficients due to the small amount of training data. * Increased tolerances further for multiYearDWT test to get around fitting inconsistencies on the Linux OS test machines. This will be raised and corrected in an issue. * Attempt at addressing the possible intermittent test error on Fedora machine. --------- Co-authored-by: Rollins <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: Gabriel J. Soto Gonzalez <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: rollnk <[email protected]>

* Rng improvements (#3) * The outdated behavior of having randomUtils initialize the RNG with a hardcoded seed value (e.g. 5489) has been replaced with the default seed value of 'None', which prompts numpy.random to take a high-entropy seed value from the OS (e.g. the system clock). A new subnode was added to <RunInfo> to allow for a globalSeed to be set in RAVEN prior to any code execution, which ensures a user-supplied RNG seed is applied before any RNG calls are made, if desired. Setting this globalSeed value to 5489 was necessary in all test files to ensure backwards compatibility with old gold results. * globalSeed parameter of RunInfo now supports 'None' as a valid input. * Added a print statement for when no GlobalSeed is provided. * Minor changes made to address comments in PR idaholab#2534. Several tests were updated to have the proper expliciting seeding of the RNG. The unseeded test in testRandomUtils.py was modified to check 5 random floats for any repeats, which could indicate the RNG is failing. * global seed added to more tests to ensure consistency with golds. * Minor change to clarify output messages from globalSeed check. * Deprecate Rattlesnake and Mammoth (idaholab#2519) * remove Rattlesnake, Mammoth, and Instant tests * remove Rattlesnake and Mammoth codeinterfaces * removing from CodeInterface factory * removing Mammoth and Rattlesnake references in the docs * Added globalSeed parameter to test input file for backwards compatability. * Modified the tolerance on the multiYearDWT test to account for uncertainties in the fitted coefficients due to the small amount of training data. * Increased tolerances further for multiYearDWT test to get around fitting inconsistencies on the Linux OS test machines. This will be raised and corrected in an issue. * Attempt at addressing the possible intermittent test error on Fedora machine. --------- Co-authored-by: Rollins <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: Gabriel J. Soto Gonzalez <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: rollnk <[email protected]> * Optimizer data storage fixes (#5) * Minor bug fixes and formatting improvements. Fixed the bug previously identified and fixed by Khang where non-objective optimization values were not being returned by the GA. * Applied hotfix from Khang Nguyen that corrects a synchronization issue in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values. * Further improvements from Khang and Mohammad Abdo to improve the syncing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular. * Temporary fix made to correct desyncing issues in the Optimizer between the solution inputs, constraints, and objectives. The culprit was a dict.update() line that was overwriting the correct values with desynced values. This line was necessary because the self._solutionExport() is not being defined correctly. This will be fixed in a subsequent commit. * Fixed a bug in NSGA-II causing the fitness values to be desynced from the solutions. The desyncing occurs when the 'populationFitness' local variable (used by single-objective optimization) is stored as the 'self.fitness' attribute of the Algorithm by the survivor selection method. NSGA-II can use the populationFitness local variable just fine, so the 'self.fitness' attribute is superfluous anyway. * Overhauled the method by which NSGA-II was storing data/tracking for solutions to make sure each part of the reproduction process was using the correct values and data for grandparents, parents, and children and that these data were being stored appropriately without overwriting. This in turn fixed the fitness value desyncing issue in NSGA-II as well. * removed tab character. * removed trailing whitespace from comment line. * Commented out a line that is an artifact from cherry-picking from a different branch. This needs to be re-added in a future merge. * Fixed new variable name to match the camelCase convention used in RAVEN. Also, fixed a bug where unneeded multiobjective variables were expected but not initialized in single objective GA. --------- Co-authored-by: Rollins <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: rollnk <[email protected]> --------- Co-authored-by: Rollins <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: Gabriel J. Soto Gonzalez <[email protected]> Co-authored-by: Rollins <[email protected]> Co-authored-by: rollnk <[email protected]>

khnguy22 and others added 23 commits June 11, 2025 20:35

Adding modification for equilibrium cycle optimzation samplers and ch…

b984f8f

…ange in GA with new mutation and crossover type

update and add PRLO module

0613cad

Minor bug fixes and formatting improvements. Fixed the bug previously…

83067a3

… identified and fixed by Khang where non-objective optimization values were not being returned by the GA.

Fixed plot name printed in error messages.

0f5b01e

Added logic to allow 'Inputs', 'Outputs', and 'Index' to be None. Thi…

8307e5f

…s removes the arbitrary restriction requiring DataObjects to include both 'Inputs' and 'Outputs' nodes, despite the situations where that doesn't makes sense.

Applied hotfix from Khang Nguyen that corrects a synchronization issu…

05b2414

…e in the _SolutionExport of NSGA-ii that resulted in model inputs and outputs being coupledi incorrectly. This hotfix does NOT correct the same issue with RAVEN's estimation for the 'final' best values.

Further improvements from Khang and Mohammad Abdo to improve the sync…

54b6af7

…ing of optimizer results and the crowding distance calculation for NSGA-ii. Tested with the GA test suite and stress tested with the ZDT1 test in particular.

Merge pull request #1 from W0lfShAd0w/EquilibriumCycleWork_June25

f10a52a

Equilibrium cycle work june25

fixed a bug frontUtils.py that was causing nonfitness pareto fronts t…

3b4706f

…o be calculated incorrectly. (#2) Co-authored-by: Rollins <[email protected]>

globalSeed parameter of RunInfo now supports 'None' as a valid input.

7baf9fc

Changed the utility of the scaling factors in fitness.py so that they…

144c65d

… support a reduced input format. Penalty scaling factors are now interpreted as a 2d-array of shape (len(objVar),constraintNum). Function docstrings have been updated accordingly.

Fixed the way default scaling factors were being applied to accomodat…

ba2943f

…e the way the kwargs dict was being provided to the function.

Adopted some lines from Josh Cogliati to ensure that the most fit ind…

0471b64

…ividuals from GA are correctly added to and printed with the list of final solutions in the _solutionExport.

Added a print statement for when no GlobalSeed is provided.

bdd5ed6

Improved the data storing used in the sampler._solutionExport to redu…

4500020

…ce redundancy and prevent data from being deleted unnecessarily.

globalSeed parameter of RunInfo now supports 'None' as a valid input.

4c64114

Added a print statement for when no GlobalSeed is provided.

020ac52

j-bryan requested changes Sep 11, 2025

View reviewed changes

ravenframework/utils/randomUtils.py Outdated Show resolved Hide resolved

scripts/conversionScripts/convert_globalseed.py Outdated Show resolved Hide resolved

Rollins added 2 commits September 15, 2025 14:32

Minor changes made to address comments in PR idaholab#2534. Several t…

d12f00a

…ests were updated to have the proper expliciting seeding of the RNG. The unseeded test in testRandomUtils.py was modified to check 5 random floats for any repeats, which could indicate the RNG is failing.

global seed added to more tests to ensure consistency with golds.

26fc55f

W0lfShAd0w requested a review from j-bryan September 16, 2025 16:26

j-bryan requested changes Sep 16, 2025

View reviewed changes

Minor change to clarify output messages from globalSeed check.

661b0f2

Rollins added 2 commits September 22, 2025 12:46

Added globalSeed parameter to test input file for backwards compatabi…

dc2c069

…lity.

resolve merge conflicts with idaholab/devel

a30e2d3

Rollins and others added 2 commits September 23, 2025 12:50

Modified the tolerance on the multiYearDWT test to account for uncert…

e98f7bc

…ainties in the fitted coefficients due to the small amount of training data.

Increased tolerances further for multiYearDWT test to get around fitt…

392a784

…ing inconsistencies on the Linux OS test machines. This will be raised and corrected in an issue.

Attempt at addressing the possible intermittent test error on Fedora …

a5fbc8e

…machine.

W0lfShAd0w requested a review from j-bryan October 28, 2025 15:25

Merge branch 'devel' into RNG_improvements

ac6132f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rng improvements #2534

Rng improvements #2534

Uh oh!

W0lfShAd0w commented Sep 10, 2025

Uh oh!

j-bryan left a comment

Uh oh!

Uh oh!

Uh oh!

j-bryan left a comment

Uh oh!

j-bryan Sep 16, 2025

Uh oh!

moosebuild commented Sep 16, 2025

Uh oh!

moosebuild commented Sep 23, 2025

Uh oh!

moosebuild commented Oct 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Rng improvements #2534

Are you sure you want to change the base?

Rng improvements #2534

Uh oh!

Conversation

W0lfShAd0w commented Sep 10, 2025

Pull Request Description

What issue does this change request address? (Use "#" before the issue to link it, i.e., #42.)

What are the significant changes in functionality due to this change request?

For Change Control Board: Change Request Review

Uh oh!

j-bryan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

j-bryan left a comment

Choose a reason for hiding this comment

Uh oh!

j-bryan Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

moosebuild commented Sep 16, 2025

Uh oh!

moosebuild commented Sep 23, 2025

Uh oh!

moosebuild commented Oct 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants