Support for Stoch Wt Avg (SWA) closes #321 #320
Draft
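Stochastic Weight Averaging (SWA) averages the model weights collected at several points along the training trajectory, instead of keeping only the final weights (paraphrasing from the PyTorch page).
See the PyTorch SWA page for more.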
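As a rough illustration of the averaging idea only (not the code in this PR), here is a minimal pure-Python sketch of an equal-weight running average over weight snapshots. The helper name `update_swa` and the toy snapshot values are made up for this example; a real setup would more likely rely on the utilities in `torch.optim.swa_utils`.

```python
from typing import List


def update_swa(swa_weights: List[float], new_weights: List[float], n_averaged: int) -> List[float]:
    """Fold one more weight snapshot into an equal-weight running average.

    n_averaged is the number of snapshots already included in swa_weights.
    """
    return [
        avg + (new - avg) / (n_averaged + 1)
        for avg, new in zip(swa_weights, new_weights)
    ]


# Toy snapshots standing in for flattened model weights (hypothetical values).
snapshots = [[1.0, 4.0], [3.0, 8.0], [5.0, 0.0]]

swa = list(snapshots[0])  # the first snapshot initialises the average
for n, weights in enumerate(snapshots[1:], start=1):
    swa = update_swa(swa, weights, n)
```

After the loop, `swa` holds the element-wise mean of all snapshots, which is the quantity SWA uses in place of the last set of weights.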
Description
Relatively simple change in exp_manager.py. It allows an additional key "swa" to be included in policy_kwargs.
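A purely hypothetical sketch of how such a key could be separated from the remaining policy keyword arguments is given below. The helper name `split_swa_kwargs`, the shape of the value stored under "swa", and the field names `swa_start` and `swa_lr` are all assumptions made for illustration; they are not taken from this PR.

```python
from typing import Any, Dict, Optional, Tuple


def split_swa_kwargs(
    policy_kwargs: Dict[str, Any],
) -> Tuple[Dict[str, Any], Optional[Dict[str, Any]]]:
    """Return (policy_kwargs without "swa", the "swa" settings or None)."""
    remaining = dict(policy_kwargs)  # shallow copy so the caller's dict is untouched
    swa_settings = remaining.pop("swa", None)
    return remaining, swa_settings


# Hypothetical hyperparameter entry; the fields under "swa" are assumptions.
example_kwargs = {"net_arch": [64, 64], "swa": {"swa_start": 100, "swa_lr": 0.05}}
policy_only, swa_settings = split_swa_kwargs(example_kwargs)
```

Popping the key before the remaining arguments are forwarded keeps the policy constructor from receiving a keyword it does not know about.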
Motivation and Context
SWA might help improve stability and reduce sensitivity to random seeds in some DRL applications.
Closes #321
Types of changes
Checklist:
make format (required)
make check-codestyle and make lint (required)
make pytest and make type both pass. (required)
Note: we are using a maximum length of 127 characters per line