Added hyperparameter tuning for RecurrentPPO #415

technocrat13 · 2023-10-23T15:24:09Z

Hyperparameter tuning for RecurrentPPO was non-existent as hyperparams_opt.py did not accept 'ppo_lstm' as a valid argument

Description

Extended sample_ppo_params() to be called by sample_ppo_lstm_params(), trail is then updated with some lstm specific hyperparams, and "policy_kwargs" is updated
Added "tiny" to sample_ppo_params() to support smaller neural nets for the LSTM (solution 2 in issue #409)

Motivation and Context

closes [Bug]: ppo_lstm not implemented in hyperparams_opt.py #409
ReccurentPPO's hyperparameters can not be tuned by passing "ppo_lstm" to -optimize
I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist:

I've read the CONTRIBUTION guide (required)
I have updated the changelog accordingly (required).
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.
I have reformatted the code using make format (required)
I have checked the codestyle using make check-codestyle and make lint (required)
I have ensured make pytest and make type both pass. (required)

Note: we are using a maximum length of 127 characters per line

…3-zoo

araffin

LGTM, thanks =)

Could you please update the changelog?
as you use master branch (protected by default), I couldn't push the changes...

technocrat13 · 2023-10-28T06:16:23Z

updated changelog, let me know if anything else is required, I can add you as a contributor, seems a little late to create a branch (I will keep this in mind next time)

technocrat13 added 8 commits October 18, 2023 17:13

ppo_lstm sampling added

71dbc7a

solution 2, added tiny to ppo

15f3a16

updated tests

86f0f64

added ppo_lstm to test_hyperparms_opt.py

b8de93e

updated formatting in hyperparams_opt.py

261fabd

Merge branch 'master' of https://github.com/technocrat13/rl-baselines…

b73ead8

…3-zoo

Merge branch 'master' into master

2c377db

Merge branch 'master' into master

7ad82f7

araffin self-requested a review October 26, 2023 12:30

araffin requested changes Oct 27, 2023

View reviewed changes

Update CHANGELOG.md

22bdcc8

araffin approved these changes Oct 28, 2023

View reviewed changes

araffin merged commit e98c00e into DLR-RM:master Oct 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added hyperparameter tuning for RecurrentPPO #415

Added hyperparameter tuning for RecurrentPPO #415

technocrat13 commented Oct 23, 2023 •

edited

Loading

araffin left a comment

technocrat13 commented Oct 28, 2023

Added hyperparameter tuning for RecurrentPPO #415

Added hyperparameter tuning for RecurrentPPO #415

Conversation

technocrat13 commented Oct 23, 2023 • edited Loading

Description

Motivation and Context

Types of changes

Checklist:

araffin left a comment

Choose a reason for hiding this comment

technocrat13 commented Oct 28, 2023

technocrat13 commented Oct 23, 2023 •

edited

Loading