RL for Dummies

RL code and experiments for meta-RL algorithms and simple RL algorithms like REINFORCE, in the style of CleanRL and MinimalRL.

There are dozens of high-quality deep RL libraries out there, but most of them are not hackable. To understand one algorithm from start to finish requires traversing dozens of classes across dozens of modules. Great for engineering and extensibility, not so great for getting your hands dirty as fast as possible.

Projects like MinimalRL and CleanRL solve this problem wonderfully, but each have some slight shortcomings. MinimalRL implements each algorithm in a single file for either the CartPole or Pendulum environments. This is fine, but a bit too minimal. I still want to be able to track experiments in Tensorboard. I still want to be able to run the algorithms on different environments. CleanRL includes all the bells and whistles, but only implements the canonical deep RL algorithms. I wanted to see simpler algorithms like A2C as well.

Supported behavior for CleanRL-based environments

Dynamically load environment and network to use with it
...Work on dqn_simple_jax.py first, and then do a sweep of other algorithms and ensure they are all the same.
We put a focus on Jax (not yet).

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.vscode		.vscode
cloud		cloud
envs		envs
experiments		experiments
probe_envs		probe_envs
rl_for_dummies		rl_for_dummies
scripts		scripts
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
Taskfile.yml		Taskfile.yml
icon.png		icon.png
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RL for Dummies

Supported behavior for CleanRL-based environments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

jugheadjones10/rl-for-dummies

Folders and files

Latest commit

History

Repository files navigation

RL for Dummies

Supported behavior for CleanRL-based environments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages