# ball-and-stick-rl

Overview

This repo uses the Soft Actor-Critic (SAC) algorithm to train a robot to balance on top of a rolling sphere while tracking a target velocity. The robot is essentially an inverted pendulum with three omni-directional wheels in contact with the sphere. The agent controls the three motor torques to keep the pendulum upright and track the target velocity.
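
The two objectives above (stay upright, track a target velocity) are commonly combined into a single scalar reward. The following is only a minimal sketch of one such shaping, not the reward used in this repo: the function name `sketch_reward`, the weights, the `fall_angle` threshold, and the cosine/exponential terms are all assumptions made for illustration.

```python
import math


def sketch_reward(tilt_rad, velocity, target_velocity,
                  w_upright=1.0, w_track=1.0, fall_angle=math.pi / 4):
    """Hypothetical shaped reward; not taken from this repo.

    tilt_rad: pendulum angle from vertical, in radians.
    velocity, target_velocity: (vx, vy) tuples.
    Returns (reward, done).
    """
    # Assumed termination rule: the episode ends once the pendulum
    # tilts past the fall threshold.
    done = abs(tilt_rad) > fall_angle

    # Upright term: 1 when vertical, decreasing as the tilt grows.
    upright = math.cos(tilt_rad)

    # Tracking term: 1 when the velocity matches the target, decaying
    # with the Euclidean velocity error.
    err = math.hypot(velocity[0] - target_velocity[0],
                     velocity[1] - target_velocity[1])
    tracking = math.exp(-err)

    return w_upright * upright + w_track * tracking, done
```

With a shaping like this, the relative size of `w_upright` and `w_track` decides how much the agent is rewarded for merely balancing versus actually following the target velocity.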

Setup

This repo depends on a fork of MuJoCo that contains a small change to support anisotropic friction for the omni-wheels in contact with the sphere. You'll first need to clone and build the fork, including the Python bindings, and then edit the absolute path to mujoco-3.3.5.tar.gz in the pyproject.toml file to match your local build.

Install the dependencies with Poetry

poetry install

Training

Launch training with

./train_sac.sh

Testing

To visualize a trained model in MuJoCo run

./test_sac.sh

To Launch Viewer

To launch the MuJoCo viewer with the ball-and-stick model imported, run

./viewer.sh

Training Metrics

The training metrics are logged to Weights & Biases (https://wandb.ai).

For SAC they look something like this:

Observations

The PPO algorithm was also tested, but it did not work well (as implemented, anyway). The SAC algorithm does learn to balance, but it seems to plateau at a sub-optimal performance level and has difficulty learning to track the target velocity.
