Code for Policy Continuation with Hindsight Inverse Dynamics (PCHID)
- Bitflip
The bitflip environment was introduced by Hindsight experience replay (HER)
Our implementation of bitflip and HER refers RL-Adventure-2 and Baselines
- GridWorld
Our experiment on the GridWorld domain is based on previous work of Value Iteration Networks (VIN) and the PyTorch implementation of VIN
- FetchReach
The FetchReach environment is one of four OpenAI Fetch envs. As for the other three environments (Push, Slide, PickAndPlace), synchoronous improvement is needed to further improve PCHID's learning efficiency.