This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmark environments and conclude a series of research works for goal-conditioned reinforcement learning (GCRL).
| Approach | Resolved Goal Type | Sub-Goal | Relabel | Optimization |
|---|---|---|---|---|
| UVFA | Vector/Image | ✅ | ||
| HER | Vector/Image | ✅ | ||
| Laplacian Representation Learning | Vector/Image | ✅ | ||
| SR | Vector/Image | ✅ | ||
| DDL | Vector/Image | ✅ | ✅ | |
| SoRB | Vector/Image | ✅ | ||
| Skew-Fit | Vector | ✅ | ||
| HER with Demonstrations | Vector | ✅ | ||
| GoalGAN | Vector | ✅ | ||
| Sub-goal Discovery | Vector | ✅ | ||
| HGG | Vector | ✅ | ||
| CHER | Vector | ✅ | ||
| G-HER | Vector | ✅ | ||
| Hindsight Planner | Vector | ✅ | ||
| GDP | Vector | ✅ | ||
| VDS | Vector | ✅ | ||
| MEGA | Vector | ✅ | ✅ | |
| PlanGAN | Vector | ✅ | ||
| AIM | Vector | ✅ | ||
| I-HER | Vector | ✅ | ||
| GCSL | Vector | ✅ | ||
| MapGo | Vector | ✅ | ✅ | |
| L3P | Vector | ✅ | ||
| RIG | Image | ✅ | ||
| DISCERN | Image | ✅ | ||
| HVF | Image | ✅ | ||
| AMIGo | Image | ✅ | ||
| GAP | Image | ✅ | ||
| LEXA | Image | ✅ |
Update: Most of these environments has been integrated in Gymnasium-Robotics.
Note almost all environments support various kinds of goal structures, i.e., vector, image, language, etc.
| Environment | Benchmark Type | Common Used Goal Structure | Description |
|---|---|---|---|
| FetchReach | Gym Robotics/Mujoco | Vector/Image | A robotic arm reaches a target position |
| FetchPush | Gym Robotics/Mujoco | Vector/Image | A robotic arm pushes a block to a target position |
| FetchPickAndPlace | Gym Robotics/Mujoco | Vector/Image | A robotic arm first picks up the block then places it to a target position |
| FetchSlide | Gym Robotics/Mujoco | Vector/Image | A robotic arm slides a buck to a goal position |
| HandManipulateBlock | Gym Robotics/Mujoco | Vector/Image | Orient a block using a robot hand |
| HandManipulateEgg | Gym Robotics/Mujoco | Vector/Image | Orient an egg using a robot hand |
| HandManipulatePen | Gym Robotics/Mujoco | Vector/Image | Orient a pen using a robot hand |
| HalfCheetah | Mujoco | Vector/Image | Make a 2D cheetah robot run and keep a specific speed |
| AntLocomotion/Ant Maze | Mujoco | Vector | Make a 3D four-legged robot walk to a target position |
| Sawyer | Mujoco | Vector | A sawyer robot reaches a target position |
| Reacher | Mujoco | Vector/Image | A reacher robot reaches a target position |
| N-D Mass | Mujoco | Vector | A ball reaches a target position |
| DoorOpening | Mujoco | Image | A sawyer robot opens a door by latching onto the handle |
| Dy-Reaching | Mujoco | Image | A robotic arm reaches a target position moving along a straight line |
| Dy-Circling | Mujoco | Image | A robotic arm reaches a target position moving along a circle |
| Dy-Pushing | Mujoco | Image | A robotic arm moves the box to a moving target position |
| Dy-Pouring | Mujoco | Image | A robotic arm grips the can and pours the water into a cup |
| WaterMaze | DeepMind Lab | Image | Get to a target position described by an image |
| Seaquest | Atari | Image | Get to a target position described by an image |
| Montezuma’s Revenge | Atari | Image | Get to a target position described by an image |
| Reacher | DeepMind Control Suit | Image | A two-link planar reacher poses an aimed shape |
| Manipulator | DeepMind Control Suit | Image | A 3-DoF finger robot rotates the body into a target orientation |
| Finger | DeepMind Control Suit | Image | A planar manipulator brings an object in a target location |
| Place Object in Tray | PyBullet | Image | A robotic arm puts toys into a tray |
| Opening Drawer | PyBullet | Image | A robotic arm opens the drawer |
| PickandPlace | PyBullet | Image | A robotic arm first picks up the block then places it to a target position |
| Five Object Manipulation | CLEVR-Robot/Mujoco | Image/Language | A point mass agent arranges, orders and sorts 5 fixed objects |
| Diverse Object Manipulation | CLEVR-Robot/Mujoco | Image/Language | A point mass agent orders objects in different shapes and colors |
| Playground | Playground | Image/Language | A 2D hand moves and grasps/releases different objects |
| VizDoom | ViZDoom | Image/Language | AI bots reach specific place in the first person shooter 3D game Doom |