Safe-Panda-gym

We develop a modification to the Panda Gym by adding constraints to the environments like Unsafe regions and, constraints on the task. The aim is to develop an environment to test CMDPs (Constraint Markov Decision Process) / Safe-RL algorithms such as CPO, PPO - Lagrangian and algorithms developed by the team.

Safe-Panda-gym is developed with the following key features:

Add safe environments considering the constraints, like PandaReachSafe-v2, PandaPushSafe-v2, PandaSlideSafe-v2, PickAndPlaceSafe-v2 and PandaStackSafe-v2.
Support image-based environments, which can be found in rgb_rendering_safe.py in the test_safe_envs folder.
Support SafePO-Baselines to train the safe environments in our repo, which can be seen in the train_safe_rl_algorithms folder.

Safe-Panda-Gym is a project maintained by Tosin and Shengjie Wang. We encourage modifications and recommendations like new constraints, new environments, bug fixes, and Image-based observation environments intended to be used for Dreamer-v2 like Model-based algorithms.

Safe Multi Task env

We add environments intended to be used to learn multi-task or sub-goal RL, as some tasks build on another, and knowledge used to solve one can be transferable to another.

Documentation

Check out the documentation in Panda Gym.

Installation

Add Safe Rl submodule

git submodule add .git

From source

git clone https://github.com/tohsin/Safe-panda-gym.git
pip install -e .

Usage

import gymnasium as gym
import panda_gym
import time
env = gym.make("PandaReachSafe-v2", render_mode="human")
obs_dim = env.observation_space.shape

obs = env.reset()
done = False

while not done:
    action = env.action_space.sample()
    obs, reward, done, info = env.step(action)
    cost = info['cost']
    env.render(mode='human')
    print(cost)
    time.sleep(2)


env.close()

More testing examples can be found in the test_safe_env folder.

Safe Environments


`PandaReachSafe-v2`	`PandaPushSafe-v2`

`PandaSlideSafe-v2`	`PickAndPlaceSafe-v2`

`PandaStackSafe-v2`

Extra Environments by the Team


`PandaStack3-v2`	`PandaStackPyramid-v2`

`PandaBuildL-v2`

Baselines results

Baselines results are obtained by SafePO.

Citation

If you think the bechmark is useful, please cite as

@misc{SafePandaGym,
  author = {Tosin Oseni, Shengjie Wang},
  title = {Safe Panda Gym},
  year = {2022},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/tohsin/Safe-panda-gym}},
}

@article{gallouedec2021pandagym,
  title        = {{panda-gym: Open-Source Goal-Conditioned Environments for Robotic Learning}},
  author       = {Gallou{\'e}dec, Quentin and Cazin, Nicolas and Dellandr{\'e}a, Emmanuel and Chen, Liming},
  year         = 2021,
  journal      = {4th Robot Learning Workshop: Self-Supervised and Lifelong Learning at NeurIPS},
}

Name		Name	Last commit message	Last commit date
Latest commit History 224 Commits
.github		.github
docs		docs
examples		examples
panda_gym		panda_gym
test		test
test_safe_envs		test_safe_envs
train_safe_rl_algorithms		train_safe_rl_algorithms
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Safe-Panda-gym

Safe Multi Task env

Documentation

Installation

Add Safe Rl submodule

From source

Usage

Safe Environments

Extra Environments by the Team

Baselines results

Citation

About

Releases

Packages

Languages

License

tohsin/Safe-panda-gym

Folders and files

Latest commit

History

Repository files navigation

Safe-Panda-gym

Safe Multi Task env

Documentation

Installation

Add Safe Rl submodule

From source

Usage

Safe Environments

Extra Environments by the Team

Baselines results

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages