Official code for "Cross-Embodiment Dexterous Grasping with Reinforcement Learning" (ICLR 2025)
- Code for processing eigengrasps and training retargeting networks.
- Code for embodiment randomization.
- Code for RL and DAgger.
- Use Python 3.8.19. Run `pip install -r requirements.txt`.
- Install IsaacGym Preview 4 and IsaacGymEnvs.
- Install manopth.
- Install `dex-retargeting`: download our modified code here and unzip, then `cd dex-retargeting && pip install -e .`. Our code is developed on this.
Files `results/pca_$N_grab.pkl` are eigengrasps with $N PCA eigenvectors computed from the GRAB dataset. The data format in each `.pkl` is:

- `'eigen_vectors'`: (N, 45) numpy array, the eigengrasps corresponding to the 45-dim finger axis-angles in MANO
- `'min_values'`: (N,) numpy array, min values on each axis
- `'max_values'`: (N,) numpy array, max values on each axis
- `'D_mean'`: (45,) numpy array, mean of the original data
- `'D_std'`: (45,) numpy array, std of the original data
Run `results/vis_pca_data.py` to control the 9-dim coordinates and visualize the corresponding MANO hand pose.
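For reference, going from the low-dim coordinates back to a MANO pose is a standard PCA de-projection. The sketch below assumes that convention (reconstruct in normalized space, then un-normalize); `pca_9_grab.pkl` and the 9-dim coordinates are from the description above.

```python
import pickle
import numpy as np

with open("results/pca_9_grab.pkl", "rb") as f:
    pca = pickle.load(f)

# Sample 9-dim eigengrasp coordinates inside the recorded per-axis ranges.
z = np.random.uniform(pca["min_values"], pca["max_values"])

# De-project to 45-dim MANO finger axis-angles (assumed PCA convention).
mano_pose = z @ pca["eigen_vectors"] * pca["D_std"] + pca["D_mean"]
print(mano_pose.shape)  # (45,)
```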
We use DexPilot to retarget MANO poses to dexterous hand joint angles. `cd retargeting` and run `vis_eigengrasp_to_dexhand.py` to visualize MANO-to-any-hand retargeting.
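For intuition, DexPilot-style retargeting can be posed as an optimization that matches keypoint vectors between the human and robot hands. The following is a simplified sketch of that objective, not our actual implementation (which uses the modified dex-retargeting package above); `robot_keypoint_vecs` (a forward-kinematics callable) and the keypoint pairing are hypothetical stand-ins.

```python
import numpy as np
from scipy.optimize import minimize

def retarget_pose(human_vecs, robot_keypoint_vecs, q0, alpha=1e-3):
    """Find robot joint angles whose keypoint vectors match the human's.

    human_vecs:         (K, 3) vectors between paired human keypoints
    robot_keypoint_vecs: callable q -> (K, 3) corresponding robot vectors (hypothetical FK)
    q0:                 initial joint angles, shape (DOF,)
    """
    def cost(q):
        diff = robot_keypoint_vecs(q) - human_vecs           # vector-matching residual
        return np.sum(diff ** 2) + alpha * np.sum(q ** 2)    # plus a small regularizer
    return minimize(cost, q0, method="L-BFGS-B").x
```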
To accelerate batch computation for parallel RL training, we train retargeting neural networks.
- Download the GRAB dataset and place the `s1.pkl`~`s10.pkl` files under `../GRAB/hand_dataset/`. Run `generate_dataset.py` to generate paired training data of 45-dim MANO poses and X-dim robot poses; the dataset is saved in `dataset/`. Use the option `--robot_name` to specify the robot hand.
- Run `train_retartgeting_nn.py` to train the retargeting neural network (a sketch of a possible architecture follows this list). Use `--robot_name` to specify the hand. The checkpoint, config, and tensorboard log will be saved in `models/`.
- Run `vis_nn_retargeting.py` to qualitatively check the performance of the learned model.
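The README does not pin down the network here, so the following is only a minimal sketch of what such a retargeting network could look like: an MLP regressing X-dim robot joint angles from the 45-dim MANO pose, trained on the paired data above. Hidden sizes, the 16-DOF output, and training details are assumptions.

```python
import torch
import torch.nn as nn

class RetargetingNet(nn.Module):
    """Maps a 45-dim MANO finger pose to robot joint angles (sketch)."""
    def __init__(self, robot_dof, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(45, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, robot_dof),
        )

    def forward(self, mano_pose):
        return self.net(mano_pose)

# Supervised regression on (mano_pose, robot_pose) pairs from dataset/.
model = RetargetingNet(robot_dof=16)  # 16 DOF is an assumption
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
mano, robot = torch.randn(64, 45), torch.randn(64, 16)  # placeholder batch
loss = nn.functional.mse_loss(model(mano), robot)
opt.zero_grad(); loss.backward(); opt.step()
```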
- Download meshes here and unzip. Move the folder `meshes` to `robot_randomization/`.
- In `robot_randomization/`, run `create_random_robots.py` to randomize the xyz offsets of the hand-arm mounting joint, generating 20 variants for each robot (see the sketch after this list).
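For illustration, randomizing a mounting offset amounts to perturbing the `<origin xyz>` of the URDF joint that attaches the hand to the arm. A minimal sketch, where the joint name `mount_joint`, the offset range, and the file naming are all hypothetical:

```python
import random
import xml.etree.ElementTree as ET

def make_variants(urdf_path, n=20, joint_name="mount_joint", scale=0.02):
    """Write n URDF copies with randomized xyz offsets on the mounting joint."""
    for i in range(n):
        tree = ET.parse(urdf_path)
        for joint in tree.getroot().iter("joint"):
            if joint.get("name") != joint_name:
                continue
            origin = joint.find("origin")
            if origin is None:
                continue
            xyz = [float(v) for v in origin.get("xyz", "0 0 0").split()]
            xyz = [v + random.uniform(-scale, scale) for v in xyz]
            origin.set("xyz", " ".join(f"{v:.4f}" for v in xyz))
        tree.write(urdf_path.replace(".urdf", f"_rand{i}.urdf"))
```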
- Download YCB objects here, unzip, and move the folder `ycb_assets` to `assets/`.
- `cd rl/` and follow the scripts in `run.sh` to train a state-based policy on four types of hands for one object. For example, run:
```
python run_ppo_multidex.py \
    num_envs=8192 \
    task=MultiDexGrasp \
    train.params.max_iterations=40000 \
    task.env.observationType="armdof+keypts+objpose+lastact" \
    task.env.asset.objectAssetFile="ycb_assets/urdf/077_rubiks_cube.urdf" \
    task.env.randomizeRobot=True
```
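For intuition, the policy acts in the shared eigengrasp space, and the pretrained retargeting network converts that action into hand-specific joint targets at every control step. Below is a minimal sketch of that control flow under assumed interfaces: `robot`, `retarget_net`, and the 9+arm action layout are hypothetical, and the decoding reuses the PCA convention assumed earlier.

```python
import numpy as np
import torch

def apply_action(action, pca, retarget_net, robot):
    """Decode a low-dim eigengrasp action into joint targets (sketch).

    action: concatenation of 9 eigengrasp coords and arm targets (assumed layout).
    pca:    dict loaded from results/pca_9_grab.pkl.
    """
    eigen_coords, arm_targets = action[:9], action[9:]
    # Eigengrasp coords -> 45-dim MANO finger pose (assumed PCA reconstruction).
    mano_pose = eigen_coords @ pca["eigen_vectors"] * pca["D_std"] + pca["D_mean"]
    # MANO pose -> this hand's joint angles via the trained retargeting network.
    with torch.no_grad():
        hand_targets = retarget_net(torch.as_tensor(mano_pose, dtype=torch.float32))
    robot.set_joint_targets(arm_targets, hand_targets.numpy())  # hypothetical API
```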
- Generate the config file for expert policies: `cd rl/`. Assuming the checkpoint directories for all objects are placed under `runs_multidex/`, run `python generate_expert_yaml.py --path runs_multidex`. The generated config file is `expert.yaml`; its format should be:
```
004_sugar_box:
  ckpt: model_40000.pt
  path: runs_multidex/004_sugar_box_2024-09-03_11-40-07
005_tomato_soup_can:
  ckpt: model_40000.pt
  path: runs_multidex/005_tomato_soup_can_2024-09-03_11-40-28
......
```
Move `expert.yaml` to `tasks_crossdex/expert/`.
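For reference, a config in this format can be read with PyYAML; a minimal sketch of resolving each object's expert checkpoint path:

```python
import os
import yaml

with open("tasks_crossdex/expert/expert.yaml") as f:
    experts = yaml.safe_load(f)

# Map each object to its expert checkpoint path.
for obj, cfg in experts.items():
    print(obj, os.path.join(cfg["path"], cfg["ckpt"]))
```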
- Follow the scripts in `run.sh` to train on four types of hands for all objects. For example, run:
```
python run_dagger_multidex.py \
    num_envs=16384 \
    task=MultiDexGrasp \
    train=MultiDexGraspDAGGER \
    task.env.enablePointCloud=True \
    task.env.observationType="armdof+keypts+objpose+lastact" \
    task.env.studentObservationType="armdof+keypts+lastact+objpcl" \
    task.env.multiTask=True \
    task.env.multiTaskLabel="no" \
    task.env.asset.objectAssetDir="ycb_assets/urdf" \
    expert=expert \
    train.params.max_iterations=20000 \
    task.env.randomizeRobot=True
```
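DAgger trains the point-cloud student by querying the state-based experts on states the student itself visits. A minimal sketch of one such iteration under assumed interfaces (`env`, `expert`, `student`, and the observation keys are all hypothetical):

```python
import torch

def dagger_update(env, expert, student, optimizer, horizon=32):
    """One DAgger iteration: roll out the student, relabel with the expert, imitate."""
    obs = env.reset()
    states, labels = [], []
    for _ in range(horizon):
        with torch.no_grad():
            act = student(obs["student_obs"])         # student drives the rollout
            labels.append(expert(obs["expert_obs"]))  # expert relabels the same states
        states.append(obs["student_obs"])
        obs = env.step(act)
    # Behavior-cloning loss on the expert-relabeled, student-visited states.
    pred = student(torch.cat(states))
    loss = torch.nn.functional.mse_loss(pred, torch.cat(labels))
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()
```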
If you find our work useful, please cite:

```
@article{yuan2024cross,
  title={Cross-embodiment dexterous grasping with reinforcement learning},
  author={Yuan, Haoqi and Zhou, Bohan and Fu, Yuhui and Lu, Zongqing},
  journal={arXiv preprint arXiv:2410.02479},
  year={2024}
}
```