This repository contains PyTorch implementations of two deep reinforcement learning algorithms (PPO, VPG) for various environments including CartPole, Pendulum, and Panda robotic tasks.
Success: mild --> still struggles to converge consistenly, but don't we all?
- Python 3.8+
- PyTorch
- Gymnasium
- Panda-gym
- TensorBoard
- Other dependencies as specified in
setup.py
- Clone the repository:
git clone https://github.com/yourusername/lr_challenge.git
cd lr_challenge
- Create and activate a virtual environment:
python -m venv venv
# On Windows
.\venv\Scripts\activate
# On Linux/Mac
source venv/bin/activate
- Install the package in development mode:
pip install -e .
The repository contains various training scripts organized by environment:
python scripts/cartpole/cartpole_ppo_train.py
python scripts/pendulum/pendulum_ppo_train.py
python scripts/panda/pnp_pposb3_train.py # Pick and Place task
python scripts/panda/panda_slide_ppo_train.py # Sliding task
Training progress can be monitored using TensorBoard. The training scripts automatically log metrics to the runs
directory.
To start TensorBoard:
- Activate your virtual environment if not already activated
- Run TensorBoard:
tensorboard --logdir runs
- Open your web browser and navigate to
http://localhost:6006
TensorBoard will display training metrics including:
- Reward statistics
- Policy loss
- Value loss
- Other relevant metrics
lr_challenge/
├── lr_challenge/
│ ├── algorithms/ # RL algorithm implementations
│ ├── learning/ # Core learning utilities
│ └── ...
├── scripts/ # Training scripts
│ ├── cartpole/
│ ├── pendulum/
│ └── panda/
└── runs/ # TensorBoard logs and saved models
MIT License - See LICENSE file for details