RepoTransBench Dataset v1.0
This release contains the complete RepoTransBench dataset (374MB) with 1,897 repository samples across 13 translation pairs.
What's included:
projects.tar.gz: Complete dataset with all repository samples- Automated test suites with high coverage
- Repository structures with dependencies
Usage:
# Download and extract
wget https://github.com/DeepSoftwareAnalytics/RepoTransBench/releases/download/v1.0/projects.tar.gz
tar -xzf projects.tar.gz
# Or use our script
python download_dataset.py