RepoTransBench Dataset v1.0

Released by @wangyanli3630 on 29 Jul 19:14 · commit 7c096b0 · 5 commits to main since this release

This release contains the complete RepoTransBench dataset (374 MB): 1,897 repository samples across 13 translation pairs.

What's included:

  • projects.tar.gz: Complete dataset with all repository samples
  • Automated test suites with high coverage
  • Repository structures with dependencies

Usage:

# Download and extract
wget https://github.com/DeepSoftwareAnalytics/RepoTransBench/releases/download/v1.0/projects.tar.gz
tar -xzf projects.tar.gz

# Or use our script
python download_dataset.py
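
If wget and tar are not available, the same two steps can be sketched in plain Python. This is a minimal illustration using only the standard library, not the contents of download_dataset.py (which is not shown in these notes). The function names `download` and `extract` and the constant `DATASET_URL` are hypothetical; only the URL itself comes from the usage above.

```python
import shutil
import tarfile
import urllib.request
from pathlib import Path

# URL copied from the usage section of these release notes.
DATASET_URL = (
    "https://github.com/DeepSoftwareAnalytics/RepoTransBench"
    "/releases/download/v1.0/projects.tar.gz"
)


def download(url: str, dest: Path) -> Path:
    """Stream the archive to disk so the whole file is never held in memory."""
    dest.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest


def extract(archive: Path, target: Path) -> list[str]:
    """Extract a .tar.gz archive and return the member names.

    Performs a basic check that no member name resolves outside `target`.
    This is a sanity check, not a complete hardening against hostile archives.
    """
    target = target.resolve()
    target.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        members = tar.getmembers()
        for member in members:
            resolved = (target / member.name).resolve()
            if resolved != target and target not in resolved.parents:
                raise ValueError(f"unsafe path in archive: {member.name}")
        tar.extractall(target, members=members)
    return [m.name for m in members]
```

Calling `extract(download(DATASET_URL, Path("projects.tar.gz")), Path("."))` would mirror the wget and tar commands above. The layout of the extracted directory is not described in these notes, so inspect the returned member names rather than assuming a structure.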