You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @Hodge931 , yes that should be sufficient, although it might be best to use the exact commit that we used when we generated those results (maybe @xingyaoww can provide that).
Is there an existing issue for the same bug?
Describe the bug and reproduction steps
Is it sufficient to reproduce the swebench results (53.8% in verified set) by following the readme at https://github.com/All-Hands-AI/OpenHands/tree/main/evaluation/benchmarks/swe_bench?
Thanks so much!
OpenHands Installation
Docker command in README
OpenHands Version
No response
Operating System
None
Logs, Errors, Screenshots, and Additional Context
No response
The text was updated successfully, but these errors were encountered: