xiangyi-li/webarena
main
scripts
check_error_runs.py    4.96 kB
collect_obs.py         1.58 kB
generate_test_data.py  838 B
html2json.py           4.87 kB
webarena-zeno.ipynb    10.9 kB
fix type errors (2 years ago)
minor (2 years ago)
minor (2 years ago)
Apply pre-commit formatting fixes (3 months ago)
Latest commit: dce0468, Shuyan Zhou, "Merge pull request #183 from alzambranolu13/patch-2 Update README.md"
release commit (2 years ago)
webarena/xiangyi-li · BenchFlow