Hub
    Docs
Try for Free
xiangyi-li
/
webarena
mirrored 4 minutes ago
Benchmark CardFiles and versionsLeaderboard
  • Hub
  • Contact
DiscordGitHubXLinkedIn
0
  1. scripts
  • check_error_runs.py
    4.96 kB
    ​
  • collect_obs.py
    1.63 kB
    ​
  • generate_test_data.py
    838 B
    ​
  • html2json.py
    4.87 kB
    ​
  • webarena-zeno.ipynb
    10.9 kB
    ​
fix type errors
2 years ago
minor
2 years ago
minor
2 years ago
add human trajectories
2 years ago
Shuyan ZhouUpdate README.mddaee18d
release commit
2 years ago