Hub
    Docs
Try for Free
xiangyi-li
/
webarena
mirrored 4 minutes ago
Benchmark CardFiles and versionsLeaderboard
  • Hub
  • Contact
DiscordGitHubXLinkedIn
0
  • .github
    -
    ​
  • .gitignore
    2.21 kB
    ​
  • .pre-commit-config.yaml
    638 B
    ​
  • CITATION.cff
    353 B
    ​
  • LICENSE
    11.4 kB
    ​
  • README.md
    9.72 kB
    ​
  • agent
    -
    ​
  • browser_env
    -
    ​
  • check_errors.sh
    814 B
    ​
  • config_files
    -
    ​
  • environment_docker
    -
    ​
  • evaluation_harness
    -
    ​
  • llms
    -
    ​
  • media
    -
    ​
  • minimal_example.py
    4.62 kB
    ​
  • parallel_run.sh
    2.53 kB
    ​
  • prepare.sh
    120 B
    ​
  • requirements.txt
    156 B
    ​
  • resources
    -
    ​
  • run.py
    14.5 kB
    ​
  • scripts
    -
    ​
  • setup.cfg
    369 B
    ​
  • setup.py
    69 B
    ​
  • tests
    -
    ​
fix type errors
2 years ago
fix type errors
2 years ago
Shuyan ZhouUpdate README.mddaee18d
minor
2 years ago
add openai and transformers lib version
2 years ago
Update README.md
8 months ago
add parallel running script
2 years ago
minor
2 years ago
add v2 execution trajectories
2 years ago
add comment
2 years ago
fix typo
2 years ago
Update helper_functions.py
a year ago
Create CITATION.cff
2 years ago
Update tests configs to fit the current settings
2 years ago
add instruction for self-hosting webarena
2 years ago
update README
2 years ago
add human trajectories
2 years ago
add human trajectories
2 years ago
Update README.md
7 months ago
fix typo in intent
a year ago
Merge remote-tracking branch 'origin/main' into new_eval
2 years ago
release commit
2 years ago
release commit
2 years ago
release commit
2 years ago
release commit
2 years ago