Hub
Docs
Try for Free
xiangyi-li
/
webarena
mirrored 11 minutes ago
like
0
Benchmark Card
Files and versions
Leaderboard
configs
-
test_evaluators.py
10.3 kB
test_helper_functions.py
946 B
main
/
tests
test_evaluation_harness
update test example due to html escape
2 years ago
remove exact from evalutor names
2 years ago
Shuyan Zhou
Update README.md
daee18d
remove beartype for efficency purpose
2 years ago