Hub
Docs
Try for Free
xiangyi-li
/
webarena
mirrored 2 minutes ago
Benchmark Card
Files and versions
Leaderboard
main
/
tests
test_evaluation_harness
configs
-
test_evaluators.py
10.3 kB
test_helper_functions.py
946 B
like
0
update test example due to html escape
2 years ago
remove exact from evalutor names
2 years ago
Shuyan Zhou
Update README.md
daee18d
remove beartype for efficency purpose
2 years ago