abderrahmane-br/humaneval
mirrored 2 minutes ago
main
Name                     Size      Last commit                                         Updated
Dockerfile               231 B     refactor: reorganize to BenchHub folder structure   a month ago
README.md                15 B      refactor: reorganize to BenchHub folder structure   a month ago
benchflow_interface.py   3.4 kB    refactor: removed local testing code                a month ago
data                     -         feat: added debugging log messages                  a month ago
entrypoint.sh            1.61 kB   feat: added debugging log messages                  a month ago
evaluate_from_api.py     6.43 kB   feat: added debugging log messages                  a month ago
requirements.txt         44 B      feat: added debugging log messages                  a month ago
test_agent.py            6.38 kB   initial commit                                      a month ago
utils                    -         fix: updated post_log function                      a month ago
Latest commit: Abderrhmn, fix: updated post_log function (48fca39)