xiangyi-li/OS-World
mirrored 7 minutes ago
  • README.md (7.76 kB)
  • __init__.py (108 B)
  • getters/
  • metrics/
feat: enhance image comparison functionality in gimp.py
- Added resizing logic to handle images of different sizes before comparison, ensuring consistent evaluation.
- Implemented mode conversion to ensure both images are in the same format for accurate comparison.
- Enhanced structure check by MSE to support conversion of numpy arrays to PIL Images, improving compatibility.
- Maintained existing logic while improving robustness and accuracy of image comparison methods.
3 months ago
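The normalization steps named in the gimp.py commit message (accept numpy arrays, match modes, match sizes, then compare by MSE) can be sketched roughly as follows. This is only an illustration, not the code from gimp.py: the helper names `to_pil`, `normalized_pair`, and `mse` are hypothetical, and the choice to convert and resize the second image to match the first is an assumption.

```python
import numpy as np
from PIL import Image


def to_pil(img):
    """Return a PIL Image, converting from a numpy array if needed (hypothetical helper)."""
    if isinstance(img, np.ndarray):
        return Image.fromarray(img.astype(np.uint8))
    return img


def normalized_pair(img1, img2):
    """Bring both inputs to the same mode and size (assumption: img1 is the reference)."""
    a, b = to_pil(img1), to_pil(img2)
    if a.mode != b.mode:
        b = b.convert(a.mode)   # e.g. "L" -> "RGB"
    if a.size != b.size:
        b = b.resize(a.size)    # Pillow's default resampling filter
    return a, b


def mse(img1, img2):
    """Mean squared error over all pixels and channels after normalization."""
    a, b = normalized_pair(img1, img2)
    arr_a = np.asarray(a, dtype=np.float64)
    arr_b = np.asarray(b, dtype=np.float64)
    return float(np.mean((arr_a - arr_b) ** 2))
```

A caller would then compare the returned value against a threshold of its own choosing; the commit message does not state which threshold the evaluator uses.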
Timothyxxx: Add new section in README for OSWorld-MCP project (8365edc)
Add safe browsing feature to Chrome evaluator
- Implemented `get_enable_safe_browsing` function to retrieve safe browsing settings based on the operating system.
- Updated the `__init__.py` to include the new function.
- Modified JSON examples to reflect the change from enabling enhanced safety browsing to enabling safe browsing.
- Added necessary commands in the JSON examples for setting up preferences for safe browsing.
a month ago
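As a rough illustration of what an OS-dependent safe browsing lookup might involve, the sketch below locates Chrome's `Preferences` file per platform and reads a flag from it. It is not the repository's `get_enable_safe_browsing`, whose signature and behavior are not shown here. The function names are hypothetical, and the default profile paths and the `safebrowsing.enabled` key are assumptions about a standard Chrome installation.

```python
import json
import os
import platform


def chrome_preferences_path(system=None):
    """Guess the default-profile Preferences path (assumed standard install locations)."""
    system = system or platform.system()
    if system == "Windows":
        base = os.path.join(os.environ.get("LOCALAPPDATA", ""),
                            "Google", "Chrome", "User Data")
    elif system == "Darwin":
        base = os.path.expanduser("~/Library/Application Support/Google/Chrome")
    elif system == "Linux":
        base = os.path.expanduser("~/.config/google-chrome")
    else:
        raise ValueError(f"Unsupported operating system: {system}")
    return os.path.join(base, "Default", "Preferences")


def load_preferences(path):
    """Parse the Preferences file, which Chrome stores as JSON."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def safe_browsing_enabled(prefs):
    """Read the assumed `safebrowsing.enabled` flag; a missing key counts as disabled."""
    return bool(prefs.get("safebrowsing", {}).get("enabled", False))
```

In a desktop-environment benchmark the file would normally be read from inside the virtual machine rather than from the host, so the path lookup here only stands in for whatever mechanism the evaluator actually uses.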
ver Dec22nd re-organized the evaluator structure to improve the extensibility
2 years ago
Clean code; Add todos in desktop_env README
2 years ago
/desktop_env/evaluators