xiangyi-li/OS-World
mirrored 4 minutes ago
  • 3aaa4e37-dc91-482e-99af-132a612d40f3.json  1.5 kB
  • 4188d3a4-077d-46b7-9c86-23e1a036f6c1.json  2.2 kB
  • 51b11269-2ca8-4b2a-9163-f21758420e78.json  2.17 kB
  • 6054afcb-5bab-4702-90a0-b259b5d3217c.json  2.4 kB
  • 7a4e4bc8-922c-4c84-865c-25ba34136be1.json  2.17 kB
  • 7efeb4b1-3d19-4762-b163-63328d66303b.json  2.06 kB
  • 8b1ce5f2-59d2-4dcc-b0b0-666a714b9a14.json  2.41 kB
  • a9f325aa-8c05-4e4f-8341-9e4358565f4f.json  2.23 kB
  • abed40dc-063f-4598-8ba5-9fe749c0615d.json  2.24 kB
  • eb03d19a-b88d-4de4-8a64-ca0ac66f426b.json  2.15 kB
  • ecb0df7a-4e8d-4a03-b162-053391d3afaf.json  2.78 kB
  1. /
  2. examples_windows
  3. evaluation_examples
  4. excel
yuanmengqi
feat: enhance run_coact.py with logging and configuration options
- Added logging configuration to capture runtime logs in both file and console with adjustable log levels.
- Introduced new command-line arguments for provider name, region, and client password to improve flexibility and security.
- Updated process_task function to accommodate new parameters, ensuring compatibility with existing logic.
- Modified prompt templates in coding_agent.py and cua_agent.py to use the client password placeholder for enhanced security.
84f407a
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
2 months ago
feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2 months ago