Hub
    Docs
Try for Free
xiangyi-li
/
OS-World
mirrored 7 minutes ago
Benchmark CardFiles and versionsLeaderboard
  • Hub
  • Contact
DiscordGitHubXLinkedIn
0
  1. /
  2. examples
  3. evaluation_examples
  4. multi_apps
  • 00fa164e-2612-4439-992e-157d019a8436.json
    2.61 kB
    ​
  • 02ce9a50-7af2-47ed-8596-af0c230501f8.json
    1.73 kB
    ​
  • 09a37c51-e625-49f4-a514-20a773797a8a.json
    1.8 kB
    ​
  • 0c825995-5b70-4526-b663-113f4c999dd2.json
    4.22 kB
    ​
  • 0e5303d4-8820-42f6-b18d-daf7e633de21.json
    2.43 kB
    ​
  • 185f29bd-5da0-40a6-b69c-ba7f4e0324ef.json
    4.33 kB
    ​
  • 1f18aa87-af6f-41ef-9853-cdb8f32ebdea.json
    4.79 kB
    ​
  • 20236825-b5df-46e7-89bf-62e1d640a897.json
    1.95 kB
    ​
  • 227d2f97-562b-4ccb-ae47-a5ec9e142fbb.json
    1.93 kB
    ​
  • 22a4636f-8179-4357-8e87-d1743ece1f81.json
    2.79 kB
    ​
  • 236833a3-5704-47fc-888c-4f298f09f799.json
    2.14 kB
    ​
  • 2373b66a-092d-44cb-bfd7-82e86e7a3b4d.json
    1.53 kB
    ​
  • 26150609-0da3-4a7d-8868-0faf9c5f01bb.json
    2.85 kB
    ​
  • 26660ad1-6ebb-4f59-8cba-a8432dfe8d38.json
    1.66 kB
    ​
  • 2b9493d7-49b8-493a-a71b-56cd1f4d6908.json
    2.5 kB
    ​
  • 2c1ebcd7-9c6d-4c9a-afad-900e381ecd5e.json
    6.29 kB
    ​
  • 2c9fc0de-3ee7-45e1-a5df-c86206ad78b5.json
    2.59 kB
    ​
  • 2fe4b718-3bd7-46ec-bdce-b184f5653624.json
    1.44 kB
    ​
  • 337d318b-aa07-4f4f-b763-89d9a2dd013f.json
    2.1 kB
    ​
  • 36037439-2044-4b50-b9d1-875b5a332143.json
    1.59 kB
    ​
  • 3680a5ee-6870-426a-a997-eba929a0d25c.json
    2.3 kB
    ​
  • 3a93cae4-ad3e-403e-8c12-65303b271818.json
    6.03 kB
    ​
  • 3c8f201a-009d-4bbe-8b65-a6f8b35bb57f.json
    892 B
    ​
  • 3e3fc409-bff3-4905-bf16-c968eee3f807.json
    2.24 kB
    ​
  • 3f05f3b9-29ba-4b6b-95aa-2204697ffc06.json
    8.88 kB
    ​
  • 415ef462-bed3-493a-ac36-ca8c6d23bf1b.json
    5.99 kB
    ​
  • 42d25c08-fb87-4927-8b65-93631280a26f.json
    5.68 kB
    ​
  • 42f4d1c7-4521-4161-b646-0a8934e36081.json
    1.57 kB
    ​
  • 46407397-a7d5-4c6b-92c6-dbe038b1457b.json
    3.98 kB
    ​
  • 47f7c0ce-a5fb-4100-a5e6-65cd0e7429e5.json
    2.59 kB
    ​
  • 48c46dc7-fe04-4505-ade7-723cba1aa6f6.json
    5.08 kB
    ​
  • 48d05431-6cd5-4e76-82eb-12b60d823f7d.json
    1.41 kB
    ​
  • 4c26e3f3-3a14-4d86-b44a-d3cedebbb487.json
    1.49 kB
    ​
  • 4e9f0faf-2ecc-4ae8-a804-28c9a75d1ddc.json
    2.41 kB
    ​
  • 510f64c8-9bcc-4be1-8d30-638705850618.json
    3.4 kB
    ​
  • 51f5801c-18b3-4f25-b0c3-02f85507a078.json
    1.76 kB
    ​
  • 58565672-7bfe-48ab-b828-db349231de6b.json
    2.12 kB
    ​
  • 5990457f-2adb-467b-a4af-5c857c92d762.json
    2.69 kB
    ​
  • 5bc63fb9-276a-4439-a7c1-9dc76401737f.json
    2.3 kB
    ​
  • 5df7b33a-9f77-4101-823e-02f863e1c1ae.json
    2.12 kB
    ​
  • 67890eb6-6ce5-4c00-9e3d-fb4972699b06.json
    2.81 kB
    ​
  • 68a25bd4-59c7-4f4d-975e-da0c8509c848.json
    2.42 kB
    ​
  • 69acbb55-d945-4927-a87b-8480e1a5bb7e.json
    1.38 kB
    ​
  • 6d72aad6-187a-4392-a4c4-ed87269c51cf.json
    588 B
    ​
  • 6f4073b8-d8ea-4ade-8a18-c5d1d5d5aa9a.json
    2.78 kB
    ​
  • 716a6079-22da-47f1-ba73-c9d58f986a38.json
    1.52 kB
    ​
  • 74d5859f-ed66-4d3e-aa0e-93d7a592ce41.json
    4.66 kB
    ​
  • 778efd0a-153f-4842-9214-f05fc176b877.json
    2.57 kB
    ​
  • 788b3701-3ec9-4b67-b679-418bfa726c22.json
    4.53 kB
    ​
  • 78aed49a-a710-4321-a793-b611a7c5b56b.json
    4.61 kB
    ​
  • 7e287123-70ca-47b9-8521-47db09b69b14.json
    7.42 kB
    ​
  • 7f35355e-02a6-45b5-b140-f0be698bcf85.json
    1.27 kB
    ​
  • 7ff48d5b-2df2-49da-b500-a5150ffc7f18.json
    3.98 kB
    ​
  • 81c425f5-78f3-4771-afd6-3d2973825947.json
    2.04 kB
    ​
  • 82e3c869-49f6-4305-a7ce-f3e64a0618e7.json
    4.16 kB
    ​
  • 869de13e-bef9-4b91-ba51-f6708c40b096.json
    6.78 kB
    ​
  • 873cafdd-a581-47f6-8b33-b9696ddb7b05.json
    1.82 kB
    ​
  • 881deb30-9549-4583-a841-8270c65f2a17.json
    7.93 kB
    ​
  • 897e3b53-5d4d-444b-85cb-2cdc8a97d903.json
    2.79 kB
    ​
  • 8df7e444-8e06-4f93-8a1a-c5c974269d82.json
    1.98 kB
    ​
  • 8e116af7-7db7-4e35-a68b-b0939c066c78.json
    6.45 kB
    ​
  • 91190194-f406-4cd6-b3f9-c43fac942b22.json
    1.34 kB
    ​
  • 9219480b-3aed-47fc-8bac-d2cffc5849f7.json
    2.63 kB
    ​
  • 937087b6-f668-4ba6-9110-60682ee33441.json
    634 B
    ​
  • 98e8e339-5f91-4ed2-b2b2-12647cb134f4.json
    1.63 kB
    ​
  • 9f3bb592-209d-43bc-bb47-d77d9df56504.json
    2.27 kB
    ​
  • a0b9dc9c-fc07-4a88-8c5d-5e3ecad91bcb.json
    3.57 kB
    ​
  • a503b07f-9119-456b-b75d-f5146737d24f.json
    1.21 kB
    ​
  • a74b607e-6bb5-4ea8-8a7c-5d97c7bbcd2a.json
    1.8 kB
    ​
  • a82b78bb-7fde-4cb3-94a4-035baf10bcf0.json
    2.7 kB
    ​
  • aad10cd7-9337-4b62-b704-a857848cedf2.json
    1.92 kB
    ​
  • acb0f96b-e27c-44d8-b55f-7cb76609dfcd.json
    1.32 kB
    ​
  • aceb0368-56b8-4073-b70e-3dc9aee184e0.json
    2.18 kB
    ​
  • b337d106-053f-4d37-8da0-7f9c4043a66b.json
    2.12 kB
    ​
  • b5062e3e-641c-4e3a-907b-ac864d2e7652.json
    3.29 kB
    ​
  • b52b40a5-ad70-4c53-b5b0-5650a8387052.json
    2.96 kB
    ​
  • bb83cab4-e5c7-42c7-a67b-e46068032b86.json
    2 kB
    ​
  • bc2b57f3-686d-4ec9-87ce-edf850b7e442.json
    3.4 kB
    ​
  • c2751594-0cd5-4088-be1b-b5f2f9ec97c4.json
    1.6 kB
    ​
  • c7c1e4c3-9e92-4eba-a4b8-689953975ea4.json
    2.51 kB
    ​
  • c867c42d-a52d-4a24-8ae3-f75d256b5618.json
    2.48 kB
    ​
  • ce2b64a2-ddc1-4f91-8c7d-a88be7121aac.json
    2.65 kB
    ​
  • d1acdb87-bb67-4f30-84aa-990e56a09c92.json
    4.02 kB
    ​
  • d68204bf-11c1-4b13-b48b-d303c73d4bf6.json
    1.46 kB
    ​
  • d9b7c649-c975-4f53-88f5-940b29c47247.json
    1.99 kB
    ​
  • da52d699-e8d2-4dc5-9191-a2199e0b6a9b.json
    2.47 kB
    ​
  • da922383-bfa4-4cd3-bbad-6bebab3d7742.json
    1.91 kB
    ​
  • dd60633f-2c72-42ba-8547-6f2c8cb0fdb0.json
    1.47 kB
    ​
  • deec51c9-3b1e-4b9e-993c-4776f20e8bb2.json
    2.66 kB
    ​
  • df67aebb-fb3a-44fd-b75b-51b6012df509.json
    2.44 kB
    ​
  • e135df7c-7687-4ac0-a5f0-76b74438b53e.json
    2.71 kB
    ​
  • e1fc0df3-c8b9-4ee7-864c-d0b590d3aa56.json
    1.6 kB
    ​
  • e2392362-125e-4f76-a2ee-524b183a3412.json
    3.14 kB
    ​
  • e8172110-ec08-421b-a6f5-842e6451911f.json
    2.08 kB
    ​
  • eb303e01-261e-4972-8c07-c9b4e7a4922a.json
    2.59 kB
    ​
  • ee9a3c83-f437-4879-8918-be5efbb9fac7.json
    2.7 kB
    ​
  • f5c13cdd-205c-4719-a562-348ae5cd1d91.json
    3.48 kB
    ​
  • f7dfbef3-7697-431c-883a-db8583a4e4f9.json
    3.04 kB
    ​
  • f8369178-fafe-40c2-adc4-b9b08a125456.json
    941 B
    ​
  • f8cfa149-d1c1-4215-8dac-4a0932bad3c2.json
    1.75 kB
    ​
  • f918266a-b3e0-4914-865d-4faa564f1aef.json
    1.45 kB
    ​
Kaixin LiGet VM IP again when getting screenshot fails (#215) In rare cases, the IP of the VM changes after it launches. We can get the IP every time we retry to ensure the correct connection.347238e
add proxy
9 days ago
fix: correct URL encoding in JSON examples for invoice paths
7 days ago
fix: correct URL encoding in JSON examples for invoice paths
7 days ago
edit prompt
9 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
10 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago
refactor: update URLs in multiple JSON files to ensure proper encoding of special characters
8 days ago