排名 模型 得分
🥇
gemini-3-pro Google · Proprietary
1369±36.6 4 - 10 204 $2 / $12 1M 1369.081 [1332.5, 1405.6] 347.87 · 不稳 Proprietary
🥈
gemini-3-flash Google · Proprietary
1354±45.9 8 - 21 127 $0.50 / $3 1M 1354.229 [1308.3, 1400.1] 548.91 · 不稳 Proprietary
🥉
gemini-3-flash (thinking-minimal) Google · Proprietary
1344±50.6 15 - 29 105 $0.50 / $3 1M 1343.877 [1293.3, 1394.5] 666.50 · 不稳 Proprietary
4
grok-4-0709 xAI · Proprietary
1286±29.5 71 - 100 351 $3 / $15 256K 1286.032 [1256.6, 1315.5] 225.87 · 不稳 Proprietary
5
gemini-2.5-pro Google · Proprietary
1282±19.6 29 - 47 808 $1.25 / $10 1M 1281.568 [1261.9, 1301.2] 100.29 · 不稳 Proprietary
6
gpt-5-high OpenAI · Proprietary
1263±25.3 45 - 65 437 $1.25 / $10 400K 1262.804 [1237.5, 1288.1] 166.80 · 不稳 Proprietary
7
chatgpt-4o-latest-20250326 OpenAI · Proprietary
1254±29.4 33 - 53 280 $5 / $15 128K 1253.982 [1224.6, 1283.3] 224.40 · 不稳 Proprietary
8
gpt-5.1-high OpenAI · Proprietary
1252±53.2 21 - 40 90 $1.25 / $10 400K 1252.297 [1199.1, 1305.5] 737.74 · 不稳 Proprietary
9
gemini-2.5-flash Google · Proprietary
1248±23.3 71 - 96 510 $0.30 / $2.50 1M 1247.830 [1224.5, 1271.2] 141.60 · 不稳 Proprietary
10
qwen3-vl-235b-a22b-instruct 阿里巴巴 · Apache 2.0
1245±41.9 61 - 93 138 $0.20 / $0.88 262.1K 1244.679 [1202.7, 1286.6] 457.85 · 不稳 Apache 2.0
11
o3-2025-04-16 OpenAI · Proprietary
1237±24.3 48 - 69 513 $2 / $8 200K 1236.748 [1212.4, 1261.1] 154.30 · 不稳 Proprietary
12
gpt-5-mini-high OpenAI · Proprietary
1223±31.0 101 - 126 290 $0.25 / $2 400K 1223.436 [1192.4, 1254.5] 250.42 · 不稳 Proprietary
13
gpt-5-chat OpenAI · Proprietary
1219±27.5 51 - 80 393 $1.25 / $10 128K 1219.480 [1191.9, 1247.0] 197.54 · 不稳 Proprietary
14
o4-mini-2025-04-16 OpenAI · Proprietary
1217±26.7 102 - 126 408 $1.10 / $4.40 200K 1216.931 [1190.3, 1243.6] 185.21 · 不稳 Proprietary
15
gpt-4.1-2025-04-14 OpenAI · Proprietary
1206±26.8 69 - 93 415 $2 / $8 1M 1205.839 [1179.1, 1232.6] 186.41 · 不稳 Proprietary
16
gemma-3-27b-it Google · Gemma
1199±28.9 131 - 147 331 $0.08 / $0.16 131.1K 1199.213 [1170.3, 1228.1] 217.77 · 不稳 Gemma
17
gpt-5.1 OpenAI · Proprietary
1198±46.2 40 - 58 115 $1.25 / $10 400K 1198.408 [1152.2, 1244.6] 556.22 · 不稳 Proprietary
18
gemini-2.5-flash-lite-preview-06-17-thinking Google · Proprietary
1188±26.5 122 - 137 430 $0.10 / $0.40 1M 1188.000 [1161.5, 1214.5] 182.99 · 不稳 Proprietary
19
gpt-4.1-mini-2025-04-14 OpenAI · Proprietary
1184±27.2 111 - 131 379 $0.40 / $1.60 1M 1183.977 [1156.8, 1211.2] 192.76 · 不稳 Proprietary
20
mistral-small-3.1-24b-instruct-2503 Mistral · Apache 2.0
1156±35.3 207 - 221 256 $0.10 / $0.30 32K 1156.007 [1120.7, 1191.3] 324.66 · 不稳 Apache 2.0
21
mistral-medium-2508 Mistral · Proprietary
1152±27.6 74 - 98 435 $2.70 / $8.10 32K 1152.103 [1124.5, 1179.7] 198.58 · 不稳 Proprietary
22
mistral-small-2506 Mistral · Apache 2.0
1150±37.3 137 - 163 217 $0.10 / $0.30 32K 1150.479 [1113.2, 1187.7] 361.34 · 不稳 Apache 2.0
23
mistral-medium-2505 Mistral · Proprietary
1144±35.9 105 - 127 206 $0.40 / $2 131.1K 1144.110 [1108.2, 1180.1] 336.39 · 不稳 Proprietary

没有找到相关模型