Rank Model Score
🥇
gemini-3.1-pro-preview Google · Proprietary
1387±31.2 1 - 7 514 $2 / $12 1M 1386.698 [1355.5, 1417.9] 252.91 · unstable Proprietary
🥈
gemini-3-pro Google · Proprietary
1376±22.8 4 - 10 1.4K $2 / $12 1M 1375.769 [1352.9, 1398.6] 135.91 · unstable Proprietary
🥉
dola-seed-2.0-preview · Proprietary
1365±47.0 181 1364.516 [1317.5, 1411.6] 575.95 · unstable Proprietary
4
kimi-k2.5-instant Moonshot AI · Modified MIT
1349±46.4 44 - 74 167 $0.44 / $2 262.1K 1349.473 [1303.0, 1395.9] 561.20 · unstable Modified MIT
5
qwen3.5-397b-a17b Alibaba · Apache 2.0
1342±41.6 28 - 52 237 $0.39 / $2.34 262.1K 1341.839 [1300.2, 1383.5] 451.23 · unstable Apache 2.0
6
kimi-k2.5-thinking Moonshot AI · Modified MIT
1341±33.6 25 - 47 386 $0.60 / $3 Unknown 1341.129 [1307.5, 1374.7] 293.67 · unstable Modified MIT
7
gemini-3-flash Google · Proprietary
1332±27.3 8 - 21 745 $0.50 / $3 1M 1332.255 [1305.0, 1359.6] 194.06 · unstable Proprietary
8
qwen3.5-122b-a10b Alibaba · Apache 2.0
1323±45.1 60 - 88 193 $0.26 / $2.08 262.1K 1322.527 [1277.5, 1367.6] 528.48 · unstable Apache 2.0
9
qwen3.5-27b Alibaba · Apache 2.0
1316±45.8 77 - 108 184 $0.20 / $1.56 262.1K 1315.955 [1270.2, 1361.7] 545.46 · unstable Apache 2.0
10
gpt-5.2-high OpenAI · Proprietary
1309±33.3 37 - 57 404 $1.75 / $14 400K 1309.410 [1276.1, 1342.8] 289.46 · unstable Proprietary
11
grok-4.20-multi-agent-beta-0309 xAI · Proprietary
1306±53.0 8 - 21 129 $2 / $6 2M 1305.628 [1252.6, 1358.6] 730.96 · unstable Proprietary
12
gemini-3-flash (thinking-minimal) Google · Proprietary
1303±28.7 15 - 29 617 $0.50 / $3 1M 1303.000 [1274.3, 1331.7] 214.02 · unstable Proprietary
13
gemini-2.5-flash-preview-09-2025 Google · Proprietary
1291±31.3 81 - 107 399 $0.30 / $2.50 1M 1291.282 [1259.9, 1322.6] 255.84 · unstable Proprietary
14
grok-4.20-beta-0309-reasoning xAI · Proprietary
1289±47.1 7 - 15 172 $2 / $6 2M 1289.101 [1242.0, 1336.2] 576.96 · unstable Proprietary
15
gemini-2.5-pro Google · Proprietary
1289±20.3 29 - 47 2.6K $1.25 / $10 1M 1288.756 [1268.4, 1309.1] 107.44 · unstable Proprietary
16
qwen3-vl-235b-a22b-instruct Alibaba · Apache 2.0
1289±26.2 61 - 93 733 $0.20 / $0.88 262.1K 1288.586 [1262.4, 1314.7] 178.04 · unstable Apache 2.0
17
gemini-2.5-flash Google · Proprietary
1284±20.8 71 - 96 2.0K $0.30 / $2.50 1M 1284.353 [1263.6, 1305.1] 112.31 · unstable Proprietary
18
gpt-5.2-chat-latest-20260210 OpenAI · Proprietary
1282±40.8 8 - 20 240 $1.75 / $14 128K 1282.359 [1241.6, 1323.1] 432.96 · unstable Proprietary
19
gemini-3.1-flash-lite-preview Google · Proprietary
1281±33.7 37 - 59 430 $0.25 / $1.50 1M 1280.658 [1247.0, 1314.3] 294.77 · unstable Proprietary
20
gpt-5.1-high OpenAI · Proprietary
1280±31.0 21 - 40 465 $1.25 / $10 400K 1279.828 [1248.8, 1310.8] 249.86 · unstable Proprietary
21
gpt-5.2 OpenAI · Proprietary
1278±32.2 40 - 59 428 $1.75 / $14 400K 1278.059 [1245.9, 1310.3] 269.87 · unstable Proprietary
22
ernie-5.0-preview-1220 Baidu · Proprietary
1269±46.6 178 1268.904 [1222.3, 1315.5] 564.92 · unstable Proprietary
23
gpt-5.4-mini-high OpenAI · Proprietary
1266±59.2 17 - 39 114 $2.50 / $15 1.1M 1265.821 [1206.7, 1325.0] 910.93 · unstable Proprietary
24
gpt-5-chat OpenAI · Proprietary
1256±22.9 51 - 80 1.3K $1.25 / $10 128K 1256.155 [1233.3, 1279.0] 136.20 · unstable Proprietary
25
gpt-5.1 OpenAI · Proprietary
1255±28.3 40 - 58 569 $1.25 / $10 400K 1254.969 [1226.7, 1283.2] 207.97 · unstable Proprietary
26
grok-4-0709 xAI · Proprietary
1254±23.4 71 - 100 1.2K $3 / $15 256K 1254.206 [1230.8, 1277.6] 142.13 · unstable Proprietary
27
chatgpt-4o-latest-20250326 OpenAI · Proprietary
1248±23.1 33 - 53 1.1K $5 / $15 128K 1247.678 [1224.6, 1270.7] 138.55 · unstable Proprietary
28
gpt-5-high OpenAI · Proprietary
1236±22.6 45 - 65 1.4K $1.25 / $10 400K 1235.861 [1213.3, 1258.5] 133.05 · unstable Proprietary
29
grok-4-1-fast-reasoning xAI · Proprietary
1233±42.8 47 - 67 230 $0.20 / $0.50 2M 1232.664 [1189.8, 1275.5] 477.27 · unstable Proprietary
30
o3-2025-04-16 OpenAI · Proprietary
1231±21.7 48 - 69 1.6K $2 / $8 200K 1230.810 [1209.1, 1252.5] 122.49 · unstable Proprietary
31
gpt-5.4-nano-high OpenAI · Proprietary
1230±54.7 77 - 109 126 $2.50 / $15 1.1M 1229.682 [1175.0, 1284.4] 778.62 · unstable Proprietary
32
gemini-2.5-flash-lite-preview-09-2025-no-thinking Google · Proprietary
1229±29.9 116 - 132 415 $0.10 / $0.40 1M 1229.464 [1199.6, 1259.4] 232.73 · unstable Proprietary
33
o1-2024-12-17 OpenAI · Proprietary
1227±57.9 88 - 110 99 $15 / $60 200K 1227.074 [1169.2, 1285.0] 873.00 · unstable Proprietary
34
gpt-4.1-2025-04-14 OpenAI · Proprietary
1226±21.5 69 - 93 1.4K $2 / $8 1M 1225.701 [1204.2, 1247.2] 120.33 · unstable Proprietary
35
gpt-5-mini-high OpenAI · Proprietary
1215±24.5 101 - 126 1.0K $0.25 / $2 400K 1214.881 [1190.4, 1239.4] 156.42 · unstable Proprietary
36
o4-mini-2025-04-16 OpenAI · Proprietary
1204±22.7 102 - 126 1.3K $1.10 / $4.40 200K 1204.209 [1181.5, 1227.0] 134.65 · unstable Proprietary
37
gemini-2.5-flash-lite-preview-06-17-thinking Google · Proprietary
1188±24.6 122 - 137 1.0K $0.10 / $0.40 1M 1188.000 [1163.4, 1212.6] 157.91 · unstable Proprietary
38
gpt-4.1-mini-2025-04-14 OpenAI · Proprietary
1186±22.8 111 - 131 1.2K $0.40 / $1.60 1M 1185.872 [1163.1, 1208.7] 135.09 · unstable Proprietary
39
gemini-1.5-pro-002 Google · Proprietary
1179±40.1 143 - 169 395 $3.50 / $10.50 2.1M 1178.622 [1138.5, 1218.7] 418.74 · unstable Proprietary
40
mistral-medium-2508 Mistral · Proprietary
1177±22.5 74 - 98 1.4K $2.70 / $8.10 32K 1177.164 [1154.6, 1199.7] 132.23 · unstable Proprietary
41
gpt-4.5-preview-2025-02-27 OpenAI · Proprietary
1167±63.1 29 - 55 77 $75 / $150 128K 1166.664 [1103.6, 1229.7] 1035.89 · unstable Proprietary
42
gemma-3-27b-it Google · Gemma
1154±26.5 131 - 147 749 $0.08 / $0.16 131.1K 1153.570 [1127.1, 1180.0] 182.19 · unstable Gemma
43
claude-3-7-sonnet-20250219 Anthropic · Proprietary
1146±48.9 127 - 142 154 $3 / $15 200K 1145.547 [1096.6, 1194.5] 623.63 · unstable Proprietary
44
mistral-small-2506 Mistral · Apache 2.0
1140±31.0 137 - 163 487 $0.10 / $0.30 32K 1139.813 [1108.8, 1170.8] 250.35 · unstable Apache 2.0
45
mistral-medium-2505 Mistral · Proprietary
1139±30.3 105 - 127 485 $0.40 / $2 131.1K 1138.516 [1108.2, 1168.8] 239.27 · unstable Proprietary
46
gemini-2.0-flash-001 Google · Proprietary
1137±32.9 135 - 156 342 $0.10 / $0.40 1M 1136.647 [1103.7, 1169.6] 282.30 · unstable Proprietary
47
mistral-small-3.1-24b-instruct-2503 Mistral · Apache 2.0
1137±27.5 207 - 221 731 $0.10 / $0.30 32K 1136.582 [1109.1, 1164.1] 197.00 · unstable Apache 2.0
48
gemini-1.5-flash-002 Google · Proprietary
1129±40.9 196 - 220 370 $0.07 / $0.30 1M 1129.057 [1088.1, 1170.0] 435.63 · unstable Proprietary
49
gpt-4o-2024-05-13 OpenAI · Proprietary
1124±34.1 150 - 173 1.5K $5 / $15 128K 1123.981 [1089.9, 1158.1] 302.65 · unstable Proprietary
50
llama-4-maverick-17b-128e-instruct Meta · Llama 4
1121±39.3 173 - 200 234 $0.63 / $1.80 131.1K 1121.243 [1081.9, 1160.5] 402.10 · unstable Llama 4
51
claude-3-5-sonnet-20241022 Anthropic · Proprietary
1112±34.8 127 - 138 514 $3 / $15 200K 1111.780 [1077.0, 1146.6] 315.04 · unstable Proprietary
52
qwen2.5-vl-72b-instruct Alibaba · Qwen
1098±54.0 119 1098.311 [1044.3, 1152.3] 759.72 · unstable Qwen
53
claude-3-5-sonnet-20240620 Anthropic · Proprietary
1097±35.1 152 - 179 1.6K $3 / $15 200K 1096.990 [1061.8, 1132.1] 321.57 · unstable Proprietary
54
pixtral-large-2411 Mistral · MRL
1075±48.9 156 1074.658 [1025.8, 1123.6] 622.51 · unstable MRL
55
llama-4-scout-17b-16e-instruct Meta · Llama
1070±40.3 179 - 206 238 $0.40 / $0.70 8.2K 1070.500 [1030.2, 1110.8] 423.09 · unstable Llama
56
qwen2-vl-72b Alibaba · Qwen
1064±42.1 291 1063.652 [1021.5, 1105.8] 462.30 · unstable Qwen
57
internvl2-26b · MIT
1057±45.4 287 1056.651 [1011.3, 1102.0] 535.87 · unstable MIT
58
gpt-4-turbo-2024-04-09 OpenAI · Proprietary
1037±36.9 179 - 205 937 $10 / $30 128K 1036.757 [999.9, 1073.6] 354.22 · unstable Proprietary
59
gpt-4o-mini-2024-07-18 OpenAI · Proprietary
1025±34.1 184 - 209 952 $0.15 / $0.60 128K 1024.531 [990.4, 1058.6] 302.38 · unstable Proprietary
60
gemini-1.5-pro-001 Google · Proprietary
1022±36.4 179 - 205 1.2K $3.50 / $10.50 2.1M 1022.091 [985.7, 1058.5] 344.77 · unstable Proprietary
61
gemini-2.0-flash-lite-preview-02-05 Google · Proprietary
1017±60.9 141 - 168 98 $0.07 / $0.30 1M 1016.900 [956.0, 1077.8] 965.69 · unstable Proprietary
62
gpt-4o-2024-08-06 OpenAI · Proprietary
1002±49.5 157 - 187 186 $2.50 / $10 128K 1001.503 [952.0, 1051.0] 637.36 · unstable Proprietary
63
qwen2-vl-7b-instruct · Apache 2.0
991±42.8 298 991.265 [948.5, 1034.0] 476.13 · unstable Apache 2.0
64
claude-3-opus-20240229 Anthropic · Proprietary
986±36.7 181 - 206 1.0K $15 / $75 200K 985.884 [949.2, 1022.5] 349.68 · unstable Proprietary
65
gemini-1.5-flash-8b-001 Google · Proprietary
966±41.4 242 - 253 332 $0.07 / $0.30 1M 965.600 [924.2, 1007.0] 445.49 · unstable Proprietary
66
gemini-1.5-flash-001 Google · Proprietary
952±37.1 221 - 236 950 $0.07 / $0.30 1M 951.505 [914.4, 988.6] 359.16 · unstable Proprietary
67
llama-3.2-vision-90b-instruct Meta · Llama 3.2
944±39.8 393 944.275 [904.4, 984.1] 413.30 · unstable Llama 3.2
68
pixtral-12b-2409 Mistral · Apache 2.0
943±41.2 335 942.577 [901.4, 983.8] 442.22 · unstable Apache 2.0
69
internvl2-4b · MIT
942±52.5 177 942.144 [889.6, 994.6] 717.39 · unstable MIT
70
claude-3-sonnet-20240229 Anthropic · Proprietary
934±37.2 225 - 242 915 $3 / $15 200K 934.083 [896.8, 971.3] 361.03 · unstable Proprietary
71
molmo-72b-0924 · Apache 2.0
928±52.4 182 928.460 [876.0, 980.9] 715.93 · unstable Apache 2.0
72
claude-3-haiku-20240307 Anthropic · Proprietary
907±37.2 242 - 252 1.0K $0.25 / $1.25 200K 907.357 [870.1, 944.6] 360.75 · unstable Proprietary
73
llama-3.2-vision-11b-instruct Meta · Llama 3.2
907±45.8 266 906.784 [861.0, 952.6] 546.69 · unstable Llama 3.2
74
llava-v1.6-34b · Apache 2.0
907±43.3 415 906.568 [863.3, 949.8] 487.45 · unstable Apache 2.0
75
molmo-7b-d-0924 · Apache 2.0
882±51.4 171 882.151 [830.8, 933.5] 686.81 · unstable Apache 2.0
