| Rank | Model | Provider | License | Score | Rank range | Votes | Price (in / out) | Context | Exact score | Interval | Variance | Stability |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 🥇 | gpt-5.4-medium (codex-harness) | OpenAI | Proprietary | 1473±47.6 | 5 - 32 | 165 | $2.50 / $15 | 1.1M | 1473.494 | [1425.9, 1521.1] | 589.64 | unstable |
| 🥈 | gpt-5.4-high (codex-harness) | OpenAI | Proprietary | 1458±48.6 | 11 - 25 | 160 | $2.50 / $15 | 1.1M | 1457.633 | [1409.0, 1506.2] | 614.80 | unstable |
| 🥉 | gpt-5.3-codex (codex-harness) | OpenAI | Proprietary | 1423±31.9 | 10 - 18 | 364 | $1.75 / $14 | 400K | 1422.540 | [1390.7, 1454.4] | 264.56 | unstable |
| 4 | gpt-5.4-mini-high | OpenAI | Proprietary | 1419±43.9 | 4 - 40 | 207 | $0.75 / $4.50 | 400K | 1419.164 | [1375.3, 1463.0] | 501.25 | unstable |
| 5 | gpt-5.2 | OpenAI | Proprietary | 1412±17.0 | 4 - 76 | 1.5K | $1.75 / $14 | 400K | 1411.964 | [1395.0, 1428.9] | 74.80 | unstable |
| 6 | gpt-5-medium | OpenAI | Proprietary | 1401±13.2 | 27 - 46 | 3.8K | $1.25 / $10 | 400K | 1401.140 | [1388.0, 1414.3] | 45.11 | unstable |
| 7 | gpt-5.1-medium | OpenAI | Proprietary | 1398±10.0 | 27 - 46 | 6.1K | $1.25 / $10 | 400K | 1398.119 | [1388.1, 1408.1] | 26.05 | unstable |
| 8 | gpt-5.1 | OpenAI | Proprietary | 1360±9.0 | 4 - 68 | 10.0K | $1.25 / $10 | 400K | 1359.653 | [1350.6, 1368.7] | 21.15 | unstable |
| 9 | gpt-5.2-codex | OpenAI | Proprietary | 1340±12.4 | 46 - 52 | 3.1K | $1.75 / $14 | 400K | 1340.351 | [1328.0, 1352.7] | 39.79 | unstable |
| 10 | gpt-5.1-codex | OpenAI | Proprietary | 1336±10.1 | 48 - 58 | 6.2K | $1.25 / $10 | 400K | 1336.233 | [1326.1, 1346.3] | 26.50 | unstable |
| 11 | gpt-5.1-codex-mini | OpenAI | Proprietary | 1247±17.8 | 63 - 71 | 1.4K | $0.25 / $2 | 400K | 1246.973 | [1229.2, 1264.7] | 82.10 | unstable |
