排名 模型 得分
🥇
gpt-5.4-medium (codex-harness) OpenAI · Proprietary
1474±47.6 5 - 32 165 $2.50 / $15 1.1M 1474.164 [1426.5, 1521.8] 590.21 · 不稳 Proprietary
🥈
gpt-5.4-high (codex-harness) OpenAI · Proprietary
1458±48.6 11 - 25 160 $2.50 / $15 1.1M 1458.188 [1409.6, 1506.8] 615.05 · 不稳 Proprietary
🥉
gpt-5.3-codex (codex-harness) OpenAI · Proprietary
1423±31.9 10 - 18 364 $1.75 / $14 400K 1423.016 [1391.1, 1454.9] 264.69 · 不稳 Proprietary
4
55.0
gpt-5.4-mini-high OpenAI · Proprietary
1417±42.5 4 - 40 225 $0.75 / $4.50 400K 1416.586 [1374.1, 1459.1] 470.72 · 不稳 Proprietary
5
72.0
gpt-5.2 OpenAI · Proprietary
1412±17.0 4 - 76 1.5K $1.75 / $14 400K 1411.946 [1395.0, 1428.9] 74.94 · 不稳 Proprietary
6
gpt-5-medium OpenAI · Proprietary
1401±13.2 27 - 46 3.8K $1.25 / $10 400K 1401.078 [1387.9, 1414.3] 45.24 · 不稳 Proprietary
7
gpt-5.1-medium OpenAI · Proprietary
1398±10.0 27 - 46 6.1K $1.25 / $10 400K 1398.090 [1388.1, 1408.1] 26.17 · 不稳 Proprietary
8
51.0
gpt-5.1 OpenAI · Proprietary
1360±9.0 4 - 68 10.0K $1.25 / $10 400K 1359.629 [1350.6, 1368.7] 21.27 · 不稳 Proprietary
9
gpt-5.2-codex OpenAI · Proprietary
1340±12.4 46 - 52 3.1K $1.75 / $14 400K 1340.387 [1328.0, 1352.8] 39.90 · 不稳 Proprietary
10
gpt-5.1-codex OpenAI · Proprietary
1336±10.1 48 - 58 6.2K $1.25 / $10 400K 1336.199 [1326.1, 1346.3] 26.61 · 不稳 Proprietary
11
gpt-5.1-codex-mini OpenAI · Proprietary
1247±17.8 63 - 71 1.4K $0.25 / $2 400K 1246.905 [1229.1, 1264.7] 82.24 · 不稳 Proprietary

没有找到相关模型