排名 模型 得分
🥇
gpt-5.4-medium (codex-harness) OpenAI · Proprietary
1474±47.6 5 - 32 165 $2.50 / $15 1.1M 1473.719 [1426.2, 1521.3] 588.87 · 不稳 Proprietary
🥈
gpt-5.4-high (codex-harness) OpenAI · Proprietary
1458±48.6 11 - 25 160 $2.50 / $15 1.1M 1457.806 [1409.2, 1506.4] 614.68 · 不稳 Proprietary
🥉
gpt-5.3-codex (codex-harness) OpenAI · Proprietary
1422±31.8 10 - 18 363 $1.75 / $14 400K 1421.628 [1389.8, 1453.5] 263.67 · 不稳 Proprietary
4
24.0
gpt-5.4-mini-high OpenAI · Proprietary
1417±39.5 4 - 40 254 $0.75 / $4.50 400K 1416.850 [1377.3, 1456.4] 406.63 · 不稳 Proprietary
5
44.0
gpt-5.2 OpenAI · Proprietary
1412±17.0 4 - 76 1.5K $1.75 / $14 400K 1412.122 [1395.1, 1429.1] 75.23 · 不稳 Proprietary
6
gpt-5-medium OpenAI · Proprietary
1401±13.2 27 - 46 3.8K $1.25 / $10 400K 1401.226 [1388.0, 1414.5] 45.53 · 不稳 Proprietary
7
gpt-5.1-medium OpenAI · Proprietary
1398±10.1 27 - 46 6.1K $1.25 / $10 400K 1398.294 [1388.2, 1408.4] 26.43 · 不稳 Proprietary
8
46.0
gpt-5.1 OpenAI · Proprietary
1360±9.1 4 - 68 10.0K $1.25 / $10 400K 1359.716 [1350.6, 1368.8] 21.54 · 不稳 Proprietary
9
gpt-5.2-codex OpenAI · Proprietary
1340±12.4 46 - 52 3.1K $1.75 / $14 400K 1340.071 [1327.7, 1352.5] 40.14 · 不稳 Proprietary
10
gpt-5.1-codex OpenAI · Proprietary
1336±10.2 48 - 58 6.2K $1.25 / $10 400K 1336.376 [1326.2, 1346.5] 26.88 · 不稳 Proprietary
11
gpt-5.1-codex-mini OpenAI · Proprietary
1247±17.8 63 - 71 1.4K $0.25 / $2 400K 1247.054 [1229.2, 1264.9] 82.55 · 不稳 Proprietary

没有找到相关模型