Rank Model Score
🥇
gpt-5.4-medium (codex-harness) OpenAI · Proprietary
1464±43.2 5 - 32 189 $2.50 / $15 1.1M 1463.980 [1420.8, 1507.1] 485.03 · unstable Proprietary
🥈
gpt-5.4-high (codex-harness) OpenAI · Proprietary
1457±48.1 11 - 25 162 $2.50 / $15 1.1M 1457.464 [1409.3, 1505.6] 603.17 · unstable Proprietary
🥉
gpt-5.3-codex (codex-harness) OpenAI · Proprietary
1424±31.8 10 - 18 363 $1.75 / $14 400K 1423.732 [1391.9, 1455.5] 263.40 · unstable Proprietary
4
29
gpt-5.2 OpenAI · Proprietary
1412±16.8 4 - 76 1.5K $1.75 / $14 400K 1411.650 [1394.9, 1428.4] 73.41 · unstable Proprietary
5
gpt-5-medium OpenAI · Proprietary
1401±13.0 27 - 46 3.8K $1.25 / $10 400K 1400.636 [1387.7, 1413.6] 43.74 · unstable Proprietary
6
gpt-5.1-medium OpenAI · Proprietary
1398±9.8 27 - 46 6.1K $1.25 / $10 400K 1397.853 [1388.1, 1407.6] 24.77 · unstable Proprietary
7
28
gpt-5.1 OpenAI · Proprietary
1359±8.7 4 - 68 10.0K $1.25 / $10 400K 1359.395 [1350.7, 1368.1] 19.86 · unstable Proprietary
8
gpt-5.2-codex OpenAI · Proprietary
1342±12.0 46 - 52 3.2K $1.75 / $14 400K 1342.276 [1330.2, 1354.3] 37.80 · unstable Proprietary
9
gpt-5.1-codex OpenAI · Proprietary
1336±9.8 48 - 58 6.2K $1.25 / $10 400K 1335.977 [1326.1, 1345.8] 25.23 · unstable Proprietary
10
gpt-5.1-codex-mini OpenAI · Proprietary
1246±17.6 63 - 71 1.4K $0.25 / $2 400K 1246.151 [1228.5, 1263.8] 80.69 · unstable Proprietary
