排名 模型 得分
🥇
gpt-5.4-medium (codex-harness) OpenAI · Proprietary
1455±45.0 5 - 32 175 $2.50 / $15 1.1M 1454.853 [1409.8, 1499.9] 528.15 · 不稳 Proprietary
🥈
gpt-5.4-high (codex-harness) OpenAI · Proprietary
1453±51.0 11 - 25 140 $2.50 / $15 1.1M 1452.611 [1401.6, 1503.6] 676.08 · 不稳 Proprietary
🥉
gpt-5.3-codex (codex-harness) OpenAI · Proprietary
1416±32.5 10 - 18 345 $1.75 / $14 400K 1415.559 [1383.0, 1448.1] 275.71 · 不稳 Proprietary
4
20
gpt-5.2 OpenAI · Proprietary
1402±15.8 4 - 76 1.6K $1.75 / $14 400K 1401.779 [1386.0, 1417.6] 65.00 · 不稳 Proprietary
5
gpt-5-medium OpenAI · Proprietary
1399±12.7 27 - 46 3.9K $1.25 / $10 400K 1399.421 [1386.7, 1412.1] 42.05 · 不稳 Proprietary
6
gpt-5.1-medium OpenAI · Proprietary
1394±9.6 27 - 46 6.4K $1.25 / $10 400K 1394.241 [1384.6, 1403.9] 24.21 · 不稳 Proprietary
7
19
gpt-5.1 OpenAI · Proprietary
1359±8.7 4 - 68 10.5K $1.25 / $10 400K 1359.231 [1350.6, 1367.9] 19.53 · 不稳 Proprietary
8
gpt-5.2-codex OpenAI · Proprietary
1340±12.1 46 - 52 3.2K $1.75 / $14 400K 1340.451 [1328.3, 1352.6] 38.19 · 不稳 Proprietary
9
gpt-5.1-codex OpenAI · Proprietary
1334±9.7 48 - 58 6.5K $1.25 / $10 400K 1334.091 [1324.4, 1343.8] 24.42 · 不稳 Proprietary
10
gpt-5.1-codex-mini OpenAI · Proprietary
1250±17.1 63 - 71 1.5K $0.25 / $2 400K 1250.069 [1233.0, 1267.2] 76.19 · 不稳 Proprietary

没有找到相关模型