排名 模型 得分
🥇
gpt-5.4-medium (codex-harness) OpenAI · Proprietary
1464±43.3 5 - 32 189 $2.50 / $15 1.1M 1464.481 [1421.2, 1507.8] 487.51 · 不稳 Proprietary
🥈
gpt-5.4-high (codex-harness) OpenAI · Proprietary
1459±48.3 11 - 25 162 $2.50 / $15 1.1M 1458.968 [1410.7, 1507.3] 607.69 · 不稳 Proprietary
🥉
gpt-5.3-codex (codex-harness) OpenAI · Proprietary
1425±31.8 10 - 18 363 $1.75 / $14 400K 1425.125 [1393.4, 1456.9] 262.68 · 不稳 Proprietary
4
gpt-5.2 OpenAI · Proprietary
1409±16.4 4 - 76 1.5K $1.75 / $14 400K 1409.411 [1393.0, 1425.8] 69.68 · 不稳 Proprietary
5
gpt-5-medium OpenAI · Proprietary
1400±12.8 27 - 46 3.8K $1.25 / $10 400K 1400.408 [1387.6, 1413.2] 42.87 · 不稳 Proprietary
6
gpt-5.1-medium OpenAI · Proprietary
1396±9.7 27 - 46 6.2K $1.25 / $10 400K 1396.175 [1386.5, 1405.9] 24.62 · 不稳 Proprietary
7
gpt-5.1 OpenAI · Proprietary
1359±8.7 4 - 68 10.2K $1.25 / $10 400K 1359.063 [1350.4, 1367.8] 19.72 · 不稳 Proprietary
8
gpt-5.2-codex OpenAI · Proprietary
1342±12.0 46 - 52 3.2K $1.75 / $14 400K 1342.024 [1330.0, 1354.0] 37.54 · 不稳 Proprietary
9
gpt-5.1-codex OpenAI · Proprietary
1334±9.8 48 - 58 6.3K $1.25 / $10 400K 1334.046 [1324.2, 1343.8] 25.02 · 不稳 Proprietary
10
gpt-5.1-codex-mini OpenAI · Proprietary
1249±17.3 63 - 71 1.5K $0.25 / $2 400K 1248.641 [1231.3, 1266.0] 78.31 · 不稳 Proprietary

没有找到相关模型