排名 模型 得分
🥇
175.0
olmo-3.1-32b-instruct Allen AI · Apache 2.0
1348±10.9 72 - 184 3.0K $0.20 / $0.60 65.5K 1348.373 [1337.5, 1359.3] 31.02 · 不稳 Apache 2.0
🥈
209.0
olmo-3-32b-think Allen AI · Apache 2.0
1334±15.4 118 - 220 1.5K $0.15 / $0.50 65.5K 1334.065 [1318.6, 1349.5] 61.98 · 不稳 Apache 2.0
🥉
226.0
olmo-3.1-32b-think Allen AI · Apache 2.0
1322±13.2 111 - 210 2.1K $0.15 / $0.50 65.5K 1321.532 [1308.4, 1334.7] 45.13 · 不稳 Apache 2.0
4
178.0
molmo-2-8b Allen AI · Apache 2.0
1298±41.9 143 - 242 217 $0.20 / $0.20 36.9K 1298.220 [1256.3, 1340.1] 456.40 · 不稳 Apache 2.0
5
247.0
olmo-2-0325-32b-instruct Allen AI · Apache-2.0
1244±24.1 214 - 266 538 $0.05 / $0.20 128K 1244.130 [1220.1, 1268.2] 150.60 · 不稳 Apache-2.0
6
224.0
llama-3.1-tulu-3-70b Allen AI · Llama 3.1
1238±23.7 172 - 258 499 N/A N/A 1238.342 [1214.6, 1262.1] 146.65 · 不稳 Llama 3.1
7
266.0
llama-3.1-tulu-3-8b Allen AI · Llama 3.1
1180±25.8 240 - 291 452 N/A N/A 1179.629 [1153.8, 1205.4] 173.39 · 不稳 Llama 3.1
8
288.0
tulu-2-dpo-70b Allen AI · AI2 ImpACT Low-risk
1126±17.7 273 - 319 1.2K N/A N/A 1125.983 [1108.2, 1143.7] 82.00 · 不稳 AI2 ImpACT Low-risk
9
329.0
olmo-7b-instruct Allen AI · Apache-2.0
1012±19.1 306 - 323 1.1K $0.20 / $0.20 N/A 1011.684 [992.6, 1030.8] 94.68 · 不稳 Apache-2.0

没有找到相关模型