Name: AI2 困难模式 · 大语言模型排行榜 - 最佳文本与对话 AI 模型对比 2026-04
Creator: LM Arena
License: https://creativecommons.org/licenses/by/4.0/
Keywords: AI2,困难模式,AI 大语言模型排行榜,AI 模型排行

排名Rank ⇕	模型Model ⇕	得分Score ↓
› 🥇	168.0	olmo-3.1-32b-instruct Allen AI · Apache 2.0	1323±8.0	72 - 184	6.5K 票	$0.20 / $0.60	65.5K	1322.947 [1315.0, 1330.9]	16.47 · 不稳	Apache 2.0
上下文Context65.5K 价格Price$0.20 / $0.60 方差Variance16.47 协议LicenseApache 2.0 综合排名Overall Rank168.0 投票数Votes6.5K
› 🥈	177.0	olmo-3-32b-think Allen AI · Apache 2.0	1302±11.2	118 - 220	3.0K 票	$0.15 / $0.50	65.5K	1302.182 [1291.0, 1313.4]	32.53 · 不稳	Apache 2.0
上下文Context65.5K 价格Price$0.15 / $0.50 方差Variance32.53 协议LicenseApache 2.0 综合排名Overall Rank177.0 投票数Votes3.0K
› 🥉	180.0	molmo-2-8b Allen AI · Apache 2.0	1294±28.6	143 - 242	438 票	$0.20 / $0.20	36.9K	1293.558 [1265.0, 1322.1]	212.60 · 不稳	Apache 2.0
上下文Context36.9K 价格Price$0.20 / $0.20 方差Variance212.60 协议LicenseApache 2.0 综合排名Overall Rank180.0 投票数Votes438
› 4	206.0	olmo-3.1-32b-think Allen AI · Apache 2.0	1273±9.6	111 - 210	4.4K 票	$0.15 / $0.50	65.5K	1272.613 [1263.0, 1282.2]	23.89 · 不稳	Apache 2.0
上下文Context65.5K 价格Price$0.15 / $0.50 方差Variance23.89 协议LicenseApache 2.0 综合排名Overall Rank206.0 投票数Votes4.4K
› 5	217.0	llama-3.1-tulu-3-70b Allen AI · Llama 3.1	1220±18.8	172 - 258	779 票	N/A	N/A	1220.204 [1201.4, 1239.0]	92.34 · 不稳	Llama 3.1
上下文ContextN/A 价格PriceN/A 方差Variance92.34 协议LicenseLlama 3.1 综合排名Overall Rank217.0 投票数Votes779
› 6	238.0	olmo-2-0325-32b-instruct Allen AI · Apache-2.0	1208±20.1	214 - 266	789 票	$0.05 / $0.20	128K	1207.680 [1187.6, 1227.8]	105.43 · 不稳	Apache-2.0
上下文Context128K 价格Price$0.05 / $0.20 方差Variance105.43 协议LicenseApache-2.0 综合排名Overall Rank238.0 投票数Votes789
› 7	247.0	llama-3.1-tulu-3-8b Allen AI · Llama 3.1	1174±20.0	240 - 291	728 票	N/A	N/A	1174.444 [1154.4, 1194.5]	104.46 · 不稳	Llama 3.1
上下文ContextN/A 价格PriceN/A 方差Variance104.46 协议LicenseLlama 3.1 综合排名Overall Rank247.0 投票数Votes728
› 8	280.0	tulu-2-dpo-70b Allen AI · AI2 ImpACT Low-risk	1105±16.6	273 - 319	1.4K 票	N/A	N/A	1104.583 [1088.0, 1121.1]	71.37 · 不稳	AI2 ImpACT Low-risk
上下文ContextN/A 价格PriceN/A 方差Variance71.37 协议LicenseAI2 ImpACT Low-risk 综合排名Overall Rank280.0 投票数Votes1.4K
› 9	322.0	olmo-7b-instruct Allen AI · Apache-2.0	993±17.2	306 - 323	1.5K 票	$0.20 / $0.20	N/A	992.993 [975.8, 1010.2]	76.73 · 不稳	Apache-2.0
上下文ContextN/A 价格Price$0.20 / $0.20 方差Variance76.73 协议LicenseApache-2.0 综合排名Overall Rank322.0 投票数Votes1.5K

没有找到相关模型No matching models found

AI 大语言模型排行榜 🏆 综合榜单