AI 大模型排行榜 (Artificial Analysis LLM Ranking)

信息查询

2.7k 次浏览

100% 有帮助 · 1 人反馈

值品工具箱提供的 AI 大模型排行榜聚合了来自 Artificial Analysis 的权威数据，实时追踪并排名超过 100 个主流大语言模型。

AI 大模型排行榜数据中心

重置

排名	模型名称	综合指数 ▼	编程	价格 ($/1M)
1	Claude Opus 5 (Adaptive Reasoning, Max Effort)	60.7	78	$10
2	Claude Opus 5 (Adaptive Reasoning, Xhigh Effort)	60.1	77	$10
3	Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)	59.9	76.5	$20
4	GPT-5.6 Sol (max)	58.9	77.4	$11.25
5	Claude Opus 5 (Adaptive Reasoning, High Effort)	58.9	76.5	$10
6	GPT-5.6 Sol (xhigh)	57.7	78.3	$11.25
7	Kimi K3	57.1	76.2	$6
8	Claude Opus 5 (Adaptive Reasoning, Medium Effort)	56.3	74.3	$10
9	GPT-5.6 Sol (high)	55.9	77.2	$11.25
10	Claude Opus 4.8 (Adaptive Reasoning, Max Effort)	55.7	74.3	$10
11	GPT-5.6 Terra (max)	55	76.7	$5.625
12	GPT-5.5 (xhigh)	54.8	74.9	$11.25
13	Grok 4.5 (high)	53.8	72.4	$3
14	GPT-5.6 Sol (medium)	53.6	76.3	$11.25
15	Claude Opus 4.7 (Adaptive Reasoning, Max Effort)	53.5	73.6	$10
16	Claude Sonnet 5 (Adaptive Reasoning, Max Effort)	53.4	71.5	$4
17	GPT-5.5 (high)	53.1	71.6	$11.25
18	GPT-5.6 Terra (xhigh)	51.6	70.6	$5.625
19	GPT-5.4 (xhigh)	51.4	71.1	$5.625
20	GPT-5.6 Luna (max)	51.2	71.4	$2.25
21	GLM-5.2 (max)	51.1	68.8	$2.15
22	Muse Spark 1.1 (xhigh)	50.6	71.3	$2
23	Claude Opus 5 (Adaptive Reasoning, Low Effort)	50.6	66.9	$10
24	GPT-5.5 (medium)	50.4	71.5	$11.25
25	Gemini 3.5 Flash (high)	50.2	70.1	$3.375
26	Gemini 3.6 Flash (high)	50.1	69.2	$3
27	GPT-5.6 Sol (low)	49.4	69.7	$11.25
28	GPT-5.6 Luna (xhigh)	49.1	68.6	$2.25
29	GPT-5.6 Terra (high)	49	67.1	$5.625
30	Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)	47.2	63	$6
31	Gemini 3.1 Pro Preview	46.5	68.8	$4.5
32	GPT-5.6 Luna (high)	46.1	63.3	$2.25
33	Qwen3.7 Max	46	66	$3.75
34	GPT-5.6 Terra (medium)	45.6	64.7	$5.625
35	Gemini 3.5 Flash (medium)	45.4	-	$3.375
36	MiniMax-M3	44.4	58.6	$0.525
37	GPT-5.3 Codex (xhigh)	44.3	-	$4.813
38	DeepSeek V4 Pro (Reasoning, Max Effort)	44.3	59.4	$0.544
39	Kimi K2.6	44.2	61.8	$1.712
40	Motif 3 (Beta)	44.1	62	$0
41	Claude Opus 4.6 (Adaptive Reasoning, Max Effort)	43.7	-	$10
42	GPT-5.5 (low)	43.5	60.9	$11.25
43	Muse Spark	43.1	58.6	$0
44	DeepSeek V4 Pro (Reasoning, High Effort)	43.1	58.7	$0.544
45	Claude Opus 4.7 (Non-reasoning, High Effort)	42.7	-	$10
46	MiMo-V2.5-Pro	42.2	60.2	$0.544
47	GPT-5.2 (xhigh)	42.2	-	$4.813
48	Kimi K2.7 Code	41.9	60.8	$1.712
49	Claude Sonnet 5 (Non-reasoning, High Effort)	41.7	66.4	$4
50	GPT-5.6 Sol (Non-reasoning)	41.2	65.1	$11.25
51	Hy3	41.2	58.8	$0.25
52	Nex-N2-Pro	41	59.1	$1
53	Claude Opus 4.5 (Reasoning)	40.8	-	$10
54	Inkling (xhigh)	40.7	52.1	$2.572
55	GPT-5.6 Terra (low)	40.5	58.1	$5.625
56	DeepSeek V4 Flash (Reasoning, Max Effort)	40.3	56.2	$0.175
57	MiMo-V2-Pro	40.3	-	$0
58	GLM-5.1 (Reasoning)	40.2	55.8	$2.135
59	GPT-5.2 Codex (xhigh)	40.1	-	$4.813
60	GPT-5.4 mini (xhigh)	40	56.1	$1.688
61	Qwen3.6 Max Preview	40	-	$2.925
62	Grok Build 0.1 0616	39.8	51.5	$1.25
63	Qwen3.6 Plus	39.6	54.5	$1.125
64	Gemini 3 Pro Preview (high)	39.6	-	$4.5
65	GLM-5 (Reasoning)	39.5	-	$1.55
66	GPT-5.4 (low)	39.1	-	$5.625
67	Qwen3.7 Plus	39	55.9	$0.7
68	JT-4.1 Flash 236B A21B	38.8	52.4	$0
69	Agnes 2.5 Pro Alpha	38.8	58.8	$0.563
70	GPT-5.4 nano (xhigh)	38.2	56.1	$0.463
71	GPT-5.6 Luna (medium)	38.1	50.7	$2.25
72	MiniMax-M2.7	38.1	52.6	$0.525
73	GLM-5-Turbo	38.1	-	$0
74	GPT-5.2 (medium)	38	-	$4.813
75	Nemotron 3 Ultra 550B A55B (Reasoning)	37.8	49.3	$1.175
76	Gemini 3 Flash Preview (Reasoning)	37.8	-	$1.125
77	Claude Opus 4.6 (Non-reasoning, High Effort)	37.8	-	$10
78	Grok 4.3 (high)	37.6	42.2	$1.563
79	DeepSeek V4 Flash (Reasoning, High Effort)	37.5	52	$0.175
80	MiMo-V2.5	37.2	56.8	$0.175
81	Qwen3.6 27B (Reasoning)	37.1	53.7	$1.35
82	Grok 4.20 0309 v2 (Reasoning)	37	-	$3
83	GPT-5.1 (high)	36.9	49.4	$3.438
84	Gemini 3.5 Flash-Lite	36.5	49.3	$0.85
85	Grok 4.20 0309 (Reasoning)	36.5	-	$3
86	MiMo-V2-Omni-0327	36.4	-	$0
87	Claude 4.5 Sonnet (Reasoning)	36.4	52.1	$6
88	GPT-5 Codex (high)	36.1	-	$3.438
89	Grok 4.3 (medium)	36	-	$1.563
90	Claude Sonnet 4.6 (Non-reasoning, High Effort)	35.9	-	$6
91	Grok 4.3 (low)	35.4	-	$1.563
92	GPT-5.5 (Non-reasoning)	35.4	56.5	$11.25
93	Kimi K2.5 (Reasoning)	35.4	46.8	$1.2
94	GLM-5.1 (Non-reasoning)	35.4	-	$2.135
95	MiMo-V2-Omni	35	-	$0
96	Gemini 3.5 Flash (minimal)	34.9	-	$3.375
97	GPT-5.1 Codex (high)	34.7	-	$3.438
98	GPT-5 (high)	34.7	37.8	$3.438
99	Claude Opus 4.5 (Non-reasoning)	34.7	-	$10
100	Kimi K2.6 (Non-reasoning)	34.6	-	$1.712
101	GLM 5V Turbo (Reasoning)	34.5	-	$0
102	Claude Sonnet 4.6 (Non-reasoning, Low Effort)	34.3	-	$6
103	GLM-5.2 (Non-reasoning)	34.1	46.5	$2.076
104	GPT-5.6 Terra (Non-reasoning)	34	52.3	$5.625
105	Qwen3.5 27B (Reasoning)	33.8	-	$0.825
106	KAT Coder Pro V2	33.7	59.5	$0.525
107	Qwen3.5 397B A17B (Reasoning)	33.7	48.2	$1.35
108	GPT-5 (medium)	33.7	-	$3.438
109	Claude 4.1 Opus (Reasoning)	33.7	-	$30
110	MiniMax-M2.5	33.7	-	$0.525
111	GLM-4.7 (Reasoning)	33.7	45.3	$1
112	Hy3-preview (Reasoning)	33.6	-	$0.107
113	LongCat 2.0	33.5	45.3	$0
114	GPT-5.5 Instant (May 2026)	33.5	-	$11.25
115	GPT-5.6 Luna (low)	33.3	44.2	$2.25
116	Grok 4	33.3	-	$6
117	MiMo-V2-Flash (Feb 2026)	33.2	-	$0
118	Gemini 3 Pro Preview (low)	33.1	-	$4.5
119	Kimi K2 Thinking	32.7	-	$1.075
120	o3-pro	32.5	-	$35
121	GLM-5 (Non-reasoning)	32.4	-	$1.55
122	Qwen3.5 122B A10B (Reasoning)	32.3	45.7	$1.1
123	Qwen3.5 397B A17B (Non-reasoning)	32	-	$1.35
124	DeepSeek V3.2 (Reasoning)	32	44.2	$0.315
125	Qwen3 Max Thinking	31.7	-	$0
126	Qwen3.6 35B A3B (Reasoning)	31.6	41.9	$0.557
127	MiniMax-M2.1	31.4	-	$0.525
128	DeepSeek V4 Pro (Non-reasoning)	31.2	-	$0.544
129	GPT-5 (low)	31.2	-	$3.438
130	MiMo-V2-Flash (Reasoning)	31.2	-	$0.15
131	Claude 4 Opus (Reasoning)	31	-	$30
132	GPT-5 mini (medium)	30.9	-	$0.688
133	Qwen3.5 Omni Plus	30.6	-	$1.5
134	Ring-2.6-1T	30.6	42.8	$0.85
135	GPT-5.1 Codex mini (high)	30.6	-	$0.688
136	Grok 4.1 Fast (Reasoning)	30.6	-	$0
137	Qwen3.6 27B (Non-reasoning)	30.5	46.6	$1.35
138	o3	30.4	-	$3.5
139	DeepSeek V3.1 Terminus (Reasoning)	30.4	43.5	$1.914
140	Step 3.7 Flash	30.3	39.6	$0.438
141	GPT-5.4 nano (medium)	30.2	-	$0.463
142	Mistral Medium 3.5	29.9	46.9	$3
143	GPT-5.4 mini (medium)	29.8	-	$1.688
144	Claude 4.5 Haiku (Reasoning)	29.6	43.9	$2
145	Gemma 4 31B (Reasoning)	29.4	43.4	$0
146	Kimi K2.5 (Non-reasoning)	29.4	-	$1.2
147	Claude 4.5 Sonnet (Non-reasoning)	29.3	-	$6
148	Qwen3.5 35B A3B (Reasoning)	29.3	-	$0.688
149	Qwen3.5 27B (Non-reasoning)	29.3	-	$0.825
150	GPT-5.5 Instant (June 2026)	28.9	39.4	$11.25
151	Claude 4 Sonnet (Reasoning)	28.9	37.6	$6
152	DeepSeek V4 Flash (Non-reasoning)	28.7	-	$0.175
153	GLM-4.6 (Reasoning)	28.7	45.8	$0.963
154	JT-35B-Flash	28.4	-	$0
155	KAT-Coder-Pro V1	28.3	-	$0
156	MiniMax-M2	28.3	-	$0.525
157	Claude 4.1 Opus (Non-reasoning)	28.2	-	$30
158	MiMo-V2.5-Pro (Non-reasoning)	27.9	-	$0.544
159	GPT-5.4 (Non-reasoning)	27.7	-	$5.625
160	Qwen3.5 122B A10B (Non-reasoning)	27.6	43.3	$1.1
161	Gemini 3 Flash Preview (Non-reasoning)	27.4	-	$1.125
162	Grok 4 Fast (Reasoning)	27.4	-	$0.275
163	Claude 3.7 Sonnet (Reasoning)	27.1	36.4	$0
164	GPT-5.6 Luna (Non-reasoning)	26.6	39.3	$2.25
165	GLM-4.7 (Non-reasoning)	26.6	-	$1
166	Hy3-preview (Non-reasoning)	26.1	-	$0.107
167	Ling-2.6-1T	26.1	-	$0.85
168	Step 3.5 Flash 2603	26	-	$0.15
169	Doubao Seed Code	26	-	$0
170	GPT-5.2 (Non-reasoning)	26	-	$4.813
171	Gemini 2.5 Pro	25.8	33.3	$3.438
172	Gemma 4 26B A4B (Reasoning)	25.7	39.3	$0.198
173	o4-mini (high)	25.6	-	$1.925
174	Claude 4 Opus (Non-reasoning)	25.5	-	$30
175	Claude 4 Sonnet (Non-reasoning)	25.5	-	$6
176	Step 3.5 Flash	25.5	-	$0.15
177	NVIDIA Nemotron 3 Super 120B A12B (Reasoning)	25.4	37.7	$0.381
178	DeepSeek V3.2 Exp (Reasoning)	25.4	-	$0.315
179	GPT-5 mini (high)	25.3	15.6	$0.688
180	Gemini 3.1 Flash-Lite	25	34.7	$0.563
181	Qwen3 Max Thinking (Preview)	25	-	$2.4
182	Grok 4.3 (Non-reasoning)	24.8	35.2	$1.563
183	MiMo-V2-Flash (Non-reasoning)	24.7	49.8	$0
184	DeepSeek V3.2 (Non-reasoning)	24.7	-	$0.315
185	Qwen3.6 35B A3B (Non-reasoning)	24.2	28.1	$0.844
186	Qwen3.5 35B A3B (Non-reasoning)	24	37	$0.688
187	Qwen3 Max	24	-	$2.4
188	gpt-oss-120b (high)	23.8	30.4	$0.262
189	Gemini 2.5 Flash Preview (Sep '25) (Reasoning)	23.8	-	$0
190	Claude 4.5 Haiku (Non-reasoning)	23.7	-	$2
191	Claude 3.7 Sonnet (Non-reasoning)	23.5	-	$6
192	Kimi K2 0905	23.5	-	$1.075
193	o1	23.4	39.7	$26.25
194	Gemini 2.5 Pro Preview (Mar' 25)	23	46.7	$0
195	GLM-4.6 (Non-reasoning)	23	-	$0.981
196	GLM-4.7-Flash (Reasoning)	22.9	-	$0.153
197	Command A+	22.5	27.8	$0
198	Grok 3 mini Reasoning (high)	22.5	-	$0.35
199	Grok 4.20 0309 (Non-reasoning)	22.5	-	$3
200	Gemini 2.5 Pro Preview (May' 25)	22.3	-	$3.438
201	DeepSeek V3.2 Speciale	22.2	-	$0
202	K-EXAONE (Reasoning)	22.1	32.1	$0
203	ERNIE 5.0 Thinking Preview	21.9	-	$0
204	Gemma 4 31B (Non-reasoning)	21.8	33.2	$0.205
205	Gemma 4 12B (Reasoning)	21.8	31	$0.15
206	Nova 2.0 Pro Preview (medium)	21.8	34	$3.438
207	Grok 4.20 0309 v2 (Non-reasoning)	21.8	-	$3
208	Grok Code Fast 1	21.6	-	$0
209	Mercury 2	21.4	31.1	$0.375
210	Qwen3.5 9B (Reasoning)	21.4	28.7	$0.151
211	DeepSeek V3.1 Terminus (Non-reasoning)	21.4	-	$0.453
212	DeepSeek V3.2 Exp (Non-reasoning)	21.3	-	$0.315
213	Apriel-v1.5-15B-Thinker	21.2	-	$0
214	Qwen3 Coder Next	21.1	36.2	$0.563
215	DeepSeek V3.1 (Non-reasoning)	21	-	$0.84
216	Nova 2.0 Omni (medium)	20.9	-	$0.85
217	DeepSeek V3.1 (Reasoning)	20.7	-	$0.865
218	Qwen3 VL 235B A22B (Reasoning)	20.6	-	$2.625
219	Apriel-v1.6-15B-Thinker	20.5	-	$0
220	GPT-5.1 (Non-reasoning)	20.4	-	$3.438
221	Qwen3.5 9B (Non-reasoning)	20.3	23.5	$0
222	EXAONE 4.5 33B	20.2	23.6	$0
223	Gemma 4 26B A4B (Non-reasoning)	20.1	-	$0.198
224	Qwen3.5 4B (Reasoning)	20.1	22.6	$0.06
225	Gemini 2.5 Flash (Reasoning)	20.1	-	$0.85
226	DeepSeek R1 0528 (May '25)	20.1	-	$2.063
227	GPT-5 nano (high)	19.9	-	$0.138
228	North Mini Code	19.8	36.5	$0
229	Mistral Small 4 (Reasoning)	19.6	26.6	$0.262
230	Nova 2.0 Pro Preview (low)	19.6	25.9	$3.438
231	Qwen3 235B A22B 2507 (Reasoning)	19.6	22.1	$2.625
232	GLM-4.5 (Reasoning)	19.5	-	$0
233	GPT-4.1	19.4	-	$3.5
234	Kimi K2	19.4	-	$1.002
235	Devstral 2	19.2	31.3	$0
236	Qwen3 Max (Preview)	19.2	-	$2.4
237	Nova 2.0 Lite (medium)	19	-	$0.85
238	Qwen3.5 Omni Flash	19	-	$0.275
239	o3-mini	19	-	$1.925
240	GPT-5 nano (medium)	19	-	$0.138
241	o1-pro	18.9	-	$262.5
242	Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)	18.8	-	$0
243	JT-MINI	18.5	-	$0
244	DeepSeek R1 (Jan '25)	18.5	24.6	$2.431
245	Grok 3	18.4	-	$8
246	Seed-OSS-36B-Instruct	18.3	-	$0.3
247	Nova 2.0 Lite (high)	18.2	23	$0.85
248	Trinity Large Thinking	18.2	25.8	$0.395
249	Qwen3 235B A22B 2507 Instruct	18.2	-	$1.225
250	Qwen3 Coder 480B A35B Instruct	18	-	$3
251	Magistral Medium 1.2	17.9	21.3	$2.75
252	Qwen3 VL 32B (Reasoning)	17.9	-	$2.625
253	Nova 2.0 Lite (low)	17.8	-	$0.85
254	HyperNova 60B 2605	17.8	23.2	$0.065
255	Sonar Reasoning Pro	17.8	-	$0
256	MiniMax M1 80k	17.7	-	$0.963
257	Nemotron Cascade 2 30B A3B	17.6	25.3	$0
258	GPT-5.4 nano (Non-Reasoning)	17.6	-	$0.463
259	Gemini 2.5 Flash Preview (Reasoning)	17.5	-	$0
260	Devstral Small 2	17.4	29.3	$0
261	K2 Think V2	17.3	21	$0
262	LongCat Flash Lite	17.2	-	$0
263	GPT-5 (minimal)	17.2	-	$3.438
264	HyperCLOVA X SEED Think (32B)	17	-	$0
265	o1-preview	17	34	$28.875
266	Grok 4.1 Fast (Non-reasoning)	16.9	-	$0
267	GLM-4.6V (Reasoning)	16.8	-	$0.45
268	K-EXAONE (Non-reasoning)	16.7	-	$0
269	Qwen3 Next 80B A3B (Reasoning)	16.7	17.4	$1.875
270	Nova 2.0 Omni (low)	16.6	-	$0.85
271	GPT-5.4 mini (Non-Reasoning)	16.6	-	$1.688
272	Grok 4 Fast (Non-reasoning)	16.5	-	$0.275
273	GLM-4.5-Air	16.5	-	$0.372
274	Mi:dm K 2.5 Pro	16.4	-	$0
275	Ring-1T	16.2	-	$0
276	G9v3-3B	16.1	9.9	$0
277	Qwen3.5 4B (Non-reasoning)	16	20.3	$0.06
278	Mistral Large 3	15.9	20.1	$0.75
279	INTELLECT-3	15.6	-	$0
280	o3-mini (high)	15.6	16.3	$1.925
281	GLM-4.7-Flash (Non-reasoning)	15.5	-	$0.153
282	DeepSeek V3 0324	15.4	21.2	$0.483
283	GPT-5 (ChatGPT)	15.3	-	$3.438
284	Solar Open 100B (Reasoning)	15.1	-	$0
285	Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)	15.1	-	$0.175
286	Grok 3 Reasoning Beta	15.1	-	$0
287	gpt-oss-120b (low)	14.9	21.2	$0.262
288	gpt-oss-20b (high)	14.9	20.7	$0.095
289	Nemotron 3 Nano Omni 30B A3B Reasoning	14.9	13.8	$0.131
290	GPT-4.1 mini	14.8	20.2	$0.7
291	Mistral Small 3.1	14.7	26.3	$0.15
292	Mistral Medium 3.1	14.7	20.5	$0.8
293	Nova 2.0 Pro Preview (Non-reasoning)	14.4	20.9	$3.438
294	MiniMax M1 40k	14.4	-	$0
295	Qwen3 30B A3B 2507 (Reasoning)	14.4	12.1	$0.75
296	gpt-oss-20b (low)	14.3	-	$0.103
297	Llama 4 Maverick	14.3	16.3	$0.415
298	GPT-5 mini (minimal)	14.3	-	$0.688
299	Qwen3 VL 235B A22B Instruct	14.3	-	$1.225
300	NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)	14.2	14.4	$0.088
301	K2-V2 (high)	14.2	-	$0
302	DeepSeek V3 (Dec '24)	14.2	23	$0.493
303	Solar Pro 3	14.1	16.2	$0
304	Ling 2.6 Flash	14.1	25.3	$0.15
305	Gemini 2.5 Flash (Non-reasoning)	14.1	-	$0.85
306	o1-mini	14	-	$0
307	Qwen3 Next 80B A3B Instruct	13.7	-	$0.875
308	Tri-21B-think Preview	13.6	-	$0
309	GPT-4.5 (Preview)	13.6	-	$0
310	Qwen3 Coder 30B A3B Instruct	13.6	-	$0.9
311	DiffusionGemma 26B A4B	13.5	19.7	$0
312	QwQ 32B	13.4	-	$0.745
313	Qwen3 235B A22B (Reasoning)	13.4	-	$2.625
314	Gemini 2.0 Flash Thinking Experimental (Jan '25)	13.3	24.1	$0
315	Qwen3 VL 30B A3B (Reasoning)	13.3	-	$0.75
316	Gemma 4 12B (Non-reasoning)	13.2	-	$0.15
317	Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)	13.1	-	$0.175
318	Motif-2-12.7B-Reasoning	12.8	-	$0
319	Ling-1T	12.8	-	$0
320	Nova Premier	12.7	-	$5
321	Magistral Medium 1	12.5	-	$0
322	Mistral Medium 3	12.5	-	$0.8
323	Solar Pro 2 (Preview) (Reasoning)	12.5	-	$0
324	Mistral Small 4 (Non-reasoning)	12.4	-	$0.262
325	Llama Nemotron Super 49B v1.5 (Reasoning)	12.4	-	$0.4
326	K2-V2 (medium)	12.4	-	$0
327	Tri-21B-Think	12.4	-	$0
328	Devstral Medium	12.4	-	$0
329	GPT-4o (March 2025, chatgpt-4o-latest)	12.3	-	$0
330	Gemini 2.0 Flash (Feb '25)	12.3	-	$0
331	Claude 3.5 Haiku	12.3	15.9	$0
332	Llama 3.3 Nemotron Super 49B v1 (Reasoning)	12.2	-	$0
333	MiniCPM5-1B (Reasoning)	12	-	$0
334	Qwen3 4B 2507 (Reasoning)	12	-	$0
335	Gemma 4 E4B (Reasoning)	11.9	9.4	$0.04
336	Sarvam 105B (high)	11.9	-	$0.074
337	Nova 2.0 Lite (Non-reasoning)	11.8	-	$0.85
338	Gemini 2.0 Pro Experimental (Feb '25)	11.8	25.5	$0
339	Claude 3 Opus	11.8	19.5	$30
340	Devstral Small (May '25)	11.8	-	$0
341	MiniCPM5-1B (Non-reasoning)	11.7	-	$0
342	Gemini 2.5 Flash Preview (Non-reasoning)	11.7	-	$0
343	Sonar Reasoning	11.7	-	$0
344	Qwen3 32B (Reasoning)	11.5	15.3	$2.625
345	Gemini 2.5 Flash-Lite (Reasoning)	11.4	-	$0.175
346	Magistral Small 1.2	11.3	14.7	$0.75
347	GPT-4o (Nov '24)	11.2	-	$4.375
348	Ministral 3 14B	11.1	14.4	$0.2
349	Nanbeige4.1-3B	11.1	9.6	$0
350	Qwen3 VL 32B Instruct	11.1	-	$1.225
351	DeepSeek R1 Distill Qwen 32B	11	-	$0
352	GLM-4.6V (Non-reasoning)	11	-	$0.45
353	Qwen3 235B A22B (Non-reasoning)	10.9	-	$1.225
354	Gemini 2.0 Flash (experimental)	10.7	-	$0
355	Magistral Small 1	10.7	-	$0
356	EXAONE 4.0 32B (Reasoning)	10.6	-	$0
357	Mistral Small 3.2	10.6	12.5	$0.15
358	Qwen3 VL 8B (Reasoning)	10.6	-	$0.66
359	Nova 2.0 Omni (Non-reasoning)	10.5	-	$0.85
360	DeepSeek R1 0528 Qwen3 8B	10.4	-	$0
361	Qwen3 14B (Reasoning)	10.4	13.8	$1.313
362	Qwen2.5 Max	10.2	-	$0
363	Llama 4 Scout	10	8.2	$0.3
364	Hermes 4 - Llama-3.1 70B (Reasoning)	10	-	$0.198
365	Gemini 1.5 Pro (Sep '24)	10	23.6	$0
366	Solar Pro 2 (Preview) (Non-reasoning)	10	-	$0
367	Qwen3 VL 30B A3B Instruct	10	-	$0.35
368	Claude 3.5 Sonnet (Oct '24)	9.9	30.2	$6
369	DeepSeek R1 Distill Llama 70B	9.9	-	$0.787
370	Falcon-H1R-7B	9.8	-	$0
371	DeepSeek R1 Distill Qwen 14B	9.8	-	$0
372	Ling-flash-2.0	9.7	-	$0.247
373	Qwen3 Omni 30B A3B (Reasoning)	9.6	-	$0.43
374	GPT-4o (Aug '24)	9.6	-	$4.375
375	GPT-4.1 nano	9.6	11.1	$0.175
376	Qwen2.5 Instruct 72B	9.6	-	$0.48
377	Step3 VL 10B	9.5	-	$0
378	Sonar	9.5	-	$0
379	Llama 3.3 Instruct 70B	9.4	11.9	$0.623
380	Gemma 4 E2B (Reasoning)	9.4	7.2	$0
381	Devstral Small (Jul '25)	9.3	-	$0
382	Sonar Pro	9.3	-	$0
383	Qwen3 30B A3B (Reasoning)	9.3	-	$0.75
384	QwQ 32B-Preview	9.2	-	$0
385	Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)	9.1	-	$0.9
386	Mistral Large 2 (Nov '24)	9.1	-	$0
387	GLM-4.5V (Reasoning)	9.1	-	$0.9
388	Qwen3 30B A3B 2507 Instruct	9.1	-	$0.35
389	Ministral 3 8B	9	9.7	$0.15
390	Solar Pro 2 (Reasoning)	9	-	$0
391	NVIDIA Nemotron Nano 12B v2 VL (Reasoning)	9	-	$0.3
392	Hermes 4 - Llama-3.1 405B (Reasoning)	9	-	$1.5
393	ERNIE 4.5 300B A47B	9	-	$0.485
394	Gemma 4 E4B (Non-reasoning)	8.9	-	$0.04
395	Granite 4.1 30B	8.9	10.4	$0
396	NVIDIA Nemotron Nano 9B V2 (Reasoning)	8.8	-	$0.07
397	Hermes 4 - Llama-3.1 405B (Non-reasoning)	8.8	-	$1.5
398	Gemini 2.0 Flash-Lite (Feb '25)	8.8	-	$0
399	Llama Nemotron Super 49B v1.5 (Non-reasoning)	8.7	-	$0.4
400	NVIDIA Nemotron 3 Nano 4B	8.7	8	$0
401	K2-V2 (low)	8.6	-	$0
402	GPT-4o (May '24)	8.6	24.2	$7.5
403	Gemini 2.0 Flash-Lite (Preview)	8.6	-	$0
404	Qwen3 32B (Non-reasoning)	8.6	-	$1.225
405	Llama 3.1 Instruct 405B	8.5	-	$4.375
406	Kimi Linear 48B A3B Instruct	8.5	-	$0
407	Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)	8.5	-	$0
408	Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)	8.5	-	$0
409	Qwen3 4B (Reasoning)	8.4	-	$0
410	Qwen3 VL 8B Instruct	8.4	-	$0.31
411	LFM2.5-8B-A1B	8.3	-	$0
412	Claude 3.5 Sonnet (June '24)	8.3	26	$6
413	Llama 3.1 Tulu3 405B	8.3	-	$0
414	Qwen3 8B (Reasoning)	8.3	9	$0.66
415	Ring-flash-2.0	8.2	-	$0.247
416	GPT-4o (ChatGPT)	8.2	-	$0
417	Olmo 3.1 32B Think	8.1	-	$0
418	Pixtral Large	8.1	-	$0
419	GPT-5 nano (minimal)	8	-	$0.138
420	Gemini 1.5 Flash (Sep '24)	8	-	$0
421	Grok 2 (Dec '24)	8	-	$0
422	GPT-4 Turbo	7.9	21.5	$15
423	Qwen3 VL 4B (Reasoning)	7.9	-	$0
424	Solar Pro 2 (Non-reasoning)	7.8	-	$0
425	Command A	7.7	-	$4.375
426	Nova Pro	7.7	-	$1.4
427	Llama 3.1 Nemotron Instruct 70B	7.6	-	$1.2
428	Llama 3.1 Instruct 8B	7.6	5.4	$0.079
429	Grok Beta	7.5	-	$0
430	Qwen2.5 Instruct 32B	7.5	-	$0
431	NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)	7.4	-	$0.088
432	NVIDIA Nemotron Nano 9B V2 (Non-reasoning)	7.4	-	$0.086
433	Gemma 3 27B Instruct	7.4	10.1	$0
434	Mistral Large 2 (Jul '24)	7.3	-	$3
435	Qwen3.5 2B (Reasoning)	7.1	2.9	$0
436	Qwen2.5 Coder Instruct 32B	7.1	-	$0
437	Qwen3 4B 2507 Instruct	7.1	-	$0
438	GPT-4	7	13.1	$37.5
439	GLM-4.5V (Non-reasoning)	7	-	$0.9
440	Qwen3 14B (Non-reasoning)	7	-	$0.612
441	Hermes 4 - Llama-3.1 70B (Non-reasoning)	6.9	-	$0.198
442	GPT-4o mini	6.9	11.4	$0.262
443	Gemini 2.5 Flash-Lite (Non-reasoning)	6.9	-	$0.175
444	Mistral Small 3	6.9	-	$0.15
445	Nova Lite	6.9	-	$0.105
446	Llama 3.1 Instruct 70B	6.8	-	$0.56
447	DeepSeek-V2.5 (Dec '24)	6.8	-	$0
448	Qwen3 4B (Non-reasoning)	6.8	-	$0
449	Qwen3 30B A3B (Non-reasoning)	6.8	-	$0.35
450	Granite 4.1 8B	6.7	9.5	$0.063
451	Sarvam 30B (high)	6.6	-	$0.047
452	Gemini 2.0 Flash Thinking Experimental (Dec '24)	6.6	-	$0
453	DeepSeek-V2.5	6.6	-	$0
454	Olmo 3.1 32B Instruct	6.5	-	$0
455	Gemma 4 E2B (Non-reasoning)	6.4	-	$0
456	Mistral Saba	6.4	-	$0
457	DeepSeek R1 Distill Llama 8B	6.4	-	$0
458	Olmo 3 32B Think	6.4	-	$0
459	Ministral 3 3B	6.3	4.8	$0.1
460	R1 1776	6.3	-	$0
461	Gemini 1.5 Pro (May '24)	6.3	19.8	$0
462	Reka Flash (Sep '24)	6.3	-	$0.35
463	Qwen2.5 Turbo	6.3	-	$0.088
464	Llama 3.2 Instruct 90B (Vision)	6.2	-	$2.04
465	Solar Mini	6.2	-	$0.15
466	Grok-1	6	-	$0
467	Phi-4 Mini Instruct	6	3.8	$0
468	EXAONE 4.0 32B (Non-reasoning)	6	-	$0
469	Qwen2 Instruct 72B	6	-	$0
470	Qwen3.5 2B (Non-reasoning)	5.6	2.4	$0
471	Gemini 1.5 Flash-8B	5.5	-	$0
472	Gemma 3 12B Instruct	5.5	5.8	$0
473	DeepHermes 3 - Mistral 24B Preview (Non-reasoning)	5.3	-	$0
474	Jamba 1.7 Large	5.3	-	$3.5
475	Qwen3.5 0.8B (Reasoning)	5.3	0	$0
476	Granite 4.0 H Small	5.2	-	$0.107
477	Qwen3 Omni 30B A3B Instruct	5.1	-	$0.43
478	DeepSeek-Coder-V2	5.1	-	$0
479	Hermes 3 - Llama-3.1 70B	5.1	-	$0.7
480	Jamba 1.5 Large	5.1	-	$3.5
481	Qwen3 8B (Non-reasoning)	5.1	-	$0.31
482	OLMo 2 32B	5	-	$0
483	Jamba 1.6 Large	5	-	$3.5
484	Phi-4	4.9	-	$0.219
485	LFM2 24B A2B	4.9	-	$0
486	Gemini 1.5 Flash (May '24)	4.9	-	$0
487	Nova Micro	4.7	-	$0.061
488	Granite 4.1 3B	4.7	4.7	$0
489	Claude 3 Sonnet	4.7	-	$6
490	Mistral Small (Sep '24)	4.7	-	$0.3
491	NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)	4.6	-	$0.3
492	Gemma 3n E4B Instruct Preview (May '25)	4.6	-	$0
493	Gemini 1.0 Ultra	4.6	17.6	$0
494	Phi-3 Mini Instruct 3.8B	4.6	-	$0
495	Phi-4 Multimodal Instruct	4.5	-	$0
496	Qwen2.5 Coder Instruct 7B	4.5	-	$0
497	Mixtral 8x22B Instruct	4.4	-	$0
498	Mistral Large (Feb '24)	4.4	-	$6
499	Llama 2 Chat 7B	4.3	-	$0.1
500	MiniCPM-V 4.6 1.3B	4.2	0.7	$0
501	Llama 3.2 Instruct 3B	4.2	-	$0
502	Reka Flash 3	4.1	-	$0.35
503	Jamba Reasoning 3B	4.1	-	$0
504	Qwen1.5 Chat 110B	4.1	-	$0
505	Qwen3 VL 4B Instruct	4.1	-	$0
506	Olmo 3 7B Think	4	-	$0
507	Claude 3 Haiku	3.9	-	$0.5
508	Claude 2.1	3.9	14	$0
509	OLMo 2 7B	3.9	-	$0
510	Molmo 7B-D	3.8	-	$0
511	Ling-mini-2.0	3.8	-	$0
512	DeepSeek R1 Distill Qwen 1.5B	3.7	-	$0
513	GPT-3.5 Turbo	3.6	10.7	$0.75
514	Claude 2.0	3.6	12.9	$0
515	Mistral Small (Feb '24)	3.6	-	$0.262
516	Mistral Medium	3.6	-	$3
517	DeepSeek-V2-Chat	3.6	-	$0
518	Llama 3 Instruct 70B	3.5	-	$1.175
519	LFM 40B	3.4	-	$0
520	Arctic Instruct	3.4	-	$0
521	Qwen Chat 72B	3.4	-	$0
522	Llama 3.2 Instruct 11B (Vision)	3.3	-	$0.357
523	Qwen3.5 0.8B (Non-reasoning)	3.3	1.2	$0
524	PALM-2	3.2	4.6	$0
525	Gemini 1.0 Pro	3.1	-	$0
526	DeepSeek Coder V2 Lite Instruct	3.1	-	$0
527	Llama 2 Chat 70B	3	-	$0
528	Llama 2 Chat 13B	3	-	$0
529	DeepSeek LLM 67B Chat (V1)	3	-	$0
530	OpenChat 3.5 (1210)	3	-	$0
531	DBRX Instruct	3	-	$0
532	Sarvam M (Reasoning)	3	-	$0
533	Command-R+ (Apr '24)	3	-	$6
534	Exaone 4.0 1.2B (Reasoning)	2.9	-	$0
535	Olmo 3 7B Instruct	2.8	-	$0.125
536	Exaone 4.0 1.2B (Non-reasoning)	2.8	-	$0
537	LFM2 2.6B	2.7	-	$0
538	LFM2.5-1.2B-Instruct	2.7	-	$0
539	LFM2.5-1.2B-Thinking	2.7	-	$0
540	Granite 4.0 H 1B	2.7	-	$0
541	Jamba 1.7 Mini	2.7	-	$0
542	Jamba 1.5 Mini	2.7	-	$0.25
543	Jamba 1.6 Mini	2.6	-	$0.25
544	Qwen3 1.7B (Reasoning)	2.6	-	$0
545	Gemma 3 270M	2.4	-	$0
546	Granite 4.0 Micro	2.4	-	$0
547	Apertus 70B Instruct	2.4	-	$1.345
548	Mixtral 8x7B Instruct	2.4	-	$0.512
549	DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)	2.3	-	$0
550	Llama 65B	2.1	-	$0
551	Qwen Chat 14B	2.1	-	$0
552	Granite 4.0 1B	2.1	-	$0
553	Claude Instant	2.1	7.8	$0
554	Mistral 7B Instruct	2.1	-	$0.25
555	Command-R (Mar '24)	2.1	-	$0.75
556	Molmo2-8B	2	-	$0
557	LFM2 8B A1B	1.8	-	$0
558	Granite 3.3 8B (Non-reasoning)	1.8	-	$0.085
559	Qwen3 1.7B (Non-reasoning)	1.5	-	$0
560	Qwen3 0.6B (Reasoning)	1.3	-	$0
561	Llama 3 Instruct 8B	1.2	-	$0.07
562	Gemma 3n E4B Instruct	1.2	3.2	$0.075
563	Llama 3.2 Instruct 1B	1.1	-	$0
564	Gemma 3 4B Instruct	1.1	2.7	$0
565	LFM2 1.2B	1.1	-	$0
566	LFM2.5-VL-1.6B	1	-	$0
567	Granite 4.0 H 350M	1	-	$0
568	Granite 4.0 350M	1	-	$0
569	Apertus 8B Instruct	1	-	$0.125
570	Tiny Aya Global	1	-	$0
571	Gemma 3n E2B Instruct	1	-	$0
572	Gemma 3 1B Instruct	1	-	$0
573	Qwen3 0.6B (Non-reasoning)	1	-	$0
574	GPT-5.5 Pro (xhigh)	-	-	$0
575	Gemini 3 Deep Think	-	-	$0
576	Claude Sonnet 5 (Adaptive Reasoning, High Effort)	-	-	$4
577	Claude Sonnet 5 (Adaptive Reasoning, Low Effort)	-	-	$4
578	Claude Sonnet 5 (Adaptive Reasoning, Medium Effort)	-	-	$4
579	Claude Sonnet 5 (Adaptive Reasoning, Xhigh Effort)	-	-	$4
580	EXAONE 4.5 33B (Non-reasoning)	-	-	$0
581	Cogito v2.1 (Reasoning)	-	-	$1.25
582	Mi:dm K 2.5 Pro Preview	-	-	$0
583	GPT-3.5 Turbo (0613)	-	-	$0
584	GPT-4o Realtime (Dec '24)	-	-	$0
585	GPT-4o mini Realtime (Dec '24)	-	-	$0
586	GPT-5.4 Pro (xhigh)	-	-	$67.5

榜单解读建议

参考 AI 大模型排行榜 时，应综合考虑“综合指数”与“成本价格”。如果您是开发者，编程能力 (Coding) 是更核心的指标。

值品工具箱同步的 AI 大模型排行榜数据每 24 小时更新，确保您获取到最新的模型性能对比。

指标说明

● 综合指数：评估通用理解与逻辑。
● 价格 $/1M：混合 3:1 输入输出比的平均成本。
● 编程能力：衡量代码生成的准确性。

AI 大模型排行榜常见问题 (FAQ)

Q1: AI 大模型排行榜的数据多久更新？

AI 大模型排行榜数据每 24 小时自动抓取一次，确保最新模型加入列表。

Q2: 这个 AI 大模型排行榜包含国产模型吗？

是的，只要国产模型通过了 Artificial Analysis 的全球测评，就会出现在 AI 大模型排行榜中。

Q3: 综合指数在 AI 大模型排行榜中代表什么？

它代表模型的全能表现。AI 大模型排行榜通过加权算法给出这个综合评分。

Q4: 如何在 AI 大模型排行榜中查找性价比最高的游戏？

在 AI 大模型排行榜页面中，您可以点击“价格”标题进行排序，寻找低价高分的模型。

Q5: AI 大模型排行榜的编程能力测试准吗？

AI 大模型排行榜参考了 LiveCodeBench 等权威基准测试，具有极高的参考价值。

Q6: 为什么有的新模型没进入 AI 大模型排行榜？

模型进入 AI 大模型排行榜需要经过一系列测试，通常在新模型发布后数日内会完成更新。

Q7: AI 大模型排行榜中的价格计算标准是什么？

价格是基于百万 Token 的调用成本，由 AI 大模型排行榜统一混合计算得出。

Q8: 手机上能查看 AI 大模型排行榜吗？

当然可以。AI 大模型排行榜进行了移动端响应式深度优化。

Q9: AI 大模型排行榜这个工具免费吗？

是的，由值品工具箱免费提供 AI 大模型排行榜信息查询服务。

Q10: 我该怎么利用 AI 大模型排行榜做选型？

如果您需要智能客服，参考 AI 大模型排行榜的综合指数；如果做翻译，参考编程外的语言指标。

发表评论

称呼 *

Email *

网站

内容 *

请友善文明留言

AI 大模型排行榜 (Artificial Analysis LLM Ranking)

AI 大模型排行榜数据中心

榜单解读建议

指标说明

AI 大模型排行榜 常见问题 (FAQ)

Q1: AI 大模型排行榜 的数据多久更新？

Q2: 这个 AI 大模型排行榜 包含国产模型吗？

Q3: 综合指数在 AI 大模型排行榜 中代表什么？

Q4: 如何在 AI 大模型排行榜 中查找性价比最高的游戏？

Q5: AI 大模型排行榜 的编程能力测试准吗？