AI 大模型排名 ArtificialAnalysis AI 大模型排行榜

信息查询
1k 次浏览
100% 有帮助 · 1 人反馈

AI 大模型排名 Artificial Analysis AI 大模型排行榜,综合对超过 100 个 AI 模型(LLM)的性能进行了比较和排名,评估指标包括智能程度、价格以及常见AI基准测试的结果。

AI 大模型排行榜数据中心

重置
排名 模型名称 综合指数 ▼ 编程 价格 ($/1M)
1 GPT-5.5 (xhigh) 60.2 59.1 $11.25
2 GPT-5.5 (high) 58.9 58.5 $11.25
3 Claude Opus 4.7 (Adaptive Reasoning, Max Effort) 57.3 52.5 $10.938
4 Gemini 3.1 Pro Preview 57.2 55.5 $4.5
5 GPT-5.4 (xhigh) 56.8 57.3 $5.625
6 GPT-5.5 (medium) 56.7 56.2 $11.25
7 Kimi K2.6 53.9 47.1 $1.712
8 MiMo-V2.5-Pro 53.8 45.5 $1.5
9 GPT-5.3 Codex (xhigh) 53.6 53.1 $4.813
10 Grok 4.3 53.2 41 $1.563
11 Claude Opus 4.6 (Adaptive Reasoning, Max Effort) 53 48.1 $10.938
12 Muse Spark 52.1 47.5 $0
13 Claude Opus 4.7 (Non-reasoning, High Effort) 51.8 53.1 $10.938
14 Qwen3.6 Max Preview 51.8 44.9 $2.925
15 Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) 51.7 50.9 $6.563
16 DeepSeek V4 Pro (Reasoning, Max Effort) 51.5 47.5 $2.175
17 GLM-5.1 (Reasoning) 51.4 43.4 $2.15
18 GPT-5.2 (xhigh) 51.3 48.7 $4.813
19 GPT-5.5 (low) 50.8 52.1 $11.25
20 Qwen3.6 Plus 50 42.9 $1.125
21 DeepSeek V4 Pro (Reasoning, High Effort) 49.8 43.3 $2.175
22 GLM-5 (Reasoning) 49.8 44.2 $1.55
23 Claude Opus 4.5 (Reasoning) 49.7 47.8 $10.938
24 MiniMax-M2.7 49.6 41.9 $0.525
25 Grok 4.20 0309 v2 (Reasoning) 49.3 40.5 $3
26 MiMo-V2-Pro 49.2 41.4 $1.5
27 MiMo-V2.5 49 42.1 $0.72
28 GPT-5.2 Codex (xhigh) 49 43 $4.813
29 GPT-5.4 mini (xhigh) 48.9 51.5 $1.688
30 Grok 4.20 0309 (Reasoning) 48.5 42.2 $3
31 Gemini 3 Pro Preview (high) 48.4 46.5 $4.5
32 GPT-5.4 (low) 47.9 45.6 $5.625
33 GPT-5.1 (high) 47.7 44.7 $3.438
34 GLM-5-Turbo 46.8 36.8 $0
35 Kimi K2.5 (Reasoning) 46.8 39.5 $1.136
36 GPT-5.2 (medium) 46.6 44.2 $4.813
37 DeepSeek V4 Flash (Reasoning, Max Effort) 46.5 38.7 $0.175
38 Claude Opus 4.6 (Non-reasoning, High Effort) 46.5 47.6 $10.938
39 Gemini 3 Flash Preview (Reasoning) 46.4 42.6 $1.125
40 Qwen3.6 27B (Reasoning) 45.8 36.5 $1.35
41 Qwen3.5 397B A17B (Reasoning) 45 41.3 $1.35
42 DeepSeek V4 Flash (Reasoning, High Effort) 44.9 39.8 $0.175
43 MiMo-V2-Omni-0327 44.9 36.9 $0.8
44 GPT-5 (high) 44.6 36 $3.438
45 GPT-5 Codex (high) 44.6 38.9 $3.438
46 Claude Sonnet 4.6 (Non-reasoning, High Effort) 44.4 46.4 $6.563
47 GPT-5.4 nano (xhigh) 44 43.9 $0.463
48 KAT Coder Pro V2 43.8 45.6 $0.525
49 GLM-5.1 (Non-reasoning) 43.8 35.8 $2.15
50 Qwen3.6 35B A3B (Reasoning) 43.5 35.1 $0.557
51 MiMo-V2-Omni 43.4 35.5 $0
52 GPT-5.1 Codex (high) 43.1 36.6 $3.438
53 Claude Opus 4.5 (Non-reasoning) 43.1 42.9 $10.938
54 Kimi K2.6 (Non-reasoning) 43 38.4 $1.712
55 Claude 4.5 Sonnet (Reasoning) 43 38.6 $6.563
56 GLM 5V Turbo (Reasoning) 42.9 36.2 $0
57 Claude Sonnet 4.6 (Non-reasoning, Low Effort) 42.6 43 $6.563
58 GLM-4.7 (Reasoning) 42.1 36.3 $1
59 Qwen3.5 27B (Reasoning) 42.1 34.9 $0.825
60 GPT-5 (medium) 42 39 $3.438
61 Claude 4.1 Opus (Reasoning) 42 36.5 $32.813
62 Hy3-preview (Reasoning) 41.9 36.5 $0
63 MiniMax-M2.5 41.9 37.4 $0.525
64 DeepSeek V3.2 (Reasoning) 41.7 36.7 $0.337
65 Qwen3.5 122B A10B (Reasoning) 41.6 34.7 $1.1
66 MiMo-V2-Flash (Feb 2026) 41.5 33.5 $0.15
67 Grok 4 41.5 40.5 $8.5
68 Gemini 3 Pro Preview (low) 41.3 39.4 $4.5
69 GPT-5 mini (high) 41.2 35.3 $0.688
70 GPT-5.5 (Non-reasoning) 40.9 48.6 $11.25
71 Kimi K2 Thinking 40.9 34.8 $1.075
72 o3-pro 40.7 - $35
73 GLM-5 (Non-reasoning) 40.6 39 $1.55
74 Qwen3.5 397B A17B (Non-reasoning) 40.1 37.4 $1.35
75 Qwen3 Max Thinking 39.9 30.5 $2.4
76 MiniMax-M2.1 39.4 32.8 $0.525
77 DeepSeek V4 Pro (Non-reasoning) 39.3 38.4 $2.175
78 Gemma 4 31B (Reasoning) 39.2 38.7 $0
79 Mistral Medium 3.5 39.2 35.4 $3
80 GPT-5 (low) 39.2 30.7 $3.438
81 MiMo-V2-Flash (Reasoning) 39.2 31.8 $0.15
82 Claude 4 Opus (Reasoning) 39 34 $32.813
83 GPT-5 mini (medium) 38.9 32.9 $0.688
84 Claude 4 Sonnet (Reasoning) 38.7 34.1 $6.563
85 Grok 4.1 Fast (Reasoning) 38.6 30.9 $0.275
86 Qwen3.5 Omni Plus 38.6 27.6 $1.5
87 GPT-5.1 Codex mini (high) 38.6 36.4 $0.688
88 Step 3.5 Flash 2603 38.5 34.6 $0
89 o3 38.4 38.4 $3.5
90 GPT-5.4 nano (medium) 38.1 35 $0.463
91 Step 3.5 Flash 37.8 31.6 $0.15
92 GPT-5.4 mini (medium) 37.7 37.5 $1.688
93 Kimi K2.5 (Non-reasoning) 37.3 25.8 $1.2
94 Qwen3.5 27B (Non-reasoning) 37.2 33.4 $0.835
95 Claude 4.5 Haiku (Reasoning) 37.1 32.6 $2.188
96 Qwen3.6 27B (Non-reasoning) 37.1 26.6 $1.35
97 Claude 4.5 Sonnet (Non-reasoning) 37.1 33.5 $6.563
98 Qwen3.5 35B A3B (Reasoning) 37.1 30.3 $0.688
99 DeepSeek V4 Flash (Non-reasoning) 36.5 35.1 $0.175
100 MiniMax-M2 36.1 29.2 $0.525
101 NVIDIA Nemotron 3 Super 120B A12B (Reasoning) 36 31.2 $0.412
102 KAT-Coder-Pro V1 36 18.3 $0.525
103 Claude 4.1 Opus (Non-reasoning) 36 - $32.813
104 Qwen3.5 122B A10B (Non-reasoning) 35.9 31.6 $1.1
105 Nova 2.0 Pro Preview (medium) 35.7 30.4 $3.438
106 MiMo-V2.5-Pro (Non-reasoning) 35.6 36.8 $1.5
107 GPT-5.4 (Non-reasoning) 35.4 41 $5.625
108 Grok 4 Fast (Reasoning) 35.1 27.4 $0.275
109 Gemini 3 Flash Preview (Non-reasoning) 35 37.8 $1.125
110 Claude 3.7 Sonnet (Reasoning) 34.7 27.6 $0
111 Gemini 2.5 Pro 34.6 31.9 $3.438
112 Nova 2.0 Lite (high) 34.5 23.4 $0.85
113 GLM-4.7 (Non-reasoning) 34.2 32 $1
114 DeepSeek V3.1 Terminus (Reasoning) 33.9 33.7 $1.914
115 Hy3-preview (Non-reasoning) 33.7 34.3 $0
116 Ling-2.6-1T 33.6 33 $0.85
117 GPT-5.2 (Non-reasoning) 33.6 34.7 $4.813
118 Gemini 3.1 Flash-Lite Preview 33.5 30.1 $0.563
119 Doubao Seed Code 33.5 31.3 $0
120 gpt-oss-120B (high) 33.3 28.6 $0.262
121 o4-mini (high) 33.1 25.6 $1.925
122 Claude 4 Opus (Non-reasoning) 33 - $32.813
123 Claude 4 Sonnet (Non-reasoning) 33 30.6 $6.563
124 DeepSeek V3.2 Exp (Reasoning) 32.9 33.3 $0.31
125 Mercury 2 32.8 30.6 $0.375
126 GLM-4.6 (Reasoning) 32.5 29.5 $0.963
127 Qwen3 Max Thinking (Preview) 32.5 24.5 $2.4
128 Qwen3.5 9B (Reasoning) 32.4 25.3 $0.113
129 Gemma 4 31B (Non-reasoning) 32.3 33.9 $0
130 Grok 3 mini Reasoning (high) 32.1 25.2 $0.35
131 K-EXAONE (Reasoning) 32.1 27 $0
132 DeepSeek V3.2 (Non-reasoning) 32.1 34.6 $0.775
133 Nova 2.0 Pro Preview (low) 31.9 24.5 $3.438
134 Trinity Large Thinking 31.9 27.2 $0.395
135 Qwen3.6 35B A3B (Non-reasoning) 31.5 17.6 $0.844
136 Qwen3 Max 31.4 26.4 $3.047
137 Gemma 4 26B A4B (Reasoning) 31.2 22.4 $0.198
138 Claude 4.5 Haiku (Non-reasoning) 31.1 29.6 $2.188
139 Gemini 2.5 Flash Preview (Sep '25) (Reasoning) 31.1 24.6 $0
140 Grok 4.3 (Non-reasoning) 31 25.1 $1.563
141 Kimi K2 0905 30.9 25.9 $1.075
142 o1 30.8 20.5 $26.25
143 Claude 3.7 Sonnet (Non-reasoning) 30.8 26.7 $6.563
144 Qwen3.5 35B A3B (Non-reasoning) 30.7 16.8 $0.688
145 MiMo-V2-Flash (Non-reasoning) 30.4 25.8 $0.15
146 Gemini 2.5 Pro Preview (Mar' 25) 30.3 46.7 $0
147 EXAONE 4.5 33B 30.2 23 $0
148 GLM-4.6 (Non-reasoning) 30.2 30.2 $1
149 GLM-4.7-Flash (Reasoning) 30.1 25.9 $0.153
150 Nova 2.0 Lite (medium) 29.7 23.9 $0.85
151 Grok 4.20 0309 (Non-reasoning) 29.7 25.4 $3
152 Gemini 2.5 Pro Preview (May' 25) 29.5 - $3.438
153 Qwen3 235B A22B 2507 (Reasoning) 29.5 23.2 $0.838
154 DeepSeek V3.2 Speciale 29.4 37.9 $0
155 ERNIE 5.0 Thinking Preview 29.1 29.2 $0
156 Grok 4.20 0309 v2 (Non-reasoning) 29 22 $3
157 Grok Code Fast 1 28.7 23.7 $0.525
158 DeepSeek V3.1 Terminus (Non-reasoning) 28.5 31.9 $0.453
159 Nemotron Cascade 2 30B A3B 28.4 25.8 $0
160 DeepSeek V3.2 Exp (Non-reasoning) 28.4 30 $0.31
161 Qwen3 Coder Next 28.3 22.9 $0.563
162 Apriel-v1.5-15B-Thinker 28.3 18.7 $0
163 DeepSeek V3.1 (Non-reasoning) 28.1 28.4 $0.834
164 Nova 2.0 Omni (medium) 28 15.1 $0.85
165 Mistral Small 4 (Reasoning) 27.8 24.3 $0.262
166 DeepSeek V3.1 (Reasoning) 27.7 29.7 $0.865
167 Apriel-v1.6-15B-Thinker 27.6 22 $0
168 Qwen3 VL 235B A22B (Reasoning) 27.6 20.9 $2.174
169 GPT-5.1 (Non-reasoning) 27.4 27.3 $3.438
170 Qwen3.5 9B (Non-reasoning) 27.3 21.4 $0
171 Gemma 4 26B A4B (Non-reasoning) 27.1 29.1 $0
172 Magistral Medium 1.2 27.1 21.7 $2.75
173 DeepSeek R1 0528 (May '25) 27.1 24 $2.063
174 Qwen3.5 4B (Reasoning) 27.1 17.5 $0.06
175 Gemini 2.5 Flash (Reasoning) 27 22.2 $0.85
176 GPT-5 nano (high) 26.8 20.3 $0.138
177 Qwen3 Next 80B A3B (Reasoning) 26.7 19.5 $1.875
178 GLM-4.5 (Reasoning) 26.4 26.3 $1
179 GPT-4.1 26.3 21.8 $3.5
180 Kimi K2 26.3 22.1 $1.039
181 Ling 2.6 Flash 26.2 23.2 $0.15
182 Qwen3 Max (Preview) 26.1 25.5 $2.4
183 Solar Pro 3 25.9 13.3 $0
184 Qwen3.5 Omni Flash 25.9 14 $0.275
185 o3-mini 25.9 17.9 $1.925
186 GPT-5 nano (medium) 25.9 22.9 $0.138
187 o1-pro 25.8 - $262.5
188 Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) 25.7 22.1 $0
189 JT-MINI 25.4 21.2 $0
190 o3-mini (high) 25.2 17.3 $1.925
191 Grok 3 25.2 19.8 $6
192 Seed-OSS-36B-Instruct 25.2 16.7 $0.3
193 Qwen3 235B A22B 2507 Instruct 25 22.1 $0.356
194 Qwen3 Coder 480B A35B Instruct 24.8 24.6 $0.675
195 Qwen3 VL 32B (Reasoning) 24.7 14.5 $2.625
196 Nova 2.0 Lite (low) 24.6 13.6 $0.85
197 Sonar Reasoning Pro 24.6 - $0
198 gpt-oss-120B (low) 24.5 15.5 $0.262
199 gpt-oss-20B (high) 24.5 18.5 $0.088
200 GPT-5.4 nano (Non-Reasoning) 24.4 27.9 $0.463
201 MiniMax M1 80k 24.4 14.5 $0.963
202 NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) 24.3 19 $0.096
203 Gemini 2.5 Flash Preview (Reasoning) 24.3 - $0
204 K2 Think V2 24.1 15.5 $0
205 LongCat Flash Lite 23.9 16.5 $0
206 GPT-5 (minimal) 23.9 25.1 $3.438
207 HyperCLOVA X SEED Think (32B) 23.7 17.5 $0
208 o1-preview 23.7 34 $28.875
209 Grok 4.1 Fast (Non-reasoning) 23.6 19.5 $0.275
210 K-EXAONE (Non-reasoning) 23.4 13.5 $0
211 GLM-4.6V (Reasoning) 23.4 19.7 $0.45
212 GPT-5.4 mini (Non-Reasoning) 23.3 25.3 $1.688
213 Nova 2.0 Omni (low) 23.2 13.9 $0.85
214 GLM-4.5-Air 23.2 23.8 $0.372
215 Nova 2.0 Pro Preview (Non-reasoning) 23.1 20.5 $3.438
216 Mi:dm K 2.5 Pro 23.1 12.6 $0
217 Grok 4 Fast (Non-reasoning) 23.1 19 $0.275
218 GPT-4.1 mini 22.9 18.5 $0.7
219 Mistral Large 3 22.8 22.7 $0.75
220 Ring-1T 22.8 16.8 $0
221 Qwen3.5 4B (Non-reasoning) 22.6 13.7 $0.06
222 Qwen3 30B A3B 2507 (Reasoning) 22.4 14.7 $0.673
223 DeepSeek V3 0324 22.3 22 $1.209
224 INTELLECT-3 22.2 19.1 $0
225 GLM-4.7-Flash (Non-reasoning) 22.1 11 $0.153
226 Devstral 2 22 23.7 $0
227 GPT-5 (ChatGPT) 21.8 21.2 $3.438
228 Solar Open 100B (Reasoning) 21.7 10.5 $0
229 Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) 21.6 18.1 $0.175
230 Grok 3 Reasoning Beta 21.6 - $0
231 Nemotron 3 Nano Omni 30B A3B Reasoning 21.4 14.8 $0.131
232 Mistral Medium 3.1 21.3 18.3 $0.8
233 MiniMax M1 40k 20.9 14.1 $0
234 gpt-oss-20B (low) 20.8 14.4 $0.095
235 Qwen3 VL 235B A22B Instruct 20.8 16.5 $0.7
236 GPT-5 mini (minimal) 20.7 21.9 $0.688
237 K2-V2 (high) 20.6 16.1 $0
238 Gemini 2.5 Flash (Non-reasoning) 20.6 17.8 $0.85
239 o1-mini 20.4 - $0
240 Qwen3 Next 80B A3B Instruct 20.1 15.3 $0.875
241 Tri-21B-think Preview 20 7.4 $0
242 GPT-4.5 (Preview) 20 - $0
243 Qwen3 Coder 30B A3B Instruct 20 19.4 $0.352
244 Qwen3 235B A22B (Reasoning) 19.8 17.4 $2.625
245 QwQ 32B 19.7 - $0.745
246 Qwen3 VL 30B A3B (Reasoning) 19.7 13.1 $0.338
247 Gemini 2.0 Flash Thinking Experimental (Jan '25) 19.6 24.1 $0
248 Devstral Small 2 19.5 20.7 $0
249 Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) 19.4 14.5 $0.175
250 Motif-2-12.7B-Reasoning 19.1 11.9 $0
251 Nova Premier 19 13.8 $5
252 Ling-1T 19 18.8 $0
253 Gemma 4 E4B (Reasoning) 18.8 13.7 $0.537
254 Magistral Medium 1 18.8 16 $0
255 Mistral Medium 3 18.8 13.6 $0.8
256 DeepSeek R1 (Jan '25) 18.8 15.9 $2.431
257 Solar Pro 2 (Preview) (Reasoning) 18.8 - $0
258 Llama Nemotron Super 49B v1.5 (Reasoning) 18.7 15.2 $0.175
259 K2-V2 (medium) 18.7 14 $0
260 Claude 3.5 Haiku 18.7 10.7 $1.75
261 Devstral Medium 18.7 15.9 $0.8
262 Mistral Small 4 (Non-reasoning) 18.6 16.4 $0.262
263 Hermes 4 - Llama-3.1 405B (Reasoning) 18.6 16 $1.5
264 Tri-21B-Think 18.6 6.3 $0
265 GPT-4o (Aug '24) 18.6 16.6 $4.375
266 GPT-4o (March 2025, chatgpt-4o-latest) 18.6 - $0
267 Llama 3.3 Nemotron Super 49B v1 (Reasoning) 18.5 9.4 $0
268 Gemini 2.0 Flash (Feb '25) 18.5 13.6 $0.262
269 Llama 4 Maverick 18.4 15.6 $0.475
270 Magistral Small 1.2 18.2 14.8 $0.75
271 Sarvam 105B (high) 18.2 9.8 $0
272 Qwen3 4B 2507 (Reasoning) 18.2 9.5 $0
273 Gemini 2.0 Pro Experimental (Feb '25) 18.1 25.5 $0
274 Nova 2.0 Lite (Non-reasoning) 18 12.5 $0.85
275 Claude 3 Opus 18 19.5 $32.813
276 Devstral Small (May '25) 18 12.2 $0
277 Sonar Reasoning 17.9 - $0
278 Gemini 2.5 Flash Preview (Non-reasoning) 17.8 - $0
279 Hermes 4 - Llama-3.1 405B (Non-reasoning) 17.6 18.1 $1.5
280 Gemini 2.5 Flash-Lite (Reasoning) 17.6 9.5 $0.175
281 Llama 3.1 Instruct 405B 17.4 14.5 $3.688
282 GPT-4o (Nov '24) 17.3 16.7 $4.375
283 DeepSeek R1 Distill Qwen 32B 17.2 - $0
284 Qwen3 VL 32B Instruct 17.2 15.6 $1.225
285 GLM-4.6V (Non-reasoning) 17.1 11.1 $0.45
286 Qwen3 235B A22B (Non-reasoning) 17 14 $0.787
287 Gemini 2.0 Flash (experimental) 16.8 - $0
288 Magistral Small 1 16.8 11.1 $0
289 EXAONE 4.0 32B (Reasoning) 16.7 14 $0
290 Qwen3 VL 8B (Reasoning) 16.7 9.8 $0.66
291 Nova 2.0 Omni (Non-reasoning) 16.6 13.8 $0.85
292 DeepSeek V3 (Dec '24) 16.5 16.4 $0.523
293 Qwen3 32B (Reasoning) 16.5 13.8 $0.276
294 DeepSeek R1 0528 Qwen3 8B 16.4 7.8 $0
295 Qwen3.5 2B (Reasoning) 16.3 3.5 $0.04
296 Qwen2.5 Max 16.3 - $2.8
297 Qwen3 14B (Reasoning) 16.2 13.1 $0.731
298 Nanbeige4.1-3B 16.1 8.9 $0
299 Qwen3 VL 30B A3B Instruct 16.1 14.3 $0.3
300 Ministral 3 14B 16 10.9 $0.2
301 DeepSeek R1 Distill Llama 70B 16 11.4 $0.787
302 Hermes 4 - Llama-3.1 70B (Reasoning) 16 14.4 $0.198
303 Gemini 1.5 Pro (Sep '24) 16 23.6 $0
304 Solar Pro 2 (Preview) (Non-reasoning) 16 - $0
305 Claude 3.5 Sonnet (Oct '24) 15.9 30.2 $6.563
306 Falcon-H1R-7B 15.8 9.8 $0
307 DeepSeek R1 Distill Qwen 14B 15.8 - $0
308 Ling-flash-2.0 15.7 16.7 $0.247
309 Qwen3 Omni 30B A3B (Reasoning) 15.6 12.7 $0.43
310 Qwen2.5 Instruct 72B 15.6 11.9 $0.37
311 Sonar 15.5 - $0
312 Step3 VL 10B 15.4 13.9 $0
313 Qwen3 30B A3B (Reasoning) 15.3 11 $0.18
314 Gemma 4 E2B (Reasoning) 15.2 9 $0
315 Devstral Small (Jul '25) 15.2 12.1 $0.15
316 Sonar Pro 15.2 - $0
317 QwQ 32B-Preview 15.2 - $0
318 Mistral Large 2 (Nov '24) 15.1 13.8 $3
319 Mistral Small 3.2 15.1 13.3 $0.128
320 GLM-4.5V (Reasoning) 15.1 10.9 $0.9
321 Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) 15 13.1 $0.9
322 ERNIE 4.5 300B A47B 15 14.5 $0.485
323 Qwen3 30B A3B 2507 Instruct 15 14.2 $0.213
324 Solar Pro 2 (Reasoning) 14.9 12.1 $0
325 NVIDIA Nemotron Nano 12B v2 VL (Reasoning) 14.9 11.8 $0.3
326 Gemma 4 E4B (Non-reasoning) 14.8 6.4 $0.537
327 Ministral 3 8B 14.8 10 $0.15
328 NVIDIA Nemotron Nano 9B V2 (Reasoning) 14.8 8.3 $0.07
329 NVIDIA Nemotron 3 Nano 4B 14.7 10 $0
330 Granite 4.1 30B 14.7 10.1 $0
331 Qwen3.5 2B (Non-reasoning) 14.7 4.9 $0.04
332 Gemini 2.0 Flash-Lite (Feb '25) 14.7 - $0
333 Llama Nemotron Super 49B v1.5 (Non-reasoning) 14.6 10.5 $0.175
334 Llama 3.3 Instruct 70B 14.5 10.7 $0.616
335 GPT-4o (May '24) 14.5 24.2 $7.5
336 Gemini 2.0 Flash-Lite (Preview) 14.5 - $0
337 Mistral Small 3.1 14.5 13.9 $0.138
338 Qwen3 32B (Non-reasoning) 14.5 - $0.26
339 Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) 14.4 - $0
340 Kimi Linear 48B A3B Instruct 14.4 14.2 $0
341 K2-V2 (low) 14.4 10.5 $0
342 Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) 14.3 7.6 $0
343 Qwen3 VL 8B Instruct 14.3 7.3 $0.31
344 Claude 3.5 Sonnet (June '24) 14.2 26 $6.563
345 Qwen3 4B (Reasoning) 14.2 - $0.398
346 GPT-4o (ChatGPT) 14.1 - $0
347 Llama 3.1 Tulu3 405B 14.1 - $0
348 Ring-flash-2.0 14 10.6 $0.247
349 Pixtral Large 14 - $3
350 Olmo 3.1 32B Think 13.9 9.8 $0
351 Grok 2 (Dec '24) 13.9 - $0
352 GPT-5 nano (minimal) 13.8 14.2 $0.138
353 Gemini 1.5 Flash (Sep '24) 13.8 - $0
354 GPT-4 Turbo 13.7 21.5 $15
355 Qwen3 VL 4B (Reasoning) 13.7 6.7 $0
356 Solar Pro 2 (Non-reasoning) 13.6 11.3 $0
357 Llama 4 Scout 13.5 6.7 $0.292
358 Command A 13.5 9.9 $4.375
359 Nova Pro 13.5 11 $1.4
360 Llama 3.1 Nemotron Instruct 70B 13.4 10.8 $1.2
361 Grok Beta 13.3 - $0
362 NVIDIA Nemotron Nano 9B V2 (Non-reasoning) 13.2 7.5 $0.086
363 NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) 13.2 15.8 $0.088
364 Qwen2.5 Instruct 32B 13.2 - $0
365 Qwen3 8B (Reasoning) 13.2 9 $0.37
366 GPT-4.1 nano 13 11.2 $0.175
367 Mistral Large 2 (Jul '24) 13 - $3
368 Qwen2.5 Coder Instruct 32B 12.9 - $0
369 Qwen3 4B 2507 Instruct 12.9 9.1 $0
370 GPT-4 12.8 13.1 $37.5
371 Qwen3 14B (Non-reasoning) 12.8 12.4 $0.381
372 Gemini 2.5 Flash-Lite (Non-reasoning) 12.7 7.4 $0.175
373 Mistral Small 3 12.7 - $0.104
374 Nova Lite 12.7 5.1 $0.105
375 GLM-4.5V (Non-reasoning) 12.7 10.8 $0.9
376 Hermes 4 - Llama-3.1 70B (Non-reasoning) 12.6 9.2 $0.198
377 GPT-4o mini 12.6 - $0.262
378 Llama 3.1 Instruct 70B 12.5 10.9 $0.56
379 DeepSeek-V2.5 (Dec '24) 12.5 - $0
380 Qwen3 4B (Non-reasoning) 12.5 - $0.188
381 Qwen3 30B A3B (Non-reasoning) 12.5 13.3 $0.133
382 Granite 4.1 8B 12.4 7.3 $0.063
383 Sarvam 30B (high) 12.3 7.9 $0
384 Gemini 2.0 Flash Thinking Experimental (Dec '24) 12.3 - $0
385 Claude 3 Haiku 12.3 6.7 $0.5
386 DeepSeek-V2.5 12.3 - $0
387 Olmo 3.1 32B Instruct 12.2 5.6 $0
388 Gemma 4 E2B (Non-reasoning) 12.1 8.3 $0
389 Mistral Saba 12.1 - $0
390 DeepSeek R1 Distill Llama 8B 12.1 - $0
391 Olmo 3 32B Think 12.1 10.5 $0
392 R1 1776 12 - $0
393 Gemini 1.5 Pro (May '24) 12 19.8 $0
394 Reka Flash (Sep '24) 12 - $0.35
395 Qwen2.5 Turbo 12 - $0.088
396 Llama 3.2 Instruct 90B (Vision) 11.9 - $1.38
397 Solar Mini 11.9 - $0.15
398 Llama 3.1 Instruct 8B 11.8 4.9 $0.1
399 Grok-1 11.7 - $0
400 EXAONE 4.0 32B (Non-reasoning) 11.7 9.4 $0
401 Qwen2 Instruct 72B 11.7 - $0
402 Ministral 3 3B 11.2 4.8 $0.1
403 Gemini 1.5 Flash-8B 11.1 - $0
404 DeepHermes 3 - Mistral 24B Preview (Non-reasoning) 10.9 - $0
405 Jamba 1.7 Large 10.9 7.8 $3.5
406 Granite 4.0 H Small 10.8 8.5 $0.107
407 Qwen3 Omni 30B A3B Instruct 10.7 7.2 $0.43
408 Jamba 1.5 Large 10.7 - $3.5
409 DeepSeek-Coder-V2 10.6 - $0
410 OLMo 2 32B 10.6 2.7 $0
411 Hermes 3 - Llama-3.1 70B 10.6 - $0.3
412 Jamba 1.6 Large 10.6 - $3.5
413 Qwen3 8B (Non-reasoning) 10.6 7.1 $0.185
414 LFM2 24B A2B 10.5 3.6 $0.052
415 Qwen3.5 0.8B (Reasoning) 10.5 0 $0.02
416 Gemini 1.5 Flash (May '24) 10.5 - $0
417 Phi-4 10.4 11.2 $0.219
418 Nova Micro 10.3 4.1 $0.061
419 Gemma 3 27B Instruct 10.3 9.6 $0.145
420 Claude 3 Sonnet 10.3 - $6
421 Mistral Small (Sep '24) 10.2 - $0.3
422 NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) 10.1 5.9 $0.3
423 Gemma 3n E4B Instruct Preview (May '25) 10.1 - $0
424 Gemini 1.0 Ultra 10.1 17.6 $0
425 Phi-3 Mini Instruct 3.8B 10.1 3 $0
426 Phi-4 Multimodal Instruct 10 - $0
427 Qwen2.5 Coder Instruct 7B 10 - $0
428 Qwen3.5 0.8B (Non-reasoning) 9.9 1 $0.02
429 Mistral Large (Feb '24) 9.9 - $6
430 Mixtral 8x22B Instruct 9.8 - $0
431 Llama 3.2 Instruct 3B 9.7 - $0.15
432 Llama 2 Chat 7B 9.7 - $0.1
433 Jamba Reasoning 3B 9.6 2.5 $0
434 Qwen3 VL 4B Instruct 9.6 4.5 $0
435 Reka Flash 3 9.5 8.9 $0.35
436 Qwen1.5 Chat 110B 9.5 - $0
437 Olmo 3 7B Think 9.4 7.6 $0
438 Claude 2.1 9.3 14 $0
439 OLMo 2 7B 9.3 1.2 $0
440 Molmo 7B-D 9.2 1.2 $0
441 Ling-mini-2.0 9.2 5 $0
442 Claude 2.0 9.1 12.9 $0
443 DeepSeek R1 Distill Qwen 1.5B 9.1 - $0
444 DeepSeek-V2-Chat 9.1 - $0
445 GPT-3.5 Turbo 9 10.7 $0.75
446 Mistral Small (Feb '24) 9 - $1.5
447 Mistral Medium 9 - $4.088
448 Llama 3 Instruct 70B 8.9 6.8 $1.175
449 Gemma 3 12B Instruct 8.8 6.3 $0.14
450 LFM 40B 8.8 - $0
451 Arctic Instruct 8.8 - $0
452 Qwen Chat 72B 8.8 - $0
453 Llama 3.2 Instruct 11B (Vision) 8.7 4.3 $0.245
454 PALM-2 8.6 4.6 $0
455 Granite 4.1 3B 8.5 5.5 $0
456 Gemini 1.0 Pro 8.5 - $0
457 DeepSeek Coder V2 Lite Instruct 8.5 - $0
458 Phi-4 Mini Instruct 8.4 3.6 $0
459 Llama 2 Chat 70B 8.4 - $0
460 Llama 2 Chat 13B 8.4 - $0
461 DeepSeek LLM 67B Chat (V1) 8.4 - $0
462 Sarvam M (Reasoning) 8.4 7.5 $0
463 Exaone 4.0 1.2B (Reasoning) 8.3 3.1 $0
464 OpenChat 3.5 (1210) 8.3 - $0
465 DBRX Instruct 8.3 - $0
466 Command-R+ (Apr '24) 8.3 - $6
467 Olmo 3 7B Instruct 8.2 3.4 $0.125
468 LFM2.5-1.2B-Thinking 8.1 1.4 $0
469 Exaone 4.0 1.2B (Non-reasoning) 8.1 2.5 $0
470 Jamba 1.7 Mini 8.1 3.1 $0
471 LFM2.5-1.2B-Instruct 8 0.8 $0
472 LFM2 2.6B 8 1.4 $0
473 Granite 4.0 H 1B 8 2.7 $0
474 Jamba 1.5 Mini 8 - $0.25
475 Qwen3 1.7B (Reasoning) 8 1.4 $0.398
476 Jamba 1.6 Mini 7.9 - $0.25
477 Gemma 3 270M 7.7 0 $0
478 Granite 4.0 Micro 7.7 5 $0
479 Apertus 70B Instruct 7.7 1.9 $1.345
480 Mixtral 8x7B Instruct 7.7 - $0.512
481 DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) 7.6 - $0
482 Llama 65B 7.4 - $0
483 Qwen Chat 14B 7.4 - $0
484 Claude Instant 7.4 7.8 $0
485 Mistral 7B Instruct 7.4 - $0.206
486 Command-R (Mar '24) 7.4 - $0.75
487 Molmo2-8B 7.3 4.4 $0
488 Granite 4.0 1B 7.3 2.9 $0
489 LFM2 8B A1B 7 2.3 $0
490 Granite 3.3 8B (Non-reasoning) 7 3.4 $0.085
491 Qwen3 1.7B (Non-reasoning) 6.8 2.3 $0.188
492 Qwen3 0.6B (Reasoning) 6.5 0.9 $0.398
493 Llama 3 Instruct 8B 6.4 4 $0.07
494 Gemma 3n E4B Instruct 6.4 4.2 $0.025
495 Llama 3.2 Instruct 1B 6.3 0.6 $0.05
496 Gemma 3 4B Instruct 6.3 2.9 $0.05
497 LFM2 1.2B 6.3 0.8 $0
498 LFM2.5-VL-1.6B 6.2 1 $0
499 Granite 4.0 350M 6.1 0.3 $0
500 Apertus 8B Instruct 5.9 1.4 $0.125
501 Qwen3 0.6B (Non-reasoning) 5.7 1.4 $0.188
502 Gemma 3 1B Instruct 5.5 0.2 $0
503 Granite 4.0 H 350M 5.4 0.6 $0
504 Gemma 3n E2B Instruct 4.8 2.2 $0
505 Tiny Aya Global 4.7 1.2 $0
506 GPT-5.5 Pro (xhigh) - - $0
507 Gemini 3 Deep Think - - $0
508 EXAONE 4.5 33B (Non-reasoning) - - $0
509 Cogito v2.1 (Reasoning) - 24.8 $1.25
510 Mi:dm K 2.5 Pro Preview - 11.9 $0
511 GPT-4o mini Realtime (Dec '24) - - $0
512 GPT-5.4 Pro (xhigh) - - $67.5
513 GPT-4o Realtime (Dec '24) - - $0
514 GPT-3.5 Turbo (0613) - - $0

榜单解读建议

参考 AI 大模型排行榜 时,应综合考虑“综合指数”与“成本价格”。如果您是开发者,编程能力 (Coding) 是更核心的指标。

值品工具箱同步的 AI 大模型排行榜 数据每 24 小时更新,确保您获取到最新的模型性能对比。

指标说明

  • 综合指数:评估通用理解与逻辑。
  • 价格 $/1M:混合 3:1 输入输出比的平均成本。
  • 编程能力:衡量代码生成的准确性。

AI 大模型排行榜 常见问题 (FAQ)

Q1: AI 大模型排行榜 的数据多久更新?

AI 大模型排行榜 数据每 24 小时自动抓取一次,确保最新模型加入列表。

Q2: 这个 AI 大模型排行榜 包含国产模型吗?

是的,只要国产模型通过了 Artificial Analysis 的全球测评,就会出现在 AI 大模型排行榜 中。

Q3: 综合指数在 AI 大模型排行榜 中代表什么?

它代表模型的全能表现。AI 大模型排行榜 通过加权算法给出这个综合评分。

Q4: 如何在 AI 大模型排行榜 中查找性价比最高的游戏?

在 AI 大模型排行榜 页面中,您可以点击“价格”标题进行排序,寻找低价高分的模型。

Q5: AI 大模型排行榜 的编程能力测试准吗?

AI 大模型排行榜 参考了 LiveCodeBench 等权威基准测试,具有极高的参考价值。

Q6: 为什么有的新模型没进入 AI 大模型排行榜?

模型进入 AI 大模型排行榜 需要经过一系列测试,通常在新模型发布后数日内会完成更新。

Q7: AI 大模型排行榜 中的价格计算标准是什么?

价格是基于百万 Token 的调用成本,由 AI 大模型排行榜 统一混合计算得出。

Q8: 手机上能查看 AI 大模型排行榜 吗?

当然可以。AI 大模型排行榜 进行了移动端响应式深度优化。

Q9: AI 大模型排行榜 这个工具免费吗?

是的,由值品工具箱免费提供 AI 大模型排行榜 信息查询服务。

Q10: 我该怎么利用 AI 大模型排行榜 做选型?

如果您需要智能客服,参考 AI 大模型排行榜 的综合指数;如果做翻译,参考编程外的语言指标。

发表评论

请友善文明留言