AI 大模型排行榜 (Artificial Analysis LLM Ranking)

信息查询
2.1k 次浏览
100% 有帮助 · 1 人反馈

值品工具箱提供的 AI 大模型排行榜聚合了来自 Artificial Analysis 的权威数据,实时追踪并排名超过 100 个主流大语言模型。

AI 大模型排行榜数据中心

重置
排名 模型名称 综合指数 ▼ 编程 价格 ($/1M)
1 Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) 59.9 76.5 $20
2 Claude Opus 4.8 (Adaptive Reasoning, Max Effort) 55.7 74.3 $10
3 GPT-5.5 (xhigh) 54.8 74.9 $11.25
4 Claude Opus 4.7 (Adaptive Reasoning, Max Effort) 53.5 73.6 $10
5 GPT-5.5 (high) 53.1 71.6 $11.25
6 GPT-5.4 (xhigh) 51.4 71.1 $5.625
7 GLM-5.2 (max) 51.1 68.8 $2.15
8 GPT-5.5 (medium) 50.4 71.5 $11.25
9 Gemini 3.5 Flash (high) 50.2 70.1 $3.375
10 Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) 47.2 63 $6
11 Gemini 3.1 Pro Preview 46.5 68.8 $4.5
12 Qwen3.7 Max 46 66 $3.75
13 Gemini 3.5 Flash (medium) 45.4 - $3.375
14 MiniMax-M3 44.4 58.6 $0.525
15 GPT-5.3 Codex (xhigh) 44.3 - $4.813
16 DeepSeek V4 Pro (Reasoning, Max Effort) 44.3 59.4 $0.544
17 Claude Opus 4.6 (Adaptive Reasoning, Max Effort) 43.7 - $10
18 GPT-5.5 (low) 43.5 60.9 $11.25
19 Muse Spark 43.1 58.6 $0
20 Kimi K2.6 42.8 56 $1.712
21 Claude Opus 4.7 (Non-reasoning, High Effort) 42.7 - $10
22 MiMo-V2.5-Pro 42.2 60.2 $0.544
23 GPT-5.2 (xhigh) 42.2 - $4.813
24 Kimi K2.7 Code 41.9 60.8 $1.712
25 Nex-N2-Pro 41 59.1 $1
26 DeepSeek V4 Pro (Reasoning, High Effort) 40.8 - $0.544
27 Claude Opus 4.5 (Reasoning) 40.8 - $10
28 DeepSeek V4 Flash (Reasoning, Max Effort) 40.3 56.2 $0.175
29 MiMo-V2-Pro 40.3 - $1.5
30 GLM-5.1 (Reasoning) 40.2 55.8 $2.15
31 MiMo-V2.5 40.1 - $0.175
32 GPT-5.2 Codex (xhigh) 40.1 - $4.813
33 GPT-5.4 mini (xhigh) 40 56.1 $1.688
34 Qwen3.6 Max Preview 40 - $2.925
35 Grok Build 0.1 0616 39.8 51.5 $1.25
36 Qwen3.6 Plus 39.6 54.5 $1.125
37 Gemini 3 Pro Preview (high) 39.6 - $4.5
38 GLM-5 (Reasoning) 39.5 - $1.55
39 GPT-5.4 (low) 39.1 - $5.906
40 Qwen3.7 Plus 39 55.9 $0.59
41 GPT-5.1 (high) 38.9 - $3.438
42 GPT-5.4 nano (xhigh) 38.2 56.1 $0.463
43 MiniMax-M2.7 38.1 52.6 $0.525
44 GLM-5-Turbo 38.1 - $0
45 Kimi K2.5 (Reasoning) 38.1 - $1.185
46 GPT-5.2 (medium) 38 - $4.813
47 Nemotron 3 Ultra 550B A55B (Reasoning) 37.8 49.3 $1.175
48 Gemini 3 Flash Preview (Reasoning) 37.8 - $1.125
49 Claude Opus 4.6 (Non-reasoning, High Effort) 37.8 - $10
50 Grok 4.3 (high) 37.6 42.2 $1.563
51 DeepSeek V4 Flash (Reasoning, High Effort) 37.4 - $0.175
52 Qwen3.6 27B (Reasoning) 37.1 53.7 $1.35
53 Grok 4.20 0309 v2 (Reasoning) 37 - $3
54 Grok 4.20 0309 (Reasoning) 36.5 - $3
55 MiMo-V2-Omni-0327 36.4 - $0.8
56 GPT-5 (high) 36.1 - $3.438
57 GPT-5 Codex (high) 36.1 - $3.438
58 Grok 4.3 (medium) 36 - $1.563
59 Claude Sonnet 4.6 (Non-reasoning, High Effort) 35.9 - $6
60 GPT-5.5 (Non-reasoning) 35.4 56.5 $11.25
61 Grok 4.3 (low) 35.4 - $1.563
62 KAT Coder Pro V2 35.4 - $0.525
63 GLM-5.1 (Non-reasoning) 35.4 - $2.15
64 MiMo-V2-Omni 35 - $0
65 Gemini 3.5 Flash (minimal) 34.9 - $3.375
66 GPT-5.1 Codex (high) 34.7 - $3.438
67 Claude 4.5 Sonnet (Reasoning) 34.7 - $6
68 Claude Opus 4.5 (Non-reasoning) 34.7 - $10
69 Kimi K2.6 (Non-reasoning) 34.6 - $1.712
70 KAT-Coder-Pro V1 34.6 58.9 $0.525
71 GLM 5V Turbo (Reasoning) 34.5 - $0
72 Claude Sonnet 4.6 (Non-reasoning, Low Effort) 34.3 - $6
73 GLM-4.7 (Reasoning) 33.8 - $1
74 Qwen3.5 27B (Reasoning) 33.8 - $0.825
75 Qwen3.5 397B A17B (Reasoning) 33.7 48.2 $1.35
76 GPT-5 (medium) 33.7 - $3.438
77 Claude 4.1 Opus (Reasoning) 33.7 - $30
78 MiniMax-M2.5 33.7 - $0.525
79 Hy3-preview (Reasoning) 33.6 - $0.2
80 GPT-5.5 Instant (May 2026) 33.5 - $11.25
81 DeepSeek V3.2 (Reasoning) 33.4 - $0.337
82 Grok 4 33.3 - $11
83 MiMo-V2-Flash (Feb 2026) 33.2 - $0.15
84 Gemini 3 Pro Preview (low) 33.1 - $4.5
85 GPT-5 mini (high) 33 - $0.688
86 Kimi K2 Thinking 32.7 - $1.075
87 o3-pro 32.5 - $35
88 GLM-5 (Non-reasoning) 32.4 - $1.55
89 Qwen3.5 122B A10B (Reasoning) 32.3 45.7 $1.1
90 Qwen3.5 397B A17B (Non-reasoning) 32 - $1.35
91 Qwen3 Max Thinking 31.7 - $2.4
92 Qwen3.6 35B A3B (Reasoning) 31.6 41.9 $0.557
93 MiniMax-M2.1 31.4 - $0.525
94 DeepSeek V4 Pro (Non-reasoning) 31.2 - $0.544
95 GPT-5 (low) 31.2 - $3.438
96 MiMo-V2-Flash (Reasoning) 31.2 - $0.15
97 Claude 4 Opus (Reasoning) 31 - $30
98 GPT-5 mini (medium) 30.9 - $0.688
99 Claude 4 Sonnet (Reasoning) 30.7 - $6
100 Qwen3.5 Omni Plus 30.6 - $1.5
101 Ring-2.6-1T 30.6 42.8 $0.85
102 GPT-5.1 Codex mini (high) 30.6 - $0.688
103 Grok 4.1 Fast (Reasoning) 30.6 - $0
104 o3 30.4 - $3.5
105 GPT-5.4 nano (medium) 30.2 - $0.463
106 Mistral Medium 3.5 29.9 46.9 $3
107 GPT-5.4 mini (medium) 29.8 - $1.688
108 Step 3.7 Flash 29.7 37.3 $0.438
109 Claude 4.5 Haiku (Reasoning) 29.6 43.9 $2
110 Gemma 4 31B (Reasoning) 29.4 43.4 $0
111 Kimi K2.5 (Non-reasoning) 29.4 - $1.2
112 Command A+ 29.3 - $0
113 Qwen3.6 27B (Non-reasoning) 29.3 - $1.35
114 Claude 4.5 Sonnet (Non-reasoning) 29.3 - $6
115 Qwen3.5 35B A3B (Reasoning) 29.3 - $0.688
116 Qwen3.5 27B (Non-reasoning) 29.3 - $0.875
117 GPT-5.5 Instant (June 2026) 28.9 39.4 $11.25
118 DeepSeek V4 Flash (Non-reasoning) 28.7 - $0.175
119 JT-35B-Flash 28.4 - $0
120 MiniMax-M2 28.3 - $0.525
121 Claude 4.1 Opus (Non-reasoning) 28.2 - $30
122 Qwen3.5 122B A10B (Non-reasoning) 28.1 - $1.1
123 MiMo-V2.5-Pro (Non-reasoning) 27.9 - $1.35
124 GPT-5.4 (Non-reasoning) 27.7 - $5.906
125 Gemini 3 Flash Preview (Non-reasoning) 27.4 - $1.125
126 Grok 4 Fast (Reasoning) 27.4 - $0.275
127 Claude 3.7 Sonnet (Reasoning) 27.1 36.4 $0
128 GLM-4.7 (Non-reasoning) 26.6 - $1
129 DeepSeek V3.1 Terminus (Reasoning) 26.3 - $1.914
130 Hy3-preview (Non-reasoning) 26.1 - $0.2
131 Ling-2.6-1T 26.1 - $0.85
132 Step 3.5 Flash 2603 26 - $0.15
133 Doubao Seed Code 26 - $0
134 GPT-5.2 (Non-reasoning) 26 - $4.813
135 Gemini 2.5 Pro 25.8 33.3 $3.438
136 Gemma 4 26B A4B (Reasoning) 25.7 39.3 $0.198
137 o4-mini (high) 25.6 - $1.925
138 Claude 4 Sonnet (Non-reasoning) 25.5 - $6
139 Claude 4 Opus (Non-reasoning) 25.5 - $30
140 Step 3.5 Flash 25.5 - $0.15
141 NVIDIA Nemotron 3 Super 120B A12B (Reasoning) 25.4 37.7 $0.412
142 DeepSeek V3.2 Exp (Reasoning) 25.4 - $0.31
143 Mercury 2 25.3 - $0.375
144 GLM-4.6 (Reasoning) 25.1 - $0.963
145 Gemini 3.1 Flash-Lite 25 34.7 $0.563
146 Qwen3.5 9B (Reasoning) 25 28.7 $0.113
147 Qwen3 Max Thinking (Preview) 25 - $2.4
148 Gemma 4 31B (Non-reasoning) 24.8 - $0.205
149 Grok 4.3 (Non-reasoning) 24.8 35.2 $1.563
150 K-EXAONE (Reasoning) 24.7 - $0
151 MiMo-V2-Flash (Non-reasoning) 24.7 49.8 $0.15
152 DeepSeek V3.2 (Non-reasoning) 24.7 - $0.775
153 Trinity Large Thinking 24.5 - $0.395
154 Qwen3.6 35B A3B (Non-reasoning) 24.2 - $0.844
155 Qwen3 Max 24 - $3.047
156 gpt-oss-120b (high) 23.8 30.4 $0.262
157 Gemini 2.5 Flash Preview (Sep '25) (Reasoning) 23.8 - $0
158 Claude 4.5 Haiku (Non-reasoning) 23.7 - $2
159 Claude 3.7 Sonnet (Non-reasoning) 23.5 - $6
160 Kimi K2 0905 23.5 - $1.075
161 Qwen3.5 35B A3B (Non-reasoning) 23.4 - $0.688
162 o1 23.4 39.7 $26.25
163 EXAONE 4.5 33B 23 - $0
164 Gemini 2.5 Pro Preview (Mar' 25) 23 46.7 $0
165 GLM-4.6 (Non-reasoning) 23 - $1
166 GLM-4.7-Flash (Reasoning) 22.9 - $0.153
167 Grok 3 mini Reasoning (high) 22.5 - $0.35
168 Grok 4.20 0309 (Non-reasoning) 22.5 - $3
169 Gemini 2.5 Pro Preview (May' 25) 22.3 - $3.438
170 Qwen3 235B A22B 2507 (Reasoning) 22.3 - $0.838
171 DeepSeek V3.2 Speciale 22.2 - $0
172 HyperNova 60B 2605 22.1 - $0.065
173 Gemma 4 12B (Reasoning) 22 - $0.15
174 ERNIE 5.0 Thinking Preview 21.9 - $0
175 Nova 2.0 Pro Preview (medium) 21.8 34 $3.438
176 Grok 4.20 0309 v2 (Non-reasoning) 21.8 - $3
177 Grok Code Fast 1 21.6 - $0
178 DeepSeek V3.1 Terminus (Non-reasoning) 21.4 - $0.453
179 Nemotron Cascade 2 30B A3B 21.3 - $0
180 DeepSeek V3.2 Exp (Non-reasoning) 21.3 - $0.31
181 Qwen3 Coder Next 21.2 - $0.563
182 Apriel-v1.5-15B-Thinker 21.2 - $0
183 DeepSeek V3.1 (Non-reasoning) 21 - $0.834
184 Nova 2.0 Omni (medium) 20.9 - $0.85
185 Mistral Small 4 (Reasoning) 20.8 - $0.262
186 DeepSeek V3.1 (Reasoning) 20.7 - $0.865
187 North Mini Code 20.6 - $0
188 Qwen3 VL 235B A22B (Reasoning) 20.6 - $2.174
189 Apriel-v1.6-15B-Thinker 20.5 - $0
190 GPT-5.1 (Non-reasoning) 20.4 - $3.438
191 Qwen3.5 9B (Non-reasoning) 20.3 23.5 $0
192 Gemma 4 26B A4B (Non-reasoning) 20.1 - $0.198
193 Qwen3.5 4B (Reasoning) 20.1 22.6 $0.06
194 Gemini 2.5 Flash (Reasoning) 20.1 - $0.85
195 DeepSeek R1 0528 (May '25) 20.1 - $2.063
196 GPT-5 nano (high) 19.9 - $0.138
197 Qwen3 Next 80B A3B (Reasoning) 19.8 - $1.875
198 Nova 2.0 Pro Preview (low) 19.6 25.9 $3.438
199 GLM-4.5 (Reasoning) 19.5 - $1
200 GPT-4.1 19.4 - $3.5
201 Kimi K2 19.4 - $1.039
202 Ling 2.6 Flash 19.3 - $0.15
203 Devstral 2 19.2 31.3 $0
204 Qwen3 Max (Preview) 19.2 - $2.4
205 Nova 2.0 Lite (medium) 19 - $0.85
206 Qwen3.5 Omni Flash 19 - $0.275
207 GPT-5 nano (medium) 19 - $0.138
208 o3-mini 19 - $1.925
209 o1-pro 18.9 - $262.5
210 Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) 18.8 - $0
211 JT-MINI 18.5 - $0
212 o3-mini (high) 18.4 42.1 $1.925
213 Grok 3 18.4 - $8
214 Seed-OSS-36B-Instruct 18.3 - $0.3
215 Nova 2.0 Lite (high) 18.2 23 $0.85
216 Qwen3 235B A22B 2507 Instruct 18.2 - $0.356
217 Qwen3 Coder 480B A35B Instruct 18 - $0.675
218 Magistral Medium 1.2 17.9 21.3 $2.75
219 Qwen3 VL 32B (Reasoning) 17.9 - $2.625
220 Nova 2.0 Lite (low) 17.8 - $0.85
221 Sonar Reasoning Pro 17.8 - $0
222 gpt-oss-120b (low) 17.7 - $0.262
223 MiniMax M1 80k 17.7 - $0.963
224 GPT-5.4 nano (Non-Reasoning) 17.6 - $0.463
225 Gemini 2.5 Flash Preview (Reasoning) 17.5 - $0
226 K2 Think V2 17.4 21 $0
227 LongCat Flash Lite 17.2 - $0
228 GPT-5 (minimal) 17.2 - $3.438
229 HyperCLOVA X SEED Think (32B) 17 - $0
230 o1-preview 17 34 $28.875
231 Grok 4.1 Fast (Non-reasoning) 16.9 - $0
232 GLM-4.6V (Reasoning) 16.8 - $0.45
233 K-EXAONE (Non-reasoning) 16.7 - $0
234 GPT-5.4 mini (Non-Reasoning) 16.6 - $1.688
235 Nova 2.0 Omni (low) 16.6 - $0.85
236 Grok 4 Fast (Non-reasoning) 16.5 - $0.275
237 GLM-4.5-Air 16.5 - $0.372
238 Mi:dm K 2.5 Pro 16.4 - $0
239 GPT-4.1 mini 16.3 - $0.7
240 Ring-1T 16.2 - $0
241 Qwen3.5 4B (Non-reasoning) 16 20.3 $0.06
242 Mistral Large 3 15.9 20.1 $0.75
243 Qwen3 30B A3B 2507 (Reasoning) 15.8 - $0.673
244 DeepSeek V3 0324 15.7 - $1.209
245 INTELLECT-3 15.6 - $0
246 GLM-4.7-Flash (Non-reasoning) 15.5 - $0.153
247 GPT-5 (ChatGPT) 15.3 - $3.438
248 Solar Open 100B (Reasoning) 15.1 - $0
249 Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) 15.1 - $0.175
250 Grok 3 Reasoning Beta 15.1 - $0
251 gpt-oss-20B (high) 14.9 20.7 $0.088
252 Nemotron 3 Nano Omni 30B A3B Reasoning 14.9 - $0.131
253 Mistral Medium 3.1 14.8 - $0.8
254 Nova 2.0 Pro Preview (Non-reasoning) 14.4 20.9 $3.438
255 MiniMax M1 40k 14.4 - $0
256 gpt-oss-20B (low) 14.3 - $0.095
257 Llama 4 Maverick 14.3 16.3 $0.475
258 GPT-5 mini (minimal) 14.3 - $0.688
259 Qwen3 VL 235B A22B Instruct 14.3 - $0.7
260 NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) 14.2 14.4 $0.096
261 K2-V2 (high) 14.2 - $0
262 Solar Pro 3 14.1 16.2 $0
263 Gemini 2.5 Flash (Non-reasoning) 14.1 - $0.85
264 o1-mini 14 - $0
265 Qwen3 Next 80B A3B Instruct 13.7 - $0.875
266 Tri-21B-think Preview 13.6 - $0
267 GPT-4.5 (Preview) 13.6 - $0
268 Qwen3 Coder 30B A3B Instruct 13.6 - $0.352
269 Qwen3 235B A22B (Reasoning) 13.4 - $2.625
270 QwQ 32B 13.4 - $0.745
271 Gemini 2.0 Flash Thinking Experimental (Jan '25) 13.3 24.1 $0
272 Qwen3 VL 30B A3B (Reasoning) 13.3 - $0.338
273 Gemma 4 12B (Non-reasoning) 13.2 - $0.15
274 Devstral Small 2 13.1 29.3 $0
275 Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) 13.1 - $0.175
276 Motif-2-12.7B-Reasoning 12.8 - $0
277 Ling-1T 12.8 - $0
278 Nova Premier 12.7 - $5
279 DeepSeek R1 (Jan '25) 12.6 - $2.431
280 Gemma 4 E4B (Reasoning) 12.5 - $0
281 Magistral Medium 1 12.5 - $0
282 Mistral Medium 3 12.5 - $0.8
283 Solar Pro 2 (Preview) (Reasoning) 12.5 - $0
284 Mistral Small 4 (Non-reasoning) 12.4 - $0.262
285 Llama Nemotron Super 49B v1.5 (Reasoning) 12.4 - $0.175
286 K2-V2 (medium) 12.4 - $0
287 Tri-21B-Think 12.4 - $0
288 Devstral Medium 12.4 - $0.8
289 GPT-4o (March 2025, chatgpt-4o-latest) 12.3 - $0
290 Gemini 2.0 Flash (Feb '25) 12.3 - $0.262
291 Claude 3.5 Haiku 12.3 15.9 $1.6
292 Llama 3.3 Nemotron Super 49B v1 (Reasoning) 12.2 - $0
293 MiniCPM5-1B (Reasoning) 12 - $0
294 Qwen3 4B 2507 (Reasoning) 12 - $0
295 Sarvam 105B (high) 11.9 - $0.074
296 Nova 2.0 Lite (Non-reasoning) 11.8 - $0.85
297 Gemini 2.0 Pro Experimental (Feb '25) 11.8 25.5 $0
298 Claude 3 Opus 11.8 19.5 $30
299 Devstral Small (May '25) 11.8 - $0
300 MiniCPM5-1B (Non-reasoning) 11.7 - $0
301 Gemini 2.5 Flash Preview (Non-reasoning) 11.7 - $0
302 Sonar Reasoning 11.7 - $0
303 Qwen3 32B (Reasoning) 11.5 15.3 $0.276
304 Gemini 2.5 Flash-Lite (Reasoning) 11.4 - $0.175
305 Magistral Small 1.2 11.3 14.7 $0.75
306 GPT-4o (Nov '24) 11.2 - $4.375
307 Ministral 3 14B 11.1 14.4 $0.2
308 Qwen3 VL 32B Instruct 11.1 - $1.225
309 DeepSeek R1 Distill Qwen 32B 11 - $0
310 GLM-4.6V (Non-reasoning) 11 - $0.45
311 Qwen3 235B A22B (Non-reasoning) 10.9 - $0.787
312 Gemini 2.0 Flash (experimental) 10.7 - $0
313 Magistral Small 1 10.7 - $0
314 EXAONE 4.0 32B (Reasoning) 10.6 - $0
315 Qwen3 VL 8B (Reasoning) 10.6 - $0.66
316 Nova 2.0 Omni (Non-reasoning) 10.5 - $0.85
317 DeepSeek V3 (Dec '24) 10.4 - $0.523
318 DeepSeek R1 0528 Qwen3 8B 10.4 - $0
319 Qwen3.5 2B (Reasoning) 10.2 19.7 $0.04
320 Qwen2.5 Max 10.2 - $2.8
321 Nanbeige4.1-3B 10.1 - $0
322 Qwen3 14B (Reasoning) 10.1 - $0.731
323 Llama 4 Scout 10 8.2 $0.292
324 Hermes 4 - Llama-3.1 70B (Reasoning) 10 - $0.198
325 Gemini 1.5 Pro (Sep '24) 10 23.6 $0
326 Solar Pro 2 (Preview) (Non-reasoning) 10 - $0
327 Qwen3 VL 30B A3B Instruct 10 - $0.3
328 Claude 3.5 Sonnet (Oct '24) 9.9 30.2 $6
329 DeepSeek R1 Distill Llama 70B 9.9 - $0.787
330 Falcon-H1R-7B 9.8 - $0
331 DeepSeek R1 Distill Qwen 14B 9.8 - $0
332 Ling-flash-2.0 9.7 - $0.247
333 Qwen3 Omni 30B A3B (Reasoning) 9.6 - $0.43
334 GPT-4o (Aug '24) 9.6 - $4.375
335 Qwen2.5 Instruct 72B 9.6 - $0.37
336 Step3 VL 10B 9.5 - $0
337 Sonar 9.5 - $0
338 Gemma 4 E2B (Reasoning) 9.3 - $0
339 Devstral Small (Jul '25) 9.3 - $0.15
340 Sonar Pro 9.3 - $0
341 Qwen3 30B A3B (Reasoning) 9.3 - $0.18
342 QwQ 32B-Preview 9.2 - $0
343 Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) 9.1 - $0.9
344 Mistral Large 2 (Nov '24) 9.1 - $3
345 Mistral Small 3.2 9.1 - $0.128
346 GLM-4.5V (Reasoning) 9.1 - $0.9
347 Qwen3 30B A3B 2507 Instruct 9.1 - $0.213
348 Ministral 3 8B 9 9.7 $0.15
349 Solar Pro 2 (Reasoning) 9 - $0
350 NVIDIA Nemotron Nano 12B v2 VL (Reasoning) 9 - $0.3
351 Hermes 4 - Llama-3.1 405B (Reasoning) 9 - $1.5
352 ERNIE 4.5 300B A47B 9 - $0.485
353 Gemma 4 E4B (Non-reasoning) 8.9 - $0
354 Granite 4.1 30B 8.9 10.4 $0
355 NVIDIA Nemotron 3 Nano 4B 8.8 - $0
356 NVIDIA Nemotron Nano 9B V2 (Reasoning) 8.8 - $0.07
357 Hermes 4 - Llama-3.1 405B (Non-reasoning) 8.8 - $1.5
358 Qwen3.5 2B (Non-reasoning) 8.8 17.4 $0.04
359 Gemini 2.0 Flash-Lite (Feb '25) 8.8 - $0
360 Llama Nemotron Super 49B v1.5 (Non-reasoning) 8.7 - $0.175
361 Llama 3.3 Instruct 70B 8.6 - $0.612
362 K2-V2 (low) 8.6 - $0
363 GPT-4o (May '24) 8.6 24.2 $7.5
364 Gemini 2.0 Flash-Lite (Preview) 8.6 - $0
365 Mistral Small 3.1 8.6 - $0.138
366 Qwen3 32B (Non-reasoning) 8.6 - $0.26
367 Llama 3.1 Instruct 405B 8.5 - $3.688
368 Kimi Linear 48B A3B Instruct 8.5 - $0
369 Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) 8.5 - $0
370 Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) 8.5 - $0
371 Qwen3 4B (Reasoning) 8.4 - $0.398
372 Qwen3 VL 8B Instruct 8.4 - $0.31
373 LFM2.5-8B-A1B 8.3 - $0
374 Claude 3.5 Sonnet (June '24) 8.3 26 $6
375 Llama 3.1 Tulu3 405B 8.3 - $0
376 Ring-flash-2.0 8.2 - $0.247
377 GPT-4o (ChatGPT) 8.2 - $0
378 Olmo 3.1 32B Think 8.1 - $0
379 Pixtral Large 8.1 - $3
380 GPT-5 nano (minimal) 8 - $0.138
381 Gemini 1.5 Flash (Sep '24) 8 - $0
382 Grok 2 (Dec '24) 8 - $0
383 GPT-4 Turbo 7.9 21.5 $15
384 Qwen3 VL 4B (Reasoning) 7.9 - $0
385 Solar Pro 2 (Non-reasoning) 7.8 - $0
386 Command A 7.7 - $4.375
387 Nova Pro 7.7 - $1.4
388 Llama 3.1 Nemotron Instruct 70B 7.6 - $1.2
389 Grok Beta 7.5 - $0
390 Qwen2.5 Instruct 32B 7.5 - $0
391 NVIDIA Nemotron Nano 9B V2 (Non-reasoning) 7.4 - $0.086
392 NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) 7.4 - $0.088
393 Gemma 3 27B Instruct 7.4 10.1 $0.145
394 Qwen3 8B (Reasoning) 7.4 - $0.37
395 GPT-4.1 nano 7.3 - $0.175
396 Mistral Large 2 (Jul '24) 7.3 - $3
397 Qwen2.5 Coder Instruct 32B 7.1 - $0
398 Qwen3 4B 2507 Instruct 7.1 - $0
399 GPT-4 7 13.1 $37.5
400 GLM-4.5V (Non-reasoning) 7 - $0.9
401 Qwen3 14B (Non-reasoning) 7 - $0.381
402 Hermes 4 - Llama-3.1 70B (Non-reasoning) 6.9 - $0.198
403 GPT-4o mini 6.9 11.4 $0.262
404 Gemini 2.5 Flash-Lite (Non-reasoning) 6.9 - $0.175
405 Mistral Small 3 6.9 - $0.104
406 Nova Lite 6.9 - $0.105
407 Ministral 3 3B 6.8 4.8 $0.1
408 Llama 3.1 Instruct 70B 6.8 - $0.56
409 DeepSeek-V2.5 (Dec '24) 6.8 - $0
410 Qwen3 4B (Non-reasoning) 6.8 - $0.188
411 Qwen3 30B A3B (Non-reasoning) 6.8 - $0.133
412 Granite 4.1 8B 6.7 9.5 $0.063
413 Sarvam 30B (high) 6.6 - $0.047
414 Gemini 2.0 Flash Thinking Experimental (Dec '24) 6.6 - $0
415 DeepSeek-V2.5 6.6 - $0
416 Olmo 3.1 32B Instruct 6.5 - $0
417 Gemma 4 E2B (Non-reasoning) 6.4 - $0
418 Mistral Saba 6.4 - $0
419 DeepSeek R1 Distill Llama 8B 6.4 - $0
420 Olmo 3 32B Think 6.4 - $0
421 R1 1776 6.3 - $0
422 Gemini 1.5 Pro (May '24) 6.3 19.8 $0
423 Reka Flash (Sep '24) 6.3 - $0.35
424 Qwen2.5 Turbo 6.3 - $0.088
425 Llama 3.2 Instruct 90B (Vision) 6.2 - $1.38
426 Solar Mini 6.2 - $0.15
427 Llama 3.1 Instruct 8B 6.1 5.4 $0.1
428 Grok-1 6 - $0
429 EXAONE 4.0 32B (Non-reasoning) 6 - $0
430 Qwen2 Instruct 72B 6 - $0
431 Gemini 1.5 Flash-8B 5.5 - $0
432 DeepHermes 3 - Mistral 24B Preview (Non-reasoning) 5.3 - $0
433 Jamba 1.7 Large 5.3 - $3.5
434 Granite 4.0 H Small 5.2 - $0.107
435 Qwen3 Omni 30B A3B Instruct 5.1 - $0.43
436 DeepSeek-Coder-V2 5.1 - $0
437 Hermes 3 - Llama-3.1 70B 5.1 - $0.3
438 Jamba 1.5 Large 5.1 - $3.5
439 Qwen3 8B (Non-reasoning) 5.1 - $0.185
440 Qwen3.5 0.8B (Reasoning) 5 15 $0.02
441 OLMo 2 32B 5 - $0
442 Jamba 1.6 Large 5 - $3.5
443 Phi-4 4.9 - $0.219
444 LFM2 24B A2B 4.9 - $0.052
445 Gemini 1.5 Flash (May '24) 4.9 - $0
446 Nova Micro 4.7 - $0.061
447 Claude 3 Sonnet 4.7 - $6
448 Mistral Small (Sep '24) 4.7 - $0.3
449 NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) 4.6 - $0.3
450 Gemma 3n E4B Instruct Preview (May '25) 4.6 - $0
451 Gemini 1.0 Ultra 4.6 17.6 $0
452 Phi-3 Mini Instruct 3.8B 4.6 - $0
453 Phi-4 Multimodal Instruct 4.5 - $0
454 Qwen2.5 Coder Instruct 7B 4.5 - $0
455 Qwen3.5 0.8B (Non-reasoning) 4.4 21.9 $0.02
456 Mixtral 8x22B Instruct 4.4 - $0
457 Mistral Large (Feb '24) 4.4 - $6
458 Llama 2 Chat 7B 4.3 - $0.1
459 MiniCPM-V 4.6 1.3B 4.2 0.7 $0
460 Llama 3.2 Instruct 3B 4.2 - $0.15
461 Reka Flash 3 4.1 - $0.35
462 Jamba Reasoning 3B 4.1 - $0
463 Qwen1.5 Chat 110B 4.1 - $0
464 Qwen3 VL 4B Instruct 4.1 - $0
465 Olmo 3 7B Think 4 - $0
466 Claude 3 Haiku 3.9 - $0.5
467 Claude 2.1 3.9 14 $0
468 OLMo 2 7B 3.9 - $0
469 Molmo 7B-D 3.8 - $0
470 Ling-mini-2.0 3.8 - $0
471 DeepSeek R1 Distill Qwen 1.5B 3.7 - $0
472 GPT-3.5 Turbo 3.6 10.7 $0.75
473 Claude 2.0 3.6 12.9 $0
474 Mistral Small (Feb '24) 3.6 - $1.5
475 Mistral Medium 3.6 - $4.088
476 DeepSeek-V2-Chat 3.6 - $0
477 Llama 3 Instruct 70B 3.5 - $1.175
478 Gemma 3 12B Instruct 3.4 - $0.14
479 LFM 40B 3.4 - $0
480 Arctic Instruct 3.4 - $0
481 Qwen Chat 72B 3.4 - $0
482 Llama 3.2 Instruct 11B (Vision) 3.3 - $0.245
483 Granite 4.1 3B 3.2 - $0
484 PALM-2 3.2 4.6 $0
485 Gemini 1.0 Pro 3.1 - $0
486 DeepSeek Coder V2 Lite Instruct 3.1 - $0
487 Phi-4 Mini Instruct 3 - $0
488 Llama 2 Chat 70B 3 - $0
489 Llama 2 Chat 13B 3 - $0
490 DeepSeek LLM 67B Chat (V1) 3 - $0
491 OpenChat 3.5 (1210) 3 - $0
492 DBRX Instruct 3 - $0
493 Sarvam M (Reasoning) 3 - $0
494 Command-R+ (Apr '24) 3 - $6
495 Exaone 4.0 1.2B (Reasoning) 2.9 - $0
496 Olmo 3 7B Instruct 2.8 - $0.125
497 Exaone 4.0 1.2B (Non-reasoning) 2.8 - $0
498 LFM2 2.6B 2.7 - $0
499 LFM2.5-1.2B-Thinking 2.7 - $0
500 LFM2.5-1.2B-Instruct 2.7 - $0
501 Granite 4.0 H 1B 2.7 - $0
502 Jamba 1.7 Mini 2.7 - $0
503 Jamba 1.5 Mini 2.7 - $0.25
504 Jamba 1.6 Mini 2.6 - $0.25
505 Qwen3 1.7B (Reasoning) 2.6 - $0.398
506 Gemma 3 270M 2.4 - $0
507 Granite 4.0 Micro 2.4 - $0
508 Apertus 70B Instruct 2.4 - $1.345
509 Mixtral 8x7B Instruct 2.4 - $0.512
510 DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) 2.3 - $0
511 Llama 65B 2.1 - $0
512 Qwen Chat 14B 2.1 - $0
513 Granite 4.0 1B 2.1 - $0
514 Claude Instant 2.1 7.8 $0
515 Mistral 7B Instruct 2.1 - $0.206
516 Command-R (Mar '24) 2.1 - $0.75
517 Molmo2-8B 2 - $0
518 LFM2 8B A1B 1.8 - $0
519 Granite 3.3 8B (Non-reasoning) 1.8 - $0.085
520 Qwen3 1.7B (Non-reasoning) 1.5 - $0.188
521 Qwen3 0.6B (Reasoning) 1.3 - $0.398
522 Llama 3 Instruct 8B 1.2 - $0.07
523 Gemma 3n E4B Instruct 1.2 - $0.025
524 Llama 3.2 Instruct 1B 1.1 - $0.05
525 Gemma 3 4B Instruct 1.1 - $0.05
526 LFM2 1.2B 1.1 - $0
527 LFM2.5-VL-1.6B 1 - $0
528 Granite 4.0 350M 1 - $0
529 Granite 4.0 H 350M 1 - $0
530 Apertus 8B Instruct 1 - $0.125
531 Tiny Aya Global 1 - $0
532 Gemma 3n E2B Instruct 1 - $0
533 Gemma 3 1B Instruct 1 - $0
534 Qwen3 0.6B (Non-reasoning) 1 - $0.188
535 GPT-5.5 Pro (xhigh) - - $0
536 Gemini 3 Deep Think - - $0
537 EXAONE 4.5 33B (Non-reasoning) - - $0
538 Cogito v2.1 (Reasoning) - - $1.25
539 Mi:dm K 2.5 Pro Preview - - $0
540 GPT-4o mini Realtime (Dec '24) - - $0
541 GPT-5.4 Pro (xhigh) - - $67.5
542 GPT-3.5 Turbo (0613) - - $0
543 GPT-4o Realtime (Dec '24) - - $0

榜单解读建议

参考 AI 大模型排行榜 时,应综合考虑“综合指数”与“成本价格”。如果您是开发者,编程能力 (Coding) 是更核心的指标。

值品工具箱同步的 AI 大模型排行榜 数据每 24 小时更新,确保您获取到最新的模型性能对比。

指标说明

  • 综合指数:评估通用理解与逻辑。
  • 价格 $/1M:混合 3:1 输入输出比的平均成本。
  • 编程能力:衡量代码生成的准确性。

AI 大模型排行榜 常见问题 (FAQ)

Q1: AI 大模型排行榜 的数据多久更新?

AI 大模型排行榜 数据每 24 小时自动抓取一次,确保最新模型加入列表。

Q2: 这个 AI 大模型排行榜 包含国产模型吗?

是的,只要国产模型通过了 Artificial Analysis 的全球测评,就会出现在 AI 大模型排行榜 中。

Q3: 综合指数在 AI 大模型排行榜 中代表什么?

它代表模型的全能表现。AI 大模型排行榜 通过加权算法给出这个综合评分。

Q4: 如何在 AI 大模型排行榜 中查找性价比最高的游戏?

在 AI 大模型排行榜 页面中,您可以点击“价格”标题进行排序,寻找低价高分的模型。

Q5: AI 大模型排行榜 的编程能力测试准吗?

AI 大模型排行榜 参考了 LiveCodeBench 等权威基准测试,具有极高的参考价值。

Q6: 为什么有的新模型没进入 AI 大模型排行榜?

模型进入 AI 大模型排行榜 需要经过一系列测试,通常在新模型发布后数日内会完成更新。

Q7: AI 大模型排行榜 中的价格计算标准是什么?

价格是基于百万 Token 的调用成本,由 AI 大模型排行榜 统一混合计算得出。

Q8: 手机上能查看 AI 大模型排行榜 吗?

当然可以。AI 大模型排行榜 进行了移动端响应式深度优化。

Q9: AI 大模型排行榜 这个工具免费吗?

是的,由值品工具箱免费提供 AI 大模型排行榜 信息查询服务。

Q10: 我该怎么利用 AI 大模型排行榜 做选型?

如果您需要智能客服,参考 AI 大模型排行榜 的综合指数;如果做翻译,参考编程外的语言指标。

发表评论

请友善文明留言