General leaderboard using the Artificial Analysis Intelligence Index.
The domain score averages each model's relative percentile across the included metrics.
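The page does not give the exact formula, so the following is only a minimal sketch of one way an "average of relative percentiles" could be computed. It assumes a model's percentile on a metric is the share of other models scoring strictly below it, that higher raw values are better, and that all metrics are weighted equally. The names `percentile_ranks`, `domain_scores`, and the sample values are hypothetical and are not taken from the leaderboard.

```python
from statistics import mean


def percentile_ranks(values):
    """Map each model to a 0-100 percentile for one metric.

    Assumption: percentile = share of the *other* models scoring strictly
    lower, and a higher raw value is better.
    """
    n = len(values)
    if n == 1:
        return {name: 100.0 for name in values}
    ranks = {}
    for name, value in values.items():
        below = sum(1 for other in values.values() if other < value)
        ranks[name] = 100.0 * below / (n - 1)
    return ranks


def domain_scores(metrics):
    """Average each model's percentile over the metrics it appears in."""
    per_model = {}
    for metric_values in metrics.values():
        for name, pct in percentile_ranks(metric_values).items():
            per_model.setdefault(name, []).append(pct)
    return {name: round(mean(pcts), 1) for name, pcts in per_model.items()}


# Hypothetical raw metric values, for illustration only.
metrics = {
    "metric_a": {"model_x": 70.0, "model_y": 60.0, "model_z": 50.0},
    "metric_b": {"model_x": 0.90, "model_y": 0.95, "model_z": 0.50},
    "metric_c": {"model_x": 10.0, "model_y": 8.0, "model_z": 9.0},
}
scores = domain_scores(metrics)
print(scores)
```

Under these assumptions a model only reaches 100 by leading every included metric, so the score reflects relative standing among the listed models rather than an absolute capability level.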
| Rank | Model | Creator | Domain Score | Speed | Blended Price |
|---|---|---|---|---|---|
| #1 | Claude Opus 4.8 (Adaptive Reasoning, Max Effort) | Anthropic | 100.0 | 66.3 tok/s | $10.94/M |
| #2 | GPT-5.5 (xhigh) | OpenAI | 99.8 | 62.4 tok/s | $11.25/M |
| #3 | | OpenAI | 99.6 | 62.3 tok/s | $11.25/M |
| #4 | Claude Opus 4.7 (Adaptive Reasoning, Max Effort) | Anthropic | 99.4 | 61.8 tok/s | $10.94/M |
| #5 | Gemini 3.1 Pro Preview | Google | 99.2 | 135.9 tok/s | $4.50/M |
| #6 | GPT-5.4 (xhigh) | OpenAI | 99.0 | 75.5 tok/s | $5.63/M |
| #7 | GPT-5.5 (medium) | OpenAI | 98.9 | 59.2 tok/s | $11.25/M |
| #8 | Qwen3.7 Max | Alibaba | 98.7 | 186.5 tok/s | $3.75/M |
| #9 | Gemini 3.5 Flash (high) | Google | 98.5 | 212.4 tok/s | $3.38/M |
| #10 | Gemini 3.5 Flash (medium) | Google | 98.3 | 210.2 tok/s | $3.38/M |
| #11 | MiniMax-M3 | MiniMax | 98.1 | 45.6 tok/s | $0.525/M |
| #12 | Kimi K2.6 | Kimi | 97.9 | 41.6 tok/s | $1.71/M |
| #13 | MiMo-V2.5-Pro | Xiaomi | 97.7 | 43.3 tok/s | $0.544/M |
| #14 | GPT-5.3 Codex (xhigh) | OpenAI | 97.5 | 98.6 tok/s | $4.81/M |
| #15 | Qwen3.7 Plus | Alibaba | 97.3 | 53.6 tok/s | $0.590/M |
| #16 | Grok 4.3 (high) | xAI | 97.1 | 237.5 tok/s | $1.56/M |
| #17 | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 96.9 | 47.3 tok/s | $10.94/M |
| #18 | Muse Spark | Meta | 96.7 | n/a | - |
| #19 | Claude Opus 4.7 (Non-reasoning, High Effort) | Anthropic | 96.6 | 48.1 tok/s | $10.94/M |
| #20 | Qwen3.6 Max Preview | Alibaba | 96.4 | 40.9 tok/s | $2.93/M |
| #21 | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 96.2 | 70.4 tok/s | $6.56/M |
| #22 | DeepSeek V4 Pro (Reasoning, Max Effort) | DeepSeek | 96.0 | 61.8 tok/s | $0.544/M |
| #23 | GLM-5.1 (Reasoning) | Z AI | 95.8 | 46.8 tok/s | $2.15/M |
| #24 | GPT-5.2 (xhigh) | OpenAI | 95.6 | 71 tok/s | $4.81/M |
| #25 | GPT-5.5 (low) | OpenAI | 95.4 | 60.2 tok/s | $11.25/M |
| #26 | Qwen3.6 Plus | Alibaba | 95.2 | 52.8 tok/s | $1.13/M |
| #27 | DeepSeek V4 Pro (Reasoning, High Effort) | DeepSeek | 95.0 | 57.6 tok/s | $0.544/M |
| #28 | GLM-5 (Reasoning) | Z AI | 94.8 | 79.5 tok/s | $1.55/M |
| #29 | Claude Opus 4.5 (Reasoning) | Anthropic | 94.6 | 53.5 tok/s | $10.94/M |
| #30 | MiniMax-M2.7 | MiniMax | 94.4 | 75 tok/s | $0.525/M |
| #31 | Grok 4.20 0309 v2 (Reasoning) | xAI | 94.3 | 168.7 tok/s | $3.00/M |
| #32 | MiMo-V2-Pro | Xiaomi | 94.1 | 42.5 tok/s | $1.50/M |
| #33 | GPT-5.2 Codex (xhigh) | OpenAI | 93.9 | 105.3 tok/s | $4.81/M |
| #34 | MiMo-V2.5 | Xiaomi | 93.7 | 77.4 tok/s | $0.175/M |
| #35 | GPT-5.4 mini (xhigh) | OpenAI | 93.5 | 176 tok/s | $1.69/M |
| #36 | Grok 4.3 (medium) | xAI | 93.3 | 197.6 tok/s | $1.56/M |
| #37 | Grok 4.20 0309 (Reasoning) | xAI | 93.1 | 166.5 tok/s | $3.00/M |
| #38 | Gemini 3 Pro Preview (high) | Google | 92.9 | n/a | $4.50/M |
| #39 | GPT-5.4 (low) | OpenAI | 92.7 | 63.6 tok/s | $5.63/M |
| #40 | GPT-5.1 (high) | OpenAI | 92.5 | 121.2 tok/s | $3.44/M |
| #41 | GLM-5-Turbo | Z AI | 92.3 | n/a | - |
| #42 | Kimi K2.5 (Reasoning) | Kimi | 92.1 | 31.7 tok/s | $1.19/M |
| #43 | GPT-5.2 (medium) | OpenAI | 92.0 | n/a | $4.81/M |
| #44 | Claude Opus 4.6 (Non-reasoning, High Effort) | Anthropic | 91.8 | 40.9 tok/s | $10.94/M |
| #45 | DeepSeek V4 Flash (Reasoning, Max Effort) | DeepSeek | 91.6 | 107.8 tok/s | $0.175/M |
| #46 | Gemini 3 Flash Preview (Reasoning) | Google | 91.4 | 172.8 tok/s | $1.13/M |
| #47 | DeepSeek V4 Flash (Reasoning, High Effort) | DeepSeek | 91.2 | n/a | $0.175/M |
| #48 | Qwen3.6 27B (Reasoning) | Alibaba | 91.0 | 54.7 tok/s | $1.35/M |
| #49 | Qwen3.5 397B A17B (Reasoning) | Alibaba | 90.8 | 51.8 tok/s | $1.35/M |
| #50 | MiMo-V2-Omni-0327 | Xiaomi | 90.6 | 85.6 tok/s | $0.800/M |
| #51 | GPT-5 (high) | OpenAI | 90.4 | 111.1 tok/s | $3.44/M |
| #52 | GPT-5 Codex (high) | OpenAI | 90.2 | 171.1 tok/s | $3.44/M |
| #53 | Claude Sonnet 4.6 (Non-reasoning, High Effort) | Anthropic | 90.0 | 57.9 tok/s | $6.56/M |
| #54 | GPT-5.4 nano (xhigh) | OpenAI | 89.8 | 158.3 tok/s | $0.463/M |
| #55 | Grok 4.3 (low) | xAI | 89.7 | 154.5 tok/s | $1.56/M |
| #56 | GLM-5.1 (Non-reasoning) | Z AI | 89.5 | 45.6 tok/s | $2.15/M |
| #57 | KAT Coder Pro V2 | KwaiKAT | 89.3 | 118.1 tok/s | $0.525/M |
| #58 | Qwen3.6 35B A3B (Reasoning) | Alibaba | 89.1 | 159.9 tok/s | $0.557/M |
| #59 | MiMo-V2-Omni | Xiaomi | 88.9 | 81.5 tok/s | - |
| #60 | Gemini 3.5 Flash (minimal) | Google | 88.7 | 199.1 tok/s | $3.38/M |
| #61 | Claude Opus 4.5 (Non-reasoning) | Anthropic | 88.5 | 47.6 tok/s | $10.94/M |
| #62 | GPT-5.1 Codex (high) | OpenAI | 88.3 | 182.1 tok/s | $3.44/M |
| #63 | Claude 4.5 Sonnet (Reasoning) | Anthropic | 88.1 | 50.1 tok/s | $6.56/M |
| #64 | GLM 5V Turbo (Reasoning) | Z AI | 87.9 | n/a | - |
| #65 | Kimi K2.6 (Non-reasoning) | Kimi | 87.7 | 39 tok/s | $1.71/M |
| #66 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) | Anthropic | 87.5 | 61.7 tok/s | $6.56/M |
| #67 | Step 3.7 Flash | StepFun | 87.4 | 385.5 tok/s | $0.438/M |
| #68 | GLM-4.7 (Reasoning) | Z AI | 87.2 | 79.2 tok/s | $1.00/M |
| #69 | Qwen3.5 27B (Reasoning) | Alibaba | 87.0 | 82.8 tok/s | $0.825/M |
| #70 | Claude 4.1 Opus (Reasoning) | Anthropic | 86.8 | 33.7 tok/s | $32.81/M |
| #71 | GPT-5 (medium) | OpenAI | 86.6 | 85.6 tok/s | $3.44/M |
| #72 | Hy3-preview (Reasoning) | Tencent | 86.4 | 96 tok/s | $0.200/M |
| #73 | MiniMax-M2.5 | MiniMax | 86.2 | 202.9 tok/s | $0.525/M |
| #74 | GPT-5.5 Instant (May 2026) | OpenAI | 86.0 | n/a | $11.25/M |
| #75 | DeepSeek V3.2 (Reasoning) | DeepSeek | 85.8 | n/a | $0.337/M |
| #76 | Qwen3.5 122B A10B (Reasoning) | Alibaba | 85.6 | 143.6 tok/s | $1.10/M |
| #77 | Grok 4 | xAI | 85.4 | n/a | $11.00/M |
| #78 | MiMo-V2-Flash (Feb 2026) | Xiaomi | 85.2 | 124.9 tok/s | $0.150/M |
| #79 | Gemini 3 Pro Preview (low) | Google | 85.1 | n/a | $4.50/M |
| #80 | GPT-5 mini (high) | OpenAI | 84.9 | 87.4 tok/s | $0.688/M |