EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Artificial Analysis Intelligence Index

Artificial Analysis aggregate LLM intelligence score.

The Intelligence Index is an Artificial Analysis aggregate for comparing broad language model capability. Its March 2026 methodology describes a text-only English suite weighted across agents, coding, general tasks, and scientific reasoning.

Test type: Aggregate evaluation suite with standardized prompting and task-specific scoring.

Coverage

500 models have this metric.

60.2

Current leader: GPT-5.5 (xhigh)

Project links

Scores come from the Artificial Analysis LLM snapshot committed in this app.

Artificial Analysis methodology

Top Overall Models

Top models ranked by Overall.

Leaderboard

RankModelCreatorValueSpeedBlended Price
#1GPT-5.5 (xhigh)OpenAI60.266.1 tok/s$11.25/M
#2
GPT-5.5 (high)
OpenAI
58.9
59.3 tok/s
$11.25/M
#3Claude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic57.351.8 tok/s$10.00/M
#4Gemini 3.1 Pro PreviewGoogle57.2131.2 tok/s$4.50/M
#5GPT-5.4 (xhigh)OpenAI56.893.5 tok/s$5.63/M
#6GPT-5.5 (medium)OpenAI56.757.5 tok/s$11.25/M
#7Kimi K2.6Kimi53.929.1 tok/s$1.71/M
#8MiMo-V2.5-ProXiaomi53.859.9 tok/s$1.50/M
#9GPT-5.3 Codex (xhigh)OpenAI53.687.1 tok/s$4.81/M
#10Claude Opus 4.6 (Adaptive Reasoning, Max Effort)Anthropic53.049.9 tok/s$10.00/M
#11Muse SparkMeta52.1n/a-
#12Claude Opus 4.7 (Non-reasoning, High Effort)Anthropic51.843 tok/s$10.00/M
#13Qwen3.6 Max PreviewAlibaba51.833.2 tok/s$2.93/M
#14Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)Anthropic51.768 tok/s$6.00/M
#15DeepSeek V4 Pro (Reasoning, Max Effort)DeepSeek51.534.3 tok/s$2.18/M
#16GLM-5.1 (Reasoning)Z AI51.445.7 tok/s$2.15/M
#17GPT-5.2 (xhigh)OpenAI51.371.8 tok/s$4.81/M
#18GPT-5.5 (low)OpenAI50.856.8 tok/s$11.25/M
#19Qwen3.6 PlusAlibaba50.053.1 tok/s$1.13/M
#20DeepSeek V4 Pro (Reasoning, High Effort)DeepSeek49.832.9 tok/s$2.18/M
#21GLM-5 (Reasoning)Z AI49.864.5 tok/s$1.55/M
#22Claude Opus 4.5 (Reasoning)Anthropic49.757 tok/s$10.00/M
#23MiniMax-M2.7MiniMax49.643.9 tok/s$0.525/M
#24Grok 4.20 0309 v2 (Reasoning)xAI49.389.3 tok/s$3.00/M
#25MiMo-V2-ProXiaomi49.2n/a-
#26GPT-5.2 Codex (xhigh)OpenAI49.087.7 tok/s$4.81/M
#27MiMo-V2.5Xiaomi49.0n/a-
#28GPT-5.4 mini (xhigh)OpenAI48.9158.9 tok/s$1.69/M
#29Grok 4.20 0309 (Reasoning)xAI48.587.8 tok/s$3.00/M
#30Gemini 3 Pro Preview (high)Google48.4128.7 tok/s$4.50/M
#31GPT-5.4 (low)OpenAI47.959.1 tok/s$5.63/M
#32GPT-5.1 (high)OpenAI47.7123.3 tok/s$3.44/M
#33GLM-5-TurboZ AI46.8n/a-
#34Kimi K2.5 (Reasoning)Kimi46.831.6 tok/s$1.20/M
#35GPT-5.2 (medium)OpenAI46.6n/a$4.81/M
#36Claude Opus 4.6 (Non-reasoning, High Effort)Anthropic46.542 tok/s$10.00/M
#37DeepSeek V4 Flash (Reasoning, Max Effort)DeepSeek46.577.4 tok/s$0.175/M
#38Gemini 3 Flash Preview (Reasoning)Google46.4193.2 tok/s$1.13/M
#39Qwen3.6 27B (Reasoning)Alibaba45.864.1 tok/s$1.35/M
#40Qwen3.5 397B A17B (Reasoning)Alibaba45.050.4 tok/s$1.35/M
#41DeepSeek V4 Flash (Reasoning, High Effort)DeepSeek44.9n/a$0.175/M
#42MiMo-V2-Omni-0327Xiaomi44.9n/a-
#43GPT-5 (high)OpenAI44.684.2 tok/s$3.44/M
#44GPT-5 Codex (high)OpenAI44.6166.8 tok/s$3.44/M
#45Claude Sonnet 4.6 (Non-reasoning, High Effort)Anthropic44.448.3 tok/s$6.00/M
#46GPT-5.4 nano (xhigh)OpenAI44.0160.3 tok/s$0.463/M
#47GLM-5.1 (Non-reasoning)Z AI43.841.5 tok/s$2.15/M
#48KAT Coder Pro V2KwaiKAT43.8110.7 tok/s$0.525/M
#49Qwen3.6 35B A3B (Reasoning)Alibaba43.5191.8 tok/s$0.557/M
#50MiMo-V2-OmniXiaomi43.4n/a-
#51Claude Opus 4.5 (Non-reasoning)Anthropic43.150.3 tok/s$10.00/M
#52GPT-5.1 Codex (high)OpenAI43.1162.7 tok/s$3.44/M
#53Claude 4.5 Sonnet (Reasoning)Anthropic43.043.8 tok/s$6.00/M
#54Kimi K2.6 (Non-reasoning)Kimi43.0n/a-
#55GLM 5V Turbo (Reasoning)Z AI42.9n/a-
#56Claude Sonnet 4.6 (Non-reasoning, Low Effort)Anthropic42.651.5 tok/s$6.00/M
#57GLM-4.7 (Reasoning)Z AI42.190.3 tok/s$1.00/M
#58Qwen3.5 27B (Reasoning)Alibaba42.187 tok/s$0.825/M
#59Claude 4.1 Opus (Reasoning)Anthropic42.035.8 tok/s$30.00/M
#60GPT-5 (medium)OpenAI42.082.3 tok/s$3.44/M
#61TEHy3-preview (Reasoning)Tencent41.986.4 tok/s-
#62MiniMax-M2.5MiniMax41.979.7 tok/s$0.525/M
#63DeepSeek V3.2 (Reasoning)DeepSeek41.7n/a$0.315/M
#64Qwen3.5 122B A10B (Reasoning)Alibaba41.6139.9 tok/s$1.10/M
#65Grok 4xAI41.550.3 tok/s$6.00/M
#66MiMo-V2-Flash (Feb 2026)Xiaomi41.5120.6 tok/s$0.150/M
#67Gemini 3 Pro Preview (low)Google41.3n/a$4.50/M
#68GPT-5 mini (high)OpenAI41.285.7 tok/s$0.688/M
#69GPT-5.5 (Non-reasoning)OpenAI40.951.3 tok/s$11.25/M
#70Kimi K2 ThinkingKimi40.999 tok/s$1.08/M
#71o3-proOpenAI40.716.9 tok/s$35.00/M
#72GLM-5 (Non-reasoning)Z AI40.659.6 tok/s$1.55/M
#73Qwen3.5 397B A17B (Non-reasoning)Alibaba40.152.5 tok/s$1.35/M
#74Qwen3 Max ThinkingAlibaba39.934.3 tok/s$2.40/M
#75MiniMax-M2.1MiniMax39.484.8 tok/s$0.525/M
#76DeepSeek V4 Pro (Non-reasoning)DeepSeek39.3n/a-
#77Gemma 4 31B (Reasoning)Google39.234.8 tok/s-
#78GPT-5 (low)OpenAI39.265.8 tok/s$3.44/M
#79MiMo-V2-Flash (Reasoning)Xiaomi39.2118.8 tok/s$0.150/M
#80Claude 4 Opus (Reasoning)Anthropic39.036.8 tok/s$30.00/M