Easy Benchmarks

Artificial Analysis Coding Index

Artificial Analysis aggregate coding score.

The Coding Index is the aggregate coding score in the Artificial Analysis dataset. Use it as a broad view of programming capability before drilling into individual code benchmarks such as LiveCodeBench and SciCode.

Test type: Aggregate coding evaluation that combines the code-focused benchmark results published by Artificial Analysis.

Coverage

436 models have this metric.

Top score: 62.0

Current leader: Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)

Project links

Scores come from the Artificial Analysis LLM snapshot committed in this app.

Artificial Analysis methodology

Top Coding Models

Top models ranked by Coding Index score.

Leaderboard

| Rank | Model | Creator | Value | Speed | Blended Price |
|---|---|---|---|---|---|
| #1 | Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) | Anthropic | 62.0 | n/a | $20.00/M |
| #2 | GPT-5.5 (xhigh) | OpenAI | 59.1 | 69 tok/s | $11.25/M |
| #3 | GPT-5.5 (high) | OpenAI | 58.5 | 61.6 tok/s | $11.25/M |
| #4 | GPT-5.4 (xhigh) | OpenAI | 57.2 | 75.5 tok/s | $5.63/M |
| #5 | Claude Opus 4.8 (Adaptive Reasoning, Max Effort) | Anthropic | 56.7 | 67.8 tok/s | $10.00/M |
| #6 | GPT-5.5 (medium) | OpenAI | 56.2 | 58.7 tok/s | $11.25/M |
| #7 | Gemini 3.1 Pro Preview | Google | 55.5 | 124.7 tok/s | $4.50/M |
| #8 | Claude Opus 4.7 (Non-reasoning, High Effort) | Anthropic | 53.1 | 46 tok/s | $10.00/M |
| #9 | GPT-5.3 Codex (xhigh) | OpenAI | 53.1 | 84.5 tok/s | $4.81/M |
| #10 | Claude Opus 4.7 (Adaptive Reasoning, Max Effort) | Anthropic | 52.5 | 53.8 tok/s | $10.00/M |
| #11 | GPT-5.5 (low) | OpenAI | 52.1 | 66.4 tok/s | $11.25/M |
| #12 | GPT-5.4 mini (xhigh) | OpenAI | 51.5 | 178.8 tok/s | $1.69/M |
| #13 | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 50.9 | 63.2 tok/s | $6.00/M |
| #14 | Qwen3.7 Max | Alibaba | 50.1 | 186.5 tok/s | $3.75/M |
| #15 | GPT-5.2 (xhigh) | OpenAI | 48.7 | 71 tok/s | $4.81/M |
| #16 | GPT-5.5 (Non-reasoning) | OpenAI | 48.6 | 54.4 tok/s | $11.25/M |
| #17 | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 48.1 | 47.3 tok/s | $10.94/M |
| #18 | Claude Opus 4.5 (Reasoning) | Anthropic | 47.8 | 53.5 tok/s | $10.94/M |
| #19 | Claude Opus 4.6 (Non-reasoning, High Effort) | Anthropic | 47.6 | 40.9 tok/s | $10.94/M |
| #20 | DeepSeek V4 Pro (Reasoning, Max Effort) | DeepSeek | 47.5 | 61.6 tok/s | $0.544/M |
| #21 | Muse Spark | Meta | 47.5 | n/a | - |
| #22 | Gemini 3.5 Flash (minimal) | Google | 47.1 | 202.7 tok/s | $3.38/M |
| #23 | Kimi K2.6 | Kimi | 47.1 | 41.6 tok/s | $1.71/M |
| #24 | Gemini 2.5 Pro Preview (Mar' 25) | Google | 46.7 | n/a | - |
| #25 | Gemini 3 Pro Preview (high) | Google | 46.5 | n/a | $4.50/M |
| #26 | Qwen3.7 Plus | Alibaba | 46.5 | 53.6 tok/s | $0.590/M |
| #27 | Claude Sonnet 4.6 (Non-reasoning, High Effort) | Anthropic | 46.4 | 49.1 tok/s | $6.00/M |
| #28 | GPT-5.4 (low) | OpenAI | 45.6 | 63.6 tok/s | $5.63/M |
| #29 | KAT Coder Pro V2 | KwaiKAT | 45.6 | 118.1 tok/s | $0.525/M |
| #30 | MiMo-V2.5-Pro | Xiaomi | 45.5 | 43.3 tok/s | $0.544/M |
| #31 | GPT-5.5 Instant (May 2026) | OpenAI | 45.1 | n/a | $11.25/M |
| #32 | Gemini 3.5 Flash (high) | Google | 45.0 | 203.3 tok/s | $3.38/M |
| #33 | Qwen3.6 Max Preview | Alibaba | 44.9 | 40.9 tok/s | $2.93/M |
| #34 | GPT-5.1 (high) | OpenAI | 44.7 | 121.2 tok/s | $3.44/M |
| #35 | GLM-5 (Reasoning) | Z AI | 44.2 | 79.5 tok/s | $1.55/M |
| #36 | GPT-5.2 (medium) | OpenAI | 44.2 | n/a | $4.81/M |
| #37 | Gemini 3.5 Flash (medium) | Google | 43.9 | 210.1 tok/s | $3.38/M |
| #38 | GPT-5.4 nano (xhigh) | OpenAI | 43.9 | 147.6 tok/s | $0.463/M |
| #39 | GLM-5.1 (Reasoning) | Z AI | 43.4 | 46.8 tok/s | $2.15/M |
| #40 | MiniMax-M3 | MiniMax | 43.4 | 45.6 tok/s | $0.525/M |
| #41 | DeepSeek V4 Pro (Reasoning, High Effort) | DeepSeek | 43.2 | 65.7 tok/s | $0.544/M |
| #42 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) | Anthropic | 43.0 | 50.1 tok/s | $6.00/M |
| #43 | GPT-5.2 Codex (xhigh) | OpenAI | 43.0 | 105.3 tok/s | $4.81/M |
| #44 | Claude Opus 4.5 (Non-reasoning) | Anthropic | 42.9 | 47.6 tok/s | $10.94/M |
| #45 | Qwen3.6 Plus | Alibaba | 42.9 | 52.8 tok/s | $1.13/M |
| #46 | Gemini 3 Flash Preview (Reasoning) | Google | 42.6 | 172.8 tok/s | $1.13/M |
| #47 | Grok 4.20 0309 (Reasoning) | xAI | 42.2 | 166.5 tok/s | $3.00/M |
| #48 | MiMo-V2.5 | Xiaomi | 42.1 | 77.4 tok/s | $0.175/M |
| #49 | MiniMax-M2.7 | MiniMax | 41.9 | 75 tok/s | $0.525/M |
| #50 | MiMo-V2-Pro | Xiaomi | 41.4 | 42.5 tok/s | $1.50/M |
| #51 | Qwen3.5 397B A17B (Reasoning) | Alibaba | 41.3 | 51.8 tok/s | $1.35/M |
| #52 | GPT-5.4 (Non-reasoning) | OpenAI | 41.0 | 59.3 tok/s | $5.63/M |
| #53 | Grok 4.3 (high) | xAI | 41.0 | 159.7 tok/s | $1.56/M |
| #54 | Grok 4 | xAI | 40.5 | n/a | $11.00/M |
| #55 | Grok 4.20 0309 v2 (Reasoning) | xAI | 40.5 | 168.7 tok/s | $3.00/M |
| #56 | DeepSeek V4 Flash (Reasoning, High Effort) | DeepSeek | 39.8 | n/a | $0.175/M |
| #57 | Kimi K2.5 (Reasoning) | Kimi | 39.6 | 31.7 tok/s | $1.19/M |
| #58 | Gemini 3 Pro Preview (low) | Google | 39.4 | n/a | $4.50/M |
| #59 | GLM-5 (Non-reasoning) | Z AI | 39.0 | 63.1 tok/s | $1.55/M |
| #60 | GPT-5 (medium) | OpenAI | 38.9 | 85.6 tok/s | $3.44/M |
| #61 | GPT-5 Codex (high) | OpenAI | 38.9 | 171.1 tok/s | $3.44/M |
| #62 | DeepSeek V4 Flash (Reasoning, Max Effort) | DeepSeek | 38.7 | 98.3 tok/s | $0.175/M |
| #63 | Gemma 4 31B (Reasoning) | Google | 38.7 | 34.8 tok/s | - |
| #64 | Claude 4.5 Sonnet (Reasoning) | Anthropic | 38.6 | 50.1 tok/s | $6.56/M |
| #65 | DeepSeek V4 Pro (Non-reasoning) | DeepSeek | 38.4 | 67 tok/s | $0.544/M |
| #66 | Kimi K2.6 (Non-reasoning) | Kimi | 38.4 | 39 tok/s | $1.71/M |
| #67 | o3 | OpenAI | 38.4 | 122.3 tok/s | $3.50/M |
| #68 | DeepSeek V3.2 Speciale | DeepSeek | 37.9 | n/a | - |
| #69 | Gemini 3 Flash Preview (Non-reasoning) | Google | 37.8 | 181.3 tok/s | $1.13/M |
| #70 | GPT-5.4 mini (medium) | OpenAI | 37.5 | 177.9 tok/s | $1.69/M |
| #71 | MiniMax-M2.5 | MiniMax | 37.4 | 202.9 tok/s | $0.525/M |
| #72 | Qwen3.5 397B A17B (Non-reasoning) | Alibaba | 37.4 | 53.1 tok/s | $1.35/M |
| #73 | Step 3.7 Flash | StepFun | 37.1 | 385.5 tok/s | $0.438/M |
| #74 | MiMo-V2-Omni-0327 | Xiaomi | 36.9 | 85.6 tok/s | $0.800/M |
| #75 | GLM-5-Turbo | Z AI | 36.8 | n/a | - |
| #76 | MiMo-V2.5-Pro (Non-reasoning) | Xiaomi | 36.8 | 43.8 tok/s | $1.35/M |
| #77 | DeepSeek V3.2 (Reasoning) | DeepSeek | 36.7 | n/a | $0.337/M |
| #78 | GPT-5.1 Codex (high) | OpenAI | 36.6 | 182.1 tok/s | $3.44/M |
| #79 | Claude 4.1 Opus (Reasoning) | Anthropic | 36.5 | 33.7 tok/s | $32.81/M |
| #80 | TEHy3-preview (Reasoning) | Tencent | 36.5 | 96 tok/s | $0.200/M |
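
Because the leaderboard lists a Coding score next to a blended price, one common way to read it is to filter by budget and then compare score per dollar. The snippet below is a minimal sketch of that idea, not something the site itself provides: the `Row` class and the `best_under_budget` and `points_per_dollar` helpers are hypothetical names, and the six rows are copied from the leaderboard above purely as sample data. Models with no listed price are skipped rather than treated as free.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Row:
    """One leaderboard entry (hypothetical structure for illustration)."""
    model: str
    score: float                   # Coding value from the leaderboard
    price_per_m: Optional[float]   # blended price in USD per million tokens; None if not listed


# A handful of rows copied from the leaderboard as sample data.
ROWS = [
    Row("Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)", 62.0, 20.00),
    Row("GPT-5.5 (xhigh)", 59.1, 11.25),
    Row("GPT-5.4 (xhigh)", 57.2, 5.63),
    Row("Gemini 3.1 Pro Preview", 55.5, 4.50),
    Row("DeepSeek V4 Pro (Reasoning, Max Effort)", 47.5, 0.544),
    Row("Muse Spark", 47.5, None),  # price shown as "-" on the leaderboard
]


def best_under_budget(rows: List[Row], max_price: float) -> List[Row]:
    """Return priced rows at or under max_price, highest score first."""
    priced = [r for r in rows if r.price_per_m is not None and r.price_per_m <= max_price]
    return sorted(priced, key=lambda r: r.score, reverse=True)


def points_per_dollar(row: Row) -> float:
    """Coding score divided by blended price; only valid for priced rows."""
    if row.price_per_m is None:
        raise ValueError(f"{row.model} has no listed price")
    return row.score / row.price_per_m


if __name__ == "__main__":
    for r in best_under_budget(ROWS, max_price=6.00):
        print(f"{r.model}: {r.score} at ${r.price_per_m}/M "
              f"({points_per_dollar(r):.1f} points per dollar)")
```

Score per dollar is a rough heuristic only: it ignores speed, and a cheap model with a much lower score can still rank first on this ratio, so it is best read alongside the absolute Coding value.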