Easy Benchmarks
Workspace
Overview
Benchmarks
Benchmarks list
Compare
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
LLMs
Audio
Image
Video
Feedback
Log inSign up
Back

Coding

Coding leaderboard from coding-specific benchmark metrics.

Leader

Domain score averages relative percentile across included metrics.

GPT-5.5 (xhigh)

Domain score 100

CodingLCBSciCode

Top Coding Models

Coding leaderboard from coding-specific benchmark metrics.

Domain Leaderboard

RankModelCreatorDomain ScoreSpeedBlended Price
#1GPT-5.5 (xhigh)OpenAI99.762.4 tok/s$11.25/M
#2
GPT-5.4 (xhigh)
OpenAI
99.7
75.5 tok/s
$5.63/M
#3GPT-5.5 (high)OpenAI99.562.3 tok/s$11.25/M
#4Gemini 3.1 Pro PreviewGoogle99.4135.9 tok/s$4.50/M
#5Claude Opus 4.8 (Adaptive Reasoning, Max Effort)Anthropic98.966.3 tok/s$10.94/M
#6GPT-5.5 (medium)OpenAI98.759.2 tok/s$11.25/M
#7Claude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic98.561.8 tok/s$10.94/M
#8GPT-5.3 Codex (xhigh)OpenAI98.298.6 tok/s$4.81/M
#9Gemini 3 Pro Preview (high)Google98.1n/a$4.50/M
#10GPT-5.2 (xhigh)OpenAI97.671 tok/s$4.81/M
#11GPT-5.5 (low)OpenAI97.460.2 tok/s$11.25/M
#12Claude Opus 4.7 (Non-reasoning, High Effort)Anthropic97.248.1 tok/s$10.94/M
#13Claude Opus 4.6 (Adaptive Reasoning, Max Effort)Anthropic96.847.3 tok/s$10.94/M
#14Kimi K2.6Kimi96.741.6 tok/s$1.71/M
#15Claude Opus 4.5 (Reasoning)Anthropic96.353.5 tok/s$10.94/M
#16GPT-5.4 mini (xhigh)OpenAI96.3176 tok/s$1.69/M
#17Muse SparkMeta96.2n/a-
#18DeepSeek V4 Pro (Reasoning, Max Effort)DeepSeek95.761.8 tok/s$0.544/M
#19Qwen3.7 MaxAlibaba95.7186.5 tok/s$3.75/M
#20Gemini 3.5 Flash (high)Google95.4212.4 tok/s$3.38/M
#21Gemini 3 Flash Preview (Reasoning)Google95.4172.8 tok/s$1.13/M
#22GPT-5.5 (Non-reasoning)OpenAI95.359 tok/s$11.25/M
#23GPT-5.4 (low)OpenAI95.263.6 tok/s$5.63/M
#24Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)Anthropic94.970.4 tok/s$6.56/M
#25Gemini 3.5 Flash (minimal)Google94.9199.1 tok/s$3.38/M
#26GPT-5.2 Codex (xhigh)OpenAI94.7105.3 tok/s$4.81/M
#27MiMo-V2.5-ProXiaomi94.743.3 tok/s$0.544/M
#28Gemini 3.5 Flash (medium)Google94.7210.2 tok/s$3.38/M
#29GPT-5.5 Instant (May 2026)OpenAI94.7n/a$11.25/M
#30GPT-5.2 (medium)OpenAI94.2n/a$4.81/M
#31Claude Opus 4.6 (Non-reasoning, High Effort)Anthropic93.740.9 tok/s$10.94/M
#32Claude Sonnet 4.6 (Non-reasoning, High Effort)Anthropic93.657.9 tok/s$6.56/M
#33Gemini 3 Pro Preview (low)Google92.9n/a$4.50/M
#34Qwen3.6 Max PreviewAlibaba92.740.9 tok/s$2.93/M
#35Qwen3.7 PlusAlibaba92.653.6 tok/s$0.590/M
#36GPT-5.1 (high)OpenAI92.5121.2 tok/s$3.44/M
#37GPT-5.4 nano (xhigh)OpenAI92.2158.3 tok/s$0.463/M
#38GLM-5 (Reasoning)Z AI92.079.5 tok/s$1.55/M
#39DeepSeek V4 Pro (Reasoning, High Effort)DeepSeek91.457.6 tok/s$0.544/M
#40MiniMax-M2.7MiniMax91.175 tok/s$0.525/M
#41GPT-5.4 (Non-reasoning)OpenAI91.059.3 tok/s$5.63/M
#42Grok 4.3 (high)xAI91.0237.5 tok/s$1.56/M
#43Kimi K2.5 (Reasoning)Kimi90.931.7 tok/s$1.19/M
#44Grok 4xAI90.9n/a$11.00/M
#45DeepSeek V3.2 SpecialeDeepSeek90.9n/a-
#46MiniMax-M3MiniMax90.845.6 tok/s$0.525/M
#47Gemini 3 Flash Preview (Non-reasoning)Google90.5181.3 tok/s$1.13/M
#48GLM-4.7 (Reasoning)Z AI90.279.2 tok/s$1.00/M
#49GLM-5.1 (Reasoning)Z AI89.846.8 tok/s$2.15/M
#50Claude Sonnet 4.6 (Non-reasoning, Low Effort)Anthropic89.761.7 tok/s$6.56/M
#51Claude Opus 4.5 (Non-reasoning)Anthropic89.747.6 tok/s$10.94/M
#52Grok 4.20 0309 (Reasoning)xAI89.6166.5 tok/s$3.00/M
#53Grok 4.20 0309 v2 (Reasoning)xAI89.3168.7 tok/s$3.00/M
#54MiMo-V2.5Xiaomi88.277.4 tok/s$0.175/M
#55DeepSeek V4 Flash (Reasoning, Max Effort)DeepSeek88.1107.8 tok/s$0.175/M
#56GPT-5 (high)OpenAI87.7111.1 tok/s$3.44/M
#57GPT-5 Codex (high)OpenAI87.7171.1 tok/s$3.44/M
#58GPT-5.1 Codex mini (high)OpenAI87.3213.6 tok/s$0.688/M
#59MiMo-V2-ProXiaomi87.242.5 tok/s$1.50/M
#60Gemini 2.5 Pro Preview (Mar' 25)Google86.9n/a-
#61Gemma 4 31B (Reasoning)Google86.834.8 tok/s-
#62GPT-5.4 mini (medium)OpenAI86.8184.4 tok/s$1.69/M
#63Kimi K2 ThinkingKimi86.6131.1 tok/s$1.08/M
#64o3OpenAI86.5159.6 tok/s$3.50/M
#65Qwen3.5 397B A17B (Reasoning)Alibaba86.351.8 tok/s$1.35/M
#66Gemini 2.5 Pro Preview (May' 25)Google86.0n/a$3.44/M
#67DeepSeek V4 Flash (Reasoning, High Effort)DeepSeek86.0n/a$0.175/M
#68Claude 4.5 Sonnet (Reasoning)Anthropic85.950.1 tok/s$6.56/M
#69GPT-5.1 Codex (high)OpenAI85.7182.1 tok/s$3.44/M
#70GLM-5-TurboZ AI85.6n/a-
#71Qwen3.6 PlusAlibaba85.452.8 tok/s$1.13/M
#72DeepSeek V4 Pro (Non-reasoning)DeepSeek85.361.2 tok/s$0.544/M
#73Grok 4.1 Fast (Reasoning)xAI84.8n/a-
#74MiniMax-M2.5MiniMax84.8202.9 tok/s$0.525/M
#75GLM 5V Turbo (Reasoning)Z AI84.4n/a-
#76DeepSeek V3.2 (Reasoning)DeepSeek84.3n/a$0.337/M
#77Grok 4.3 (medium)xAI84.3197.6 tok/s$1.56/M
#78Gemini 2.5 ProGoogle84.1139.8 tok/s$3.44/M
#79o4-mini (high)OpenAI84.0151 tok/s$1.93/M
#80GPT-5 (medium)OpenAI83.385.6 tok/s$3.44/M