Easy Benchmarks
Workspace
Overview
Benchmarks
Benchmarks list
Compare
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
LLMs
Audio
Image
Video
Feedback
Log inSign up
Back

Math

Math leaderboard from math-specific benchmark metrics.

Leader

Domain score averages relative percentile across included metrics.

GPT-5.2 (xhigh)

Domain score 100

MathMATH-500AIME

Top Math Models

Math leaderboard from math-specific benchmark metrics.

Domain Leaderboard

RankModelCreatorDomain ScoreSpeedBlended Price
#1GPT-5.2 (xhigh)OpenAI100.071 tok/s$4.81/M
#2
GPT-5 Codex (high)
OpenAI
99.6
171.1 tok/s
$3.44/M
#3Gemini 3 Flash Preview (Reasoning)Google99.3172.8 tok/s$1.13/M
#4DeepSeek V3.2 SpecialeDeepSeek98.9n/a-
#5GPT-5 (high)OpenAI98.6111.1 tok/s$3.44/M
#6GPT-5.2 (medium)OpenAI98.5n/a$4.81/M
#7MiMo-V2-Flash (Reasoning)Xiaomi98.1129.5 tok/s$0.150/M
#8Gemini 3 Pro Preview (high)Google97.8n/a$4.50/M
#9GPT-5.1 Codex (high)OpenAI97.4182.1 tok/s$3.44/M
#10Grok 4xAI97.1n/a$11.00/M
#11GLM-4.7 (Reasoning)Z AI97.079.2 tok/s$1.00/M
#12KAT-Coder-Pro V1KwaiKAT96.6114.7 tok/s$0.525/M
#13GPT-5 (medium)OpenAI96.585.6 tok/s$3.44/M
#14Kimi K2 ThinkingKimi96.3131.1 tok/s$1.08/M
#15o4-mini (high)OpenAI95.8151 tok/s$1.93/M
#16Nova 2.0 Lite (high)Amazon95.5177.3 tok/s$0.850/M
#17Qwen3 235B A22B 2507 (Reasoning)Alibaba95.259.4 tok/s$0.838/M
#18GPT-5.1 (high)OpenAI95.1121.2 tok/s$3.44/M
#19gpt-oss-120b (high)OpenAI94.8358.8 tok/s$0.262/M
#20o3-mini (high)OpenAI94.4218.5 tok/s$1.93/M
#21o3OpenAI94.2159.6 tok/s$3.50/M
#22DeepSeek V3.2 (Reasoning)DeepSeek94.0n/a$0.337/M
#23Gemini 2.5 Pro Preview (May' 25)Google93.6n/a$3.44/M
#24Grok 3 mini Reasoning (high)xAI93.358.8 tok/s$0.350/M
#25GPT-5.1 Codex mini (high)OpenAI93.3213.6 tok/s$0.688/M
#26Gemini 2.5 Pro Preview (Mar' 25)Google93.2n/a-
#27Claude Opus 4.5 (Reasoning)Anthropic92.953.5 tok/s$10.94/M
#28NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)NVIDIA92.5133.6 tok/s$0.096/M
#29Gemini 2.5 Flash Preview (Reasoning)Google92.1n/a-
#30GPT-5 mini (high)OpenAI91.887.4 tok/s$0.688/M
#31K-EXAONE (Reasoning)LG AI Research91.0n/a-
#32DeepSeek V3.1 (Reasoning)DeepSeek90.7n/a$0.865/M
#33DeepSeek V3.1 Terminus (Reasoning)DeepSeek90.3n/a$1.91/M
#34Grok 4 Fast (Reasoning)xAI89.9n/a$0.275/M
#35Nova 2.0 Omni (medium)Amazon89.6n/a$0.850/M
#36gpt-oss-20B (high)OpenAI89.2272.3 tok/s$0.088/M
#37GPT-5 (low)OpenAI88.879.3 tok/s$3.44/M
#38Grok 4.1 Fast (Reasoning)xAI88.8n/a-
#39Gemini 2.5 ProGoogle88.8139.8 tok/s$3.44/M
#40Ring-1TInclusionAI88.4n/a-
#41Nova 2.0 Pro Preview (medium)Amazon88.1138.3 tok/s$3.44/M
#42Nova 2.0 Lite (medium)Amazon87.7135.5 tok/s$0.850/M
#43o3-miniOpenAI87.5203.3 tok/s$1.93/M
#44DeepSeek R1 0528 (May '25)DeepSeek87.0n/a$2.06/M
#45Qwen3 VL 235B A22B (Reasoning)Alibaba86.932.5 tok/s$2.17/M
#46Llama Nemotron Super 49B v1.5 (Reasoning)NVIDIA86.844.2 tok/s$0.175/M
#47Apriel-v1.6-15B-ThinkerServiceNow86.6n/a-
#48Claude 4.5 Sonnet (Reasoning)Anthropic86.250.1 tok/s$6.56/M
#49EXAONE 4.0 32B (Reasoning)LG AI Research85.9n/a-
#50INTELLECT-3Prime Intellect85.8n/a-
#51DeepSeek V3.2 Exp (Reasoning)DeepSeek85.4n/a$0.310/M
#52Claude 4 Sonnet (Reasoning)Anthropic85.145.5 tok/s$6.56/M
#53GLM-4.5 (Reasoning)Z AI84.950.1 tok/s$1.00/M
#54Apriel-v1.5-15B-ThinkerServiceNow84.7n/a-
#55o1OpenAI84.7123.2 tok/s$26.25/M
#56Sonar Reasoning ProPerplexity84.5n/a-
#57Gemini 3 Pro Preview (low)Google84.3n/a$4.50/M
#58GLM-4.6 (Reasoning)Z AI84.043.9 tok/s$0.963/M
#59Gemini 2.5 Flash (Reasoning)Google83.6221.3 tok/s$0.850/M
#60GLM-4.6V (Reasoning)Z AI83.644.9 tok/s$0.450/M
#61ERNIE 5.0 Thinking PreviewBaidu83.2n/a-
#62GPT-5 mini (medium)OpenAI82.886.7 tok/s$0.688/M
#63Claude 4 Opus (Reasoning)Anthropic82.436.4 tok/s$32.81/M
#64Qwen3 VL 32B (Reasoning)Alibaba82.198.3 tok/s$2.63/M
#65Seed-OSS-36B-InstructByteDance Seed81.740.4 tok/s$0.300/M
#66Qwen3 Next 80B A3B (Reasoning)Alibaba81.3135.7 tok/s$1.88/M
#67Claude 4.5 Haiku (Reasoning)Anthropic81.0152.2 tok/s$2.19/M
#68GPT-5 nano (high)OpenAI80.6150.4 tok/s$0.138/M
#69R1 1776Perplexity80.5n/a-
#70MiniMax M1 80kMiniMax80.4n/a$0.963/M
#71Ring-flash-2.0InclusionAI80.2n/a$0.247/M
#72Qwen3 30B A3B 2507 (Reasoning)Alibaba79.8139.3 tok/s$0.673/M
#73Qwen3 235B A22B (Reasoning)Alibaba79.859 tok/s$2.63/M
#74Qwen3 32B (Reasoning)Alibaba79.798.4 tok/s$0.276/M
#75GLM-4.5-AirZ AI79.674.5 tok/s$0.372/M
#76Qwen3 235B A22B 2507 InstructAlibaba79.642.5 tok/s$0.356/M
#77MiniMax-M2.1MiniMax79.5184.6 tok/s$0.525/M
#78Qwen3 4B 2507 (Reasoning)Alibaba79.1n/a-
#79Qwen3 Max Thinking (Preview)Alibaba78.750.7 tok/s$2.40/M
#80Qwen3 VL 30B A3B (Reasoning)Alibaba78.4126.8 tok/s$0.338/M