Easy Benchmarks
Workspace
Overview
Benchmarks
Benchmarks list
Compare
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
LLMs
Audio
Image
Video
Feedback
Log inSign up
Back

Price & Value

Cost leaderboard using input, output, blended price, and value index.

Leader

Domain score averages relative percentile across included metrics.

Qwen3.5 0.8B (Non-reasoning)

Domain score 100

Blended $Input $Output $Value

Top Cost Models

Cost leaderboard using input, output, blended price, and value index.

Domain Leaderboard

RankModelCreatorDomain ScoreSpeedBlended Price
#1Qwen3.5 0.8B (Non-reasoning)Alibaba99.888 tok/s$0.020/M
#2
Qwen3.5 0.8B (Reasoning)
Alibaba
99.6
n/a
$0.020/M
#3Qwen3.5 2B (Non-reasoning)Alibaba98.7318.9 tok/s$0.040/M
#4Gemma 3n E4B InstructGoogle98.750 tok/s$0.025/M
#5Qwen3.5 2B (Reasoning)Alibaba98.6n/a$0.040/M
#6Sarvam 30B (high)Sarvam97.7147.9 tok/s$0.047/M
#7Qwen3.5 4B (Non-reasoning)Alibaba97.4208.9 tok/s$0.060/M
#8Qwen3.5 4B (Reasoning)Alibaba97.3195.8 tok/s$0.060/M
#9LFM2 24B A2BLiquid AI96.5118.3 tok/s$0.052/M
#10Granite 4.1 8BIBM95.7134.2 tok/s$0.063/M
#11Nova MicroAmazon95.5302 tok/s$0.061/M
#12Gemma 3 4B InstructGoogle95.5n/a$0.050/M
#13NVIDIA Nemotron Nano 9B V2 (Reasoning)NVIDIA95.3118 tok/s$0.070/M
#14Sarvam 105B (high)Sarvam95.3100.7 tok/s$0.074/M
#15gpt-oss-20B (high)OpenAI95.1272.3 tok/s$0.088/M
#16Llama 3.2 Instruct 1BMeta94.992.9 tok/s$0.050/M
#17gpt-oss-20B (low)OpenAI93.5274.1 tok/s$0.095/M
#18NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)NVIDIA93.3133.6 tok/s$0.096/M
#19NVIDIA Nemotron Nano 9B V2 (Non-reasoning)NVIDIA93.1133.6 tok/s$0.086/M
#20NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)NVIDIA92.687.3 tok/s$0.088/M
#21Qwen3.5 9B (Reasoning)Alibaba92.569.4 tok/s$0.113/M
#22Qwen2.5 TurboAlibaba92.066.4 tok/s$0.088/M
#23Llama 3 Instruct 8BMeta91.988.3 tok/s$0.070/M
#24Llama 3.1 Instruct 8BMeta91.3201.5 tok/s$0.100/M
#25Mistral Small 3Mistral91.2153.7 tok/s$0.104/M
#26Ministral 3 3BMistral90.6155.6 tok/s$0.100/M
#27Nova LiteAmazon90.3191.7 tok/s$0.105/M
#28Granite 3.3 8B (Non-reasoning)IBM89.9400.9 tok/s$0.085/M
#29GPT-5 nano (high)OpenAI89.8150.4 tok/s$0.138/M
#30Llama 2 Chat 7BMeta89.6100.6 tok/s$0.100/M
#31GPT-5 nano (medium)OpenAI89.5167 tok/s$0.138/M
#32Nemotron 3 Nano Omni 30B A3B ReasoningNVIDIA89.3276.7 tok/s$0.131/M
#33MiMo-V2-Flash (Feb 2026)Xiaomi89.3124.9 tok/s$0.150/M
#34Granite 4.0 H SmallIBM89.2454.2 tok/s$0.107/M
#35Mistral Small 3.2Mistral89.0127.1 tok/s$0.128/M
#36MiMo-V2-Flash (Reasoning)Xiaomi88.6129.5 tok/s$0.150/M
#37Ling 2.6 FlashInclusionAI88.3n/a$0.150/M
#38MiMo-V2-Flash (Non-reasoning)Xiaomi88.1122.8 tok/s$0.150/M
#39GLM-4.7-Flash (Reasoning)Z AI87.894.1 tok/s$0.153/M
#40Step 3.5 Flash 2603StepFun87.6231 tok/s$0.150/M
#41Step 3.5 FlashStepFun87.6217.5 tok/s$0.150/M
#42GLM-4.7-Flash (Non-reasoning)Z AI87.2105.6 tok/s$0.153/M
#43DeepSeek V4 Flash (Reasoning, High Effort)DeepSeek87.1n/a$0.175/M
#44GPT-5 nano (minimal)OpenAI87.1153.9 tok/s$0.138/M
#45Qwen3 30B A3B (Non-reasoning)Alibaba87.068.6 tok/s$0.133/M
#46DeepSeek V4 Flash (Reasoning, Max Effort)DeepSeek87.0107.8 tok/s$0.175/M
#47Mistral Small 3.1Mistral87.0158.2 tok/s$0.138/M
#48Devstral Small (Jul '25)Mistral87.042.3 tok/s$0.150/M
#49DeepSeek V4 Flash (Non-reasoning)DeepSeek86.6120.2 tok/s$0.175/M
#50MiMo-V2.5Xiaomi86.477.4 tok/s$0.175/M
#51Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)Google85.4n/a$0.175/M
#52Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)Google85.2n/a$0.175/M
#53Ministral 3 8BMistral85.2119.9 tok/s$0.150/M
#54Gemini 2.5 Flash-Lite (Reasoning)Google84.9265.2 tok/s$0.175/M
#55Olmo 3 7B InstructAllen Institute for AI84.8n/a$0.125/M
#56Apertus 8B InstructSwiss AI Initiative84.3n/a$0.125/M
#57Gemma 3 12B InstructGoogle83.9n/a$0.140/M
#58Gemma 3 27B InstructGoogle83.7n/a$0.145/M
#59Gemma 4 26B A4B (Reasoning)Google83.4n/a$0.198/M
#60Llama Nemotron Super 49B v1.5 (Reasoning)NVIDIA83.344.2 tok/s$0.175/M
#61Gemma 4 26B A4B (Non-reasoning)Google83.282.3 tok/s$0.198/M
#62Gemini 2.5 Flash-Lite (Non-reasoning)Google82.7229.5 tok/s$0.175/M
#63Llama 3.2 Instruct 3BMeta82.752.3 tok/s$0.150/M
#64TEHy3-preview (Reasoning)Tencent82.696 tok/s$0.200/M
#65Solar MiniUpstage82.675.9 tok/s$0.150/M
#66Gemma 4 31B (Non-reasoning)Google82.456.9 tok/s$0.205/M
#67TEHy3-preview (Non-reasoning)Tencent82.183.9 tok/s$0.200/M
#68GPT-4.1 nanoOpenAI81.5118.2 tok/s$0.175/M
#69Llama Nemotron Super 49B v1.5 (Non-reasoning)NVIDIA81.543.9 tok/s$0.175/M
#70Qwen3 30B A3B (Reasoning)Alibaba81.368.5 tok/s$0.180/M
#71Ministral 3 14BMistral79.286 tok/s$0.200/M
#72Hermes 4 - Llama-3.1 70B (Reasoning)Nous Research78.887.2 tok/s$0.198/M
#73Qwen3 8B (Non-reasoning)Alibaba78.664.4 tok/s$0.185/M
#74gpt-oss-120b (high)OpenAI78.1358.8 tok/s$0.262/M
#75Qwen3 4B (Non-reasoning)Alibaba77.7n/a$0.188/M
#76Hermes 4 - Llama-3.1 70B (Non-reasoning)Nous Research77.684.9 tok/s$0.198/M
#77Grok 4 Fast (Reasoning)xAI77.1n/a$0.275/M
#78Qwen3.5 Omni FlashAlibaba76.7224.4 tok/s$0.275/M
#79Mistral Small 4 (Reasoning)Mistral76.3183.5 tok/s$0.262/M
#80Qwen3 30B A3B 2507 InstructAlibaba75.9105.2 tok/s$0.213/M