Easy Benchmarks: LLM model index

Input Price

Input token price per 1M tokens.

Input Price measures the listed cost of sending prompts and context to a model. It matters most for retrieval, agent, and long-context workflows where input tokens dominate spend.
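As a rough illustration of why input price dominates in long-context workloads, the arithmetic can be sketched as below. The workload figures (2,000 requests of 50,000 context tokens each) are hypothetical and chosen only for the example; the two prices are the lowest and highest input prices listed on this page.

```python
def input_cost_usd(input_tokens: int, price_per_million_usd: float) -> float:
    """Cost of sending `input_tokens` tokens at a listed price per 1M tokens."""
    return input_tokens / 1_000_000 * price_per_million_usd


# Hypothetical workload: 2,000 requests, each carrying 50,000 tokens of context.
total_tokens = 2_000 * 50_000  # 100,000,000 input tokens

cost_low = input_cost_usd(total_tokens, 0.010)   # model listed at $0.010/M
cost_high = input_cost_usd(total_tokens, 0.200)  # model listed at $0.200/M

print(f"${cost_low:.2f} vs ${cost_high:.2f}")
```

Under these assumptions the same workload costs roughly $1 at the cheapest listing and roughly $20 at $0.200/M, before any output-token charges.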

Test type: Provider input-token price comparison. Lower values rank better.

Coverage

325 models have this metric.

Current leader: Qwen3.5 0.8B (Non-reasoning), at $0.010/M

Project links

Prices come from Artificial Analysis pricing data in the committed snapshot.

Artificial Analysis methodology

Top Input $ Models

Models ranked by input price (USD per 1M input tokens), lowest first.
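A minimal sketch of how such a ranking could be produced is shown here. The page states only that lower values rank better; the tie-break (case-insensitive model name) is an assumption inferred from the order in which equally priced models appear in the leaderboard, and `Row` and `rank_by_input_price` are hypothetical names.

```python
from typing import NamedTuple


class Row(NamedTuple):
    model: str
    input_price: float  # USD per 1M input tokens


# A few rows taken from the leaderboard, deliberately shuffled.
rows = [
    Row("Qwen3.5 2B (Non-reasoning)", 0.020),
    Row("Qwen3.5 0.8B (Reasoning)", 0.010),
    Row("Gemma 3n E4B Instruct", 0.020),
    Row("Qwen3.5 0.8B (Non-reasoning)", 0.010),
]


def rank_by_input_price(rows):
    """Return (rank, model) pairs, cheapest first.

    Ties are broken by case-insensitive model name; this tie-break is an
    assumption, not something the page documents.
    """
    ordered = sorted(rows, key=lambda r: (r.input_price, r.model.casefold()))
    return [(rank, r.model) for rank, r in enumerate(ordered, start=1)]


ranking = rank_by_input_price(rows)
for rank, model in ranking:
    print(f"#{rank} {model}")
```

With these four rows the sketch reproduces the order of the first four leaderboard entries.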

Leaderboard

| Rank | Model | Creator | Input Price | Speed | Blended Price |
| --- | --- | --- | --- | --- | --- |
| #1 | Qwen3.5 0.8B (Non-reasoning) | Alibaba | $0.010/M | 273.6 tok/s | $0.020/M |
| #2 | Qwen3.5 0.8B (Reasoning) | Alibaba | $0.010/M | n/a | $0.020/M |
| #3 | Gemma 3n E4B Instruct | Google | $0.020/M | 15.3 tok/s | $0.025/M |
| #4 | Qwen3.5 2B (Non-reasoning) | Alibaba | $0.020/M | 227 tok/s | $0.040/M |
| #5 | Qwen3.5 2B (Reasoning) | Alibaba | $0.020/M | n/a | $0.040/M |
| #6 | Granite 3.3 8B (Non-reasoning) | IBM | $0.030/M | 410.5 tok/s | $0.085/M |
| #7 | LFM2 24B A2B | Liquid AI | $0.030/M | 196.9 tok/s | $0.052/M |
| #8 | Qwen3.5 4B (Non-reasoning) | Alibaba | $0.030/M | 200.2 tok/s | $0.060/M |
| #9 | Qwen3.5 4B (Reasoning) | Alibaba | $0.030/M | 204.8 tok/s | $0.060/M |
| #10 | Nova Micro | Amazon | $0.035/M | 332.1 tok/s | $0.061/M |
| #11 | NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.040/M | 121.6 tok/s | $0.070/M |
| #12 | Llama 3 Instruct 8B | Meta | $0.045/M | 82.2 tok/s | $0.070/M |
| #13 | GPT-5 nano (high) | OpenAI | $0.050/M | 136 tok/s | $0.138/M |
| #14 | GPT-5 nano (medium) | OpenAI | $0.050/M | 150.3 tok/s | $0.138/M |
| #15 | GPT-5 nano (minimal) | OpenAI | $0.050/M | 139.1 tok/s | $0.138/M |
| #16 | Granite 4.1 8B | IBM | $0.050/M | 134.6 tok/s | $0.063/M |
| #17 | Llama 2 Chat 7B | Meta | $0.050/M | 99.7 tok/s | $0.100/M |
| #18 | NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | $0.050/M | 153.3 tok/s | $0.086/M |
| #19 | Qwen2.5 Turbo | Alibaba | $0.050/M | 77.7 tok/s | $0.087/M |
| #20 | NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | $0.055/M | 154.8 tok/s | $0.096/M |
| #21 | Granite 4.0 H Small | IBM | $0.060/M | 238.9 tok/s | $0.107/M |
| #22 | Nova Lite | Amazon | $0.060/M | 186.8 tok/s | $0.105/M |
| #23 | GLM-4.7-Flash (Non-reasoning) | Z AI | $0.070/M | 89.6 tok/s | $0.152/M |
| #24 | GLM-4.7-Flash (Reasoning) | Z AI | $0.070/M | 110.5 tok/s | $0.152/M |
| #25 | gpt-oss-20B (high) | OpenAI | $0.070/M | 242.3 tok/s | $0.100/M |
| #26 | gpt-oss-20B (low) | OpenAI | $0.070/M | 249.7 tok/s | $0.108/M |
| #27 | Apertus 8B Instruct | Swiss AI Initiative | $0.100/M | n/a | $0.125/M |
| #28 | Devstral Small (Jul '25) | Mistral | $0.100/M | 194.2 tok/s | $0.150/M |
| #29 | Gemini 2.5 Flash-Lite (Non-reasoning) | Google | $0.100/M | 239.9 tok/s | $0.175/M |
| #30 | Gemini 2.5 Flash-Lite (Reasoning) | Google | $0.100/M | 243.6 tok/s | $0.175/M |
| #31 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | Google | $0.100/M | n/a | $0.175/M |
| #32 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | Google | $0.100/M | n/a | $0.175/M |
| #33 | GPT-4.1 nano | OpenAI | $0.100/M | 125.2 tok/s | $0.175/M |
| #34 | Ling 2.6 Flash | InclusionAI | $0.100/M | 206 tok/s | $0.150/M |
| #35 | Llama 3.1 Instruct 8B | Meta | $0.100/M | 164.4 tok/s | $0.100/M |
| #36 | Llama 3.2 Instruct 1B | Meta | $0.100/M | 97.7 tok/s | $0.100/M |
| #37 | Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | $0.100/M | 51.3 tok/s | $0.175/M |
| #38 | Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | $0.100/M | 50.8 tok/s | $0.175/M |
| #39 | MiMo-V2-Flash (Feb 2026) | Xiaomi | $0.100/M | 120.6 tok/s | $0.150/M |
| #40 | MiMo-V2-Flash (Non-reasoning) | Xiaomi | $0.100/M | 116.7 tok/s | $0.150/M |
| #41 | MiMo-V2-Flash (Reasoning) | Xiaomi | $0.100/M | 118.8 tok/s | $0.150/M |
| #42 | Ministral 3 3B | Mistral | $0.100/M | 287.6 tok/s | $0.100/M |
| #43 | Mistral Small 3 | Mistral | $0.100/M | 135.9 tok/s | $0.150/M |
| #44 | Mistral Small 3.1 | Mistral | $0.100/M | 138.8 tok/s | $0.150/M |
| #45 | Mistral Small 3.2 | Mistral | $0.100/M | 153.8 tok/s | $0.150/M |
| #46 | Olmo 3 7B Instruct | Allen Institute for AI | $0.100/M | n/a | $0.125/M |
| #47 | Qwen3.5 9B (Reasoning) | Alibaba | $0.100/M | 62.9 tok/s | $0.113/M |
| #48 | Qwen3.5 Omni Flash | Alibaba | $0.100/M | 190.4 tok/s | $0.275/M |
| #49 | Step 3.5 Flash | StepFun | $0.100/M | 123.6 tok/s | $0.150/M |
| #50 | Qwen3 0.6B (Non-reasoning) | Alibaba | $0.110/M | 204.8 tok/s | $0.188/M |
| #51 | Qwen3 0.6B (Reasoning) | Alibaba | $0.110/M | 195.1 tok/s | $0.398/M |
| #52 | Qwen3 1.7B (Non-reasoning) | Alibaba | $0.110/M | 139 tok/s | $0.188/M |
| #53 | Qwen3 1.7B (Reasoning) | Alibaba | $0.110/M | 136.7 tok/s | $0.398/M |
| #54 | Qwen3 4B (Non-reasoning) | Alibaba | $0.110/M | 104.4 tok/s | $0.188/M |
| #55 | Qwen3 4B (Reasoning) | Alibaba | $0.110/M | 101.8 tok/s | $0.398/M |
| #56 | Phi-4 | Microsoft | $0.125/M | 41.6 tok/s | $0.219/M |
| #57 | Gemma 4 26B A4B (Reasoning) | Google | $0.130/M | n/a | $0.198/M |
| #58 | Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | $0.130/M | 83.5 tok/s | $0.198/M |
| #59 | Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | $0.130/M | 78.6 tok/s | $0.198/M |
| #60 | DeepSeek V4 Flash (Reasoning, High Effort) | DeepSeek | $0.140/M | n/a | $0.175/M |
| #61 | DeepSeek V4 Flash (Reasoning, Max Effort) | DeepSeek | $0.140/M | 77.4 tok/s | $0.175/M |
| #62 | Ling-flash-2.0 | InclusionAI | $0.140/M | 87.3 tok/s | $0.247/M |
| #63 | Ring-flash-2.0 | InclusionAI | $0.140/M | 91 tok/s | $0.247/M |
| #64 | Gemini 2.0 Flash (Feb '25) | Google | $0.150/M | n/a | $0.263/M |
| #65 | GPT-4o mini | OpenAI | $0.150/M | 59.9 tok/s | $0.263/M |
| #66 | gpt-oss-120B (high) | OpenAI | $0.150/M | 212.3 tok/s | $0.263/M |
| #67 | gpt-oss-120B (low) | OpenAI | $0.150/M | 216.3 tok/s | $0.263/M |
| #68 | Llama 3.2 Instruct 3B | Meta | $0.150/M | 52.2 tok/s | $0.150/M |
| #69 | Ministral 3 8B | Mistral | $0.150/M | 157.6 tok/s | $0.150/M |
| #70 | Mistral Small 4 (Non-reasoning) | Mistral | $0.150/M | 139.5 tok/s | $0.263/M |
| #71 | Mistral Small 4 (Reasoning) | Mistral | $0.150/M | 149.5 tok/s | $0.263/M |
| #72 | Solar Mini | Upstage | $0.150/M | 41.7 tok/s | $0.150/M |
| #73 | GLM-4.5-Air | Z AI | $0.170/M | 72.9 tok/s | $0.372/M |
| #74 | Llama 4 Scout | Meta | $0.170/M | 109.2 tok/s | $0.292/M |
| #75 | Qwen3 8B (Non-reasoning) | Alibaba | $0.180/M | 89 tok/s | $0.310/M |
| #76 | Qwen3 8B (Reasoning) | Alibaba | $0.180/M | 87.9 tok/s | $0.660/M |
| #77 | Qwen3 VL 8B (Reasoning) | Alibaba | $0.180/M | 132.7 tok/s | $0.660/M |
| #78 | Qwen3 VL 8B Instruct | Alibaba | $0.180/M | 143.4 tok/s | $0.310/M |
| #79 | GPT-5.4 nano (medium) | OpenAI | $0.200/M | 153.4 tok/s | $0.463/M |
| #80 | GPT-5.4 nano (Non-Reasoning) | OpenAI | $0.200/M | 148.5 tok/s | $0.463/M |
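The leaderboard lists a blended price next to each input price but does not say how the blend is computed. The sketch below assumes a 3:1 input-to-output token weighting, which is how Artificial Analysis has described its blended price; treat that ratio as an assumption rather than something this page confirms. Under that assumption, the output price implied by a row can be backed out from its two listed figures. `parse_price` and `implied_output_price` are hypothetical helper names.

```python
def parse_price(text: str) -> float:
    """Parse a listing such as '$0.138/M' into USD per 1M tokens."""
    return float(text.strip().replace("$", "").replace("/M", ""))


def implied_output_price(
    input_price: float,
    blended_price: float,
    input_weight: int = 3,   # assumed weighting, not stated on this page
    output_weight: int = 1,  # assumed weighting, not stated on this page
) -> float:
    """Invert blended = (w_in * input + w_out * output) / (w_in + w_out)."""
    total_weight = input_weight + output_weight
    return (blended_price * total_weight - input_price * input_weight) / output_weight


# Row #13, GPT-5 nano (high): input $0.050/M, blended $0.138/M.
gpt5_nano_output = implied_output_price(parse_price("$0.050/M"), parse_price("$0.138/M"))

# Row #35, Llama 3.1 Instruct 8B: input and blended are both $0.100/M.
llama_output = implied_output_price(parse_price("$0.100/M"), parse_price("$0.100/M"))

print(f"{gpt5_nano_output:.3f} {llama_output:.3f}")
```

Because the displayed blended prices are rounded to three decimals, the implied output price is approximate: about $0.40/M for the GPT-5 nano row, and equal to the input price for rows where the two listed figures match.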