Easy Benchmarks
Workspace
Overview
Benchmarks
Benchmarks list
Compare
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
LLMs
Audio
Image
Video
Feedback
Log inSign up
Back

Input Price

Input token price per 1M tokens.

Input Price measures the listed cost of sending prompts and context to a model. It matters most for retrieval, agent, and long-context workflows where input tokens dominate spend.

Test type: Provider input-token price comparison. Lower values rank better.

Coverage

357 models have this metric.

$0.010/M

Current leader: Qwen3.5 0.8B (Non-reasoning)

Project links

Prices come from Artificial Analysis pricing data in the committed snapshot.

Artificial Analysis methodology

Top Input $ Models

Top models ranked by Input $.

Leaderboard

RankModelCreatorValueSpeedBlended Price
#1Qwen3.5 0.8B (Non-reasoning)Alibaba$0.010/M88 tok/s$0.020/M
#2
Qwen3.5 0.8B (Reasoning)
Alibaba
$0.010/M
n/a
$0.020/M
#3Gemma 3n E4B InstructGoogle$0.020/M50 tok/s$0.025/M
#4Qwen3.5 2B (Non-reasoning)Alibaba$0.020/M318.9 tok/s$0.040/M
#5Qwen3.5 2B (Reasoning)Alibaba$0.020/Mn/a$0.040/M
#6Sarvam 30B (high)Sarvam$0.026/M147.9 tok/s$0.047/M
#7Granite 3.3 8B (Non-reasoning)IBM$0.030/M400.9 tok/s$0.085/M
#8LFM2 24B A2BLiquid AI$0.030/M127.6 tok/s$0.052/M
#9Qwen3.5 4B (Non-reasoning)Alibaba$0.030/M208.9 tok/s$0.060/M
#10Qwen3.5 4B (Reasoning)Alibaba$0.030/M195.8 tok/s$0.060/M
#11Nova MicroAmazon$0.035/M284.2 tok/s$0.061/M
#12Gemma 3 4B InstructGoogle$0.040/Mn/a$0.050/M
#13NVIDIA Nemotron Nano 9B V2 (Reasoning)NVIDIA$0.040/M118 tok/s$0.070/M
#14Sarvam 105B (high)Sarvam$0.042/M100.7 tok/s$0.074/M
#15Llama 3 Instruct 8BMeta$0.045/M88.3 tok/s$0.070/M
#16GPT-5 nano (high)OpenAI$0.050/M150.4 tok/s$0.138/M
#17GPT-5 nano (medium)OpenAI$0.050/M167 tok/s$0.138/M
#18GPT-5 nano (minimal)OpenAI$0.050/M153.9 tok/s$0.138/M
#19gpt-oss-20B (high)OpenAI$0.050/M240 tok/s$0.088/M
#20Granite 4.1 8BIBM$0.050/M134.2 tok/s$0.063/M
#21Llama 2 Chat 7BMeta$0.050/M100.6 tok/s$0.100/M
#22Llama 3.2 Instruct 1BMeta$0.050/M92.9 tok/s$0.050/M
#23NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)NVIDIA$0.050/M87.3 tok/s$0.088/M
#24NVIDIA Nemotron Nano 9B V2 (Non-reasoning)NVIDIA$0.050/M133.6 tok/s$0.086/M
#25Qwen2.5 TurboAlibaba$0.050/M66.4 tok/s$0.088/M
#26NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)NVIDIA$0.055/M133.6 tok/s$0.096/M
#27gpt-oss-20B (low)OpenAI$0.060/M224.2 tok/s$0.095/M
#28Granite 4.0 H SmallIBM$0.060/M454.2 tok/s$0.107/M
#29Nova LiteAmazon$0.060/M191.7 tok/s$0.105/M
#30GLM-4.7-Flash (Non-reasoning)Z AI$0.070/M105.6 tok/s$0.153/M
#31GLM-4.7-Flash (Reasoning)Z AI$0.070/M94.1 tok/s$0.153/M
#32Mistral Small 3Mistral$0.075/M153.7 tok/s$0.104/M
#33Nemotron 3 Nano Omni 30B A3B ReasoningNVIDIA$0.075/M276.7 tok/s$0.131/M
#34Qwen3 30B A3B (Non-reasoning)Alibaba$0.080/M68.6 tok/s$0.133/M
#35Mistral Small 3.2Mistral$0.087/M127.1 tok/s$0.128/M
#36Gemma 3 12B InstructGoogle$0.090/Mn/a$0.140/M
#37Qwen3 30B A3B (Reasoning)Alibaba$0.090/M68.5 tok/s$0.180/M
#38Apertus 8B InstructSwiss AI Initiative$0.100/Mn/a$0.125/M
#39Devstral Small (Jul '25)Mistral$0.100/M42.3 tok/s$0.150/M
#40Gemini 2.5 Flash-Lite (Non-reasoning)Google$0.100/M229.5 tok/s$0.175/M
#41Gemini 2.5 Flash-Lite (Reasoning)Google$0.100/M265.2 tok/s$0.175/M
#42Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)Google$0.100/Mn/a$0.175/M
#43Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)Google$0.100/Mn/a$0.175/M
#44Gemma 4 12B (Reasoning)Google$0.100/M158.6 tok/s$0.150/M
#45GPT-4.1 nanoOpenAI$0.100/M118.2 tok/s$0.175/M
#46Ling 2.6 FlashInclusionAI$0.100/Mn/a$0.150/M
#47Llama 3.1 Instruct 8BMeta$0.100/M201.5 tok/s$0.100/M
#48Llama Nemotron Super 49B v1.5 (Non-reasoning)NVIDIA$0.100/M43.9 tok/s$0.175/M
#49Llama Nemotron Super 49B v1.5 (Reasoning)NVIDIA$0.100/M44.2 tok/s$0.175/M
#50MiMo-V2-Flash (Feb 2026)Xiaomi$0.100/M124.9 tok/s$0.150/M
#51MiMo-V2-Flash (Non-reasoning)Xiaomi$0.100/M122.8 tok/s$0.150/M
#52MiMo-V2-Flash (Reasoning)Xiaomi$0.100/M129.5 tok/s$0.150/M
#53Ministral 3 3BMistral$0.100/M174.3 tok/s$0.100/M
#54Olmo 3 7B InstructAllen Institute for AI$0.100/Mn/a$0.125/M
#55Qwen3.5 9B (Reasoning)Alibaba$0.100/M69.4 tok/s$0.113/M
#56Qwen3.5 Omni FlashAlibaba$0.100/M224.4 tok/s$0.275/M
#57Step 3.5 FlashStepFun$0.100/M217.5 tok/s$0.150/M
#58Step 3.5 Flash 2603StepFun$0.100/M231 tok/s$0.150/M
#59Mistral Small 3.1Mistral$0.105/M158.2 tok/s$0.138/M
#60Gemma 3 27B InstructGoogle$0.110/Mn/a$0.145/M
#61Qwen3 0.6B (Non-reasoning)Alibaba$0.110/Mn/a$0.188/M
#62Qwen3 0.6B (Reasoning)Alibaba$0.110/Mn/a$0.398/M
#63Qwen3 1.7B (Non-reasoning)Alibaba$0.110/Mn/a$0.188/M
#64Qwen3 1.7B (Reasoning)Alibaba$0.110/Mn/a$0.398/M
#65Qwen3 4B (Non-reasoning)Alibaba$0.110/Mn/a$0.188/M
#66Qwen3 4B (Reasoning)Alibaba$0.110/Mn/a$0.398/M
#67Qwen3 8B (Reasoning)Alibaba$0.110/M62.5 tok/s$0.370/M
#68TEHy3-preview (Non-reasoning)Tencent$0.123/M83.9 tok/s$0.200/M
#69TEHy3-preview (Reasoning)Tencent$0.123/M96 tok/s$0.200/M
#70Phi-4Microsoft$0.125/M36.4 tok/s$0.219/M
#71Gemma 4 26B A4B (Non-reasoning)Google$0.130/M48 tok/s$0.198/M
#72Gemma 4 26B A4B (Reasoning)Google$0.130/Mn/a$0.198/M
#73Hermes 4 - Llama-3.1 70B (Non-reasoning)Nous Research$0.130/M84.9 tok/s$0.198/M
#74Hermes 4 - Llama-3.1 70B (Reasoning)Nous Research$0.130/M87.2 tok/s$0.198/M
#75DeepSeek V4 Flash (Non-reasoning)DeepSeek$0.140/M97.4 tok/s$0.175/M
#76DeepSeek V4 Flash (Reasoning, High Effort)DeepSeek$0.140/Mn/a$0.175/M
#77DeepSeek V4 Flash (Reasoning, Max Effort)DeepSeek$0.140/M98.3 tok/s$0.175/M
#78Gemma 4 31B (Non-reasoning)Google$0.140/M54.9 tok/s$0.205/M
#79Ling-flash-2.0InclusionAI$0.140/M72.3 tok/s$0.247/M
#80MiMo-V2.5Xiaomi$0.140/M77.4 tok/s$0.175/M