Price per 1M tokens using the Artificial Analysis 3:1 blended ratio.
Blended Price is a cost metric for comparing providers with different input and output token prices. Artificial Analysis calculates it with a 3:1 input-to-output token ratio so a single number can summarize typical API cost.
Test type: Provider price normalization. Lower values rank better.
357 models have this metric.
Current leader: Qwen3.5 0.8B (Non-reasoning)
Project links
Prices come from Artificial Analysis pricing data in the committed snapshot.
Top models ranked by Blended $.
| Rank | Model | Creator | Value | Speed | Blended Price |
|---|---|---|---|---|---|
| #1 | Qwen3.5 0.8B (Non-reasoning) | Alibaba | $0.020/M | 88 tok/s | $0.020/M |
| #2 |
| Alibaba |
| $0.020/M |
| n/a |
| $0.020/M |
| #3 | Gemma 3n E4B Instruct | $0.025/M | 50 tok/s | $0.025/M |
| #4 | Qwen3.5 2B (Non-reasoning) | Alibaba | $0.040/M | 318.9 tok/s | $0.040/M |
| #5 | Qwen3.5 2B (Reasoning) | Alibaba | $0.040/M | n/a | $0.040/M |
| #6 | Sarvam 30B (high) | Sarvam | $0.047/M | 147.9 tok/s | $0.047/M |
| #7 | Gemma 3 4B Instruct | $0.050/M | n/a | $0.050/M |
| #8 | Llama 3.2 Instruct 1B | Meta | $0.050/M | 92.9 tok/s | $0.050/M |
| #9 | LFM2 24B A2B | Liquid AI | $0.052/M | 127.6 tok/s | $0.052/M |
| #10 | Qwen3.5 4B (Non-reasoning) | Alibaba | $0.060/M | 208.9 tok/s | $0.060/M |
| #11 | Qwen3.5 4B (Reasoning) | Alibaba | $0.060/M | 195.8 tok/s | $0.060/M |
| #12 | Nova Micro | Amazon | $0.061/M | 284.2 tok/s | $0.061/M |
| #13 | Granite 4.1 8B | IBM | $0.063/M | 134.2 tok/s | $0.063/M |
| #14 | Llama 3 Instruct 8B | Meta | $0.070/M | 88.3 tok/s | $0.070/M |
| #15 | NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.070/M | 118 tok/s | $0.070/M |
| #16 | Sarvam 105B (high) | Sarvam | $0.074/M | 100.7 tok/s | $0.074/M |
| #17 | Granite 3.3 8B (Non-reasoning) | IBM | $0.085/M | 400.9 tok/s | $0.085/M |
| #18 | NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | $0.086/M | 133.6 tok/s | $0.086/M |
| #19 | gpt-oss-20B (high) | OpenAI | $0.088/M | 240 tok/s | $0.088/M |
| #20 | NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | $0.088/M | 87.3 tok/s | $0.088/M |
| #21 | Qwen2.5 Turbo | Alibaba | $0.088/M | 66.4 tok/s | $0.088/M |
| #22 | gpt-oss-20B (low) | OpenAI | $0.095/M | 224.2 tok/s | $0.095/M |
| #23 | NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | $0.096/M | 133.6 tok/s | $0.096/M |
| #24 | Llama 2 Chat 7B | Meta | $0.100/M | 100.6 tok/s | $0.100/M |
| #25 | Llama 3.1 Instruct 8B | Meta | $0.100/M | 201.5 tok/s | $0.100/M |
| #26 | Ministral 3 3B | Mistral | $0.100/M | 174.3 tok/s | $0.100/M |
| #27 | Mistral Small 3 | Mistral | $0.104/M | 153.7 tok/s | $0.104/M |
| #28 | Nova Lite | Amazon | $0.105/M | 191.7 tok/s | $0.105/M |
| #29 | Granite 4.0 H Small | IBM | $0.107/M | 454.2 tok/s | $0.107/M |
| #30 | Qwen3.5 9B (Reasoning) | Alibaba | $0.113/M | 69.4 tok/s | $0.113/M |
| #31 | Apertus 8B Instruct | Swiss AI Initiative | $0.125/M | n/a | $0.125/M |
| #32 | Olmo 3 7B Instruct | Allen Institute for AI | $0.125/M | n/a | $0.125/M |
| #33 | Mistral Small 3.2 | Mistral | $0.128/M | 127.1 tok/s | $0.128/M |
| #34 | Nemotron 3 Nano Omni 30B A3B Reasoning | NVIDIA | $0.131/M | 276.7 tok/s | $0.131/M |
| #35 | Qwen3 30B A3B (Non-reasoning) | Alibaba | $0.133/M | 68.6 tok/s | $0.133/M |
| #36 | GPT-5 nano (high) | OpenAI | $0.138/M | 150.4 tok/s | $0.138/M |
| #37 | GPT-5 nano (medium) | OpenAI | $0.138/M | 167 tok/s | $0.138/M |
| #38 | GPT-5 nano (minimal) | OpenAI | $0.138/M | 153.9 tok/s | $0.138/M |
| #39 | Mistral Small 3.1 | Mistral | $0.138/M | 158.2 tok/s | $0.138/M |
| #40 | Gemma 3 12B Instruct | $0.140/M | n/a | $0.140/M |
| #41 | Gemma 3 27B Instruct | $0.145/M | n/a | $0.145/M |
| #42 | Devstral Small (Jul '25) | Mistral | $0.150/M | 42.3 tok/s | $0.150/M |
| #43 | Gemma 4 12B (Reasoning) | $0.150/M | 158.6 tok/s | $0.150/M |
| #44 | Ling 2.6 Flash | InclusionAI | $0.150/M | n/a | $0.150/M |
| #45 | Llama 3.2 Instruct 3B | Meta | $0.150/M | 52.3 tok/s | $0.150/M |
| #46 | MiMo-V2-Flash (Feb 2026) | Xiaomi | $0.150/M | 124.9 tok/s | $0.150/M |
| #47 | MiMo-V2-Flash (Non-reasoning) | Xiaomi | $0.150/M | 122.8 tok/s | $0.150/M |
| #48 | MiMo-V2-Flash (Reasoning) | Xiaomi | $0.150/M | 129.5 tok/s | $0.150/M |
| #49 | Ministral 3 8B | Mistral | $0.150/M | 103.6 tok/s | $0.150/M |
| #50 | Solar Mini | Upstage | $0.150/M | 75.9 tok/s | $0.150/M |
| #51 | Step 3.5 Flash | StepFun | $0.150/M | 217.5 tok/s | $0.150/M |
| #52 | Step 3.5 Flash 2603 | StepFun | $0.150/M | 231 tok/s | $0.150/M |
| #53 | GLM-4.7-Flash (Non-reasoning) | Z AI | $0.153/M | 105.6 tok/s | $0.153/M |
| #54 | GLM-4.7-Flash (Reasoning) | Z AI | $0.153/M | 94.1 tok/s | $0.153/M |
| #55 | DeepSeek V4 Flash (Non-reasoning) | DeepSeek | $0.175/M | 97.4 tok/s | $0.175/M |
| #56 | DeepSeek V4 Flash (Reasoning, High Effort) | DeepSeek | $0.175/M | n/a | $0.175/M |
| #57 | DeepSeek V4 Flash (Reasoning, Max Effort) | DeepSeek | $0.175/M | 98.3 tok/s | $0.175/M |
| #58 | Gemini 2.5 Flash-Lite (Non-reasoning) | $0.175/M | 229.5 tok/s | $0.175/M |
| #59 | Gemini 2.5 Flash-Lite (Reasoning) | $0.175/M | 265.2 tok/s | $0.175/M |
| #60 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | $0.175/M | n/a | $0.175/M |
| #61 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | $0.175/M | n/a | $0.175/M |
| #62 | GPT-4.1 nano | OpenAI | $0.175/M | 118.2 tok/s | $0.175/M |
| #63 | Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | $0.175/M | 43.9 tok/s | $0.175/M |
| #64 | Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | $0.175/M | 44.2 tok/s | $0.175/M |
| #65 | MiMo-V2.5 | Xiaomi | $0.175/M | 77.4 tok/s | $0.175/M |
| #66 | Qwen3 30B A3B (Reasoning) | Alibaba | $0.180/M | 68.5 tok/s | $0.180/M |
| #67 | Qwen3 8B (Non-reasoning) | Alibaba | $0.185/M | 64.4 tok/s | $0.185/M |
| #68 | Qwen3 0.6B (Non-reasoning) | Alibaba | $0.188/M | n/a | $0.188/M |
| #69 | Qwen3 1.7B (Non-reasoning) | Alibaba | $0.188/M | n/a | $0.188/M |
| #70 | Qwen3 4B (Non-reasoning) | Alibaba | $0.188/M | n/a | $0.188/M |
| #71 | Gemma 4 26B A4B (Non-reasoning) | $0.198/M | 48 tok/s | $0.198/M |
| #72 | Gemma 4 26B A4B (Reasoning) | $0.198/M | n/a | $0.198/M |
| #73 | Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | $0.198/M | 84.9 tok/s | $0.198/M |
| #74 | Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | $0.198/M | 87.2 tok/s | $0.198/M |
| #75 | Hy3-preview (Non-reasoning) | Tencent | $0.200/M | 83.9 tok/s | $0.200/M |
| #76 | Hy3-preview (Reasoning) | Tencent | $0.200/M | 96 tok/s | $0.200/M |
| #77 | Ministral 3 14B | Mistral | $0.200/M | 106.9 tok/s | $0.200/M |
| #78 | Gemma 4 31B (Non-reasoning) | $0.205/M | 54.9 tok/s | $0.205/M |
| #79 | Mistral 7B Instruct | Mistral | $0.206/M | 110.4 tok/s | $0.206/M |
| #80 | Qwen3 30B A3B 2507 Instruct | Alibaba | $0.213/M | 105.2 tok/s | $0.213/M |