Gemini 2.5 Flash-Lite is a balanced, low-latency model with configurable thinking budgets and tool connectivity (e.g., Google Search grounding and code execution). It supports multimodal input and offers a 1M-token context window.
Gemini 2.5 thinking modelsRank #376 across 526
Rank #367 across 436
Rank #58 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 12.7 | #376 |
| Artificial Analysis Coding Index | coding | 7.4 | #367 |
| Artificial Analysis Math Index | math | 35.3 | #173 |
| MMLU-Pro | reasoning | 72.4% | #205 |
| reasoning |
| 47.4% |
| #373 |
| Humanity's Last Exam | reasoning | 3.7% | #460 |
| LiveCodeBench | coding | 40.0% | #182 |
| SciCode | coding, reasoning | 17.7% | #402 |
| MATH-500 | math | 92.6% | #61 |
| AIME | math | 50.0% | #59 |
| Output Speed | speed | 229.5 tok/s | #19 |
| Time to First Token | speed | 0.36s | #15 |
| Blended Price | cost | $0.175/M | #58 |
| Input Price | cost | $0.100/M | #40 |
| Output Price | cost | $0.400/M | #59 |
| Value Index | cost, overall | 72.6 | #90 |