Google

Gemini 2.5 Flash-Lite (Non-reasoning)

Gemini 2.5 Flash-Lite is a balanced, low-latency model with configurable thinking budgets and tool connectivity (e.g., Google Search grounding and code execution). It supports multimodal input and offers a 1M-token context window.

Gemini 2.5 thinking models

TypeLanguage

ReleaseJun 17, 2025

Context1,048,576

Tagsfile-input, reasoning, tool-use, vision

6.9

n/a

$0.175/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Max Output65,535

Streaming speed is not measured for this model yet.

Pricing

Blended$0.175/M

Input$0.100/M

Output$0.400/M

Catalog$0.100/M in / $0.400/M out

Metadata

Modalities

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	6.9	#468
Artificial Analysis Math Index	math	35.3	#173
GPQA	reasoning	47.4%	#423
Humanity's Last Exam	reasoning	3.7%	#511
LiveCodeBench

Gemini 2.5 Flash-Lite (Non-reasoning)

Model Snapshot

Overall

Coding

Blended Price

Strength Profile

Benchmark Percentiles

Model Details

Runtime

Pricing

Metadata

All Benchmarks