Google

Gemini 2.5 Flash-Lite (Reasoning)

Gemini 2.5 Flash-Lite is a balanced, low-latency model with configurable thinking budgets and tool connectivity (e.g., Google Search grounding and code execution). It supports multimodal input and offers a 1M-token context window.

Gemini 2.5 thinking models

Type: Language

Release: Jun 17, 2025

Context: 1,048,576 tokens

Tags: file-input, reasoning, tool-use, vision

Overall: 11.4

Coding: n/a

Blended Price: $0.175/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Max Output: 65,535 tokens

Streaming speed is not measured for this model yet.
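
As a rough illustration of how the context window and maximum output figures interact, the sketch below checks whether a prompt leaves room for a reply. It makes a conservative assumption that this page does not confirm: that the prompt and the reserved output must together fit inside the 1,048,576-token context. The function names are hypothetical and not part of any Google SDK.

```python
# Limits taken from this page.
CONTEXT_WINDOW = 1_048_576  # tokens
MAX_OUTPUT = 65_535         # tokens

def max_prompt_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves `reserved_output` tokens for the reply.

    Assumption (conservative, not confirmed by the source): prompt and
    output share the same context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError(f"reserved_output must be between 0 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - reserved_output

def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Return True if a prompt of `prompt_tokens` fits under the assumption above."""
    return 0 <= prompt_tokens <= max_prompt_tokens(reserved_output)

if __name__ == "__main__":
    print(max_prompt_tokens())     # prompt budget when reserving the full output
    print(fits(900_000))           # a 900k-token prompt
    print(fits(1_040_000, 4_096))  # a larger prompt with a small reply reserved
```

If the provider counts input and output limits separately, the prompt budget would simply be the full context window; the check above errs on the side of rejecting borderline prompts.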

Pricing

Blended: $0.175/M

Input: $0.100/M

Output: $0.400/M

Catalog: $0.100/M in / $0.400/M out
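
The blended figure is consistent with a 3:1 input-to-output weighting of the listed prices: (3 × 0.100 + 1 × 0.400) / 4 = 0.175. The page does not state the weighting, so the 3:1 ratio in this sketch is an assumption that happens to reproduce the number. The helper names are hypothetical.

```python
# Per-million-token prices listed on this page (USD).
INPUT_PRICE_PER_M = 0.100
OUTPUT_PRICE_PER_M = 0.400

def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per million tokens.

    The 3:1 default is an assumption; the page does not say how the
    blended price is derived.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight

def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = INPUT_PRICE_PER_M,
                 output_price: float = OUTPUT_PRICE_PER_M) -> float:
    """Estimated USD cost of one request at the listed prices."""
    return (input_tokens / 1_000_000) * input_price + (output_tokens / 1_000_000) * output_price

if __name__ == "__main__":
    print(f"{blended_price(INPUT_PRICE_PER_M, OUTPUT_PRICE_PER_M):.3f}")  # 0.175
    print(f"{request_cost(200_000, 5_000):.4f}")  # 200k tokens in, 5k tokens out
```

For workloads whose input-to-output ratio differs much from 3:1, `request_cost` gives a more useful estimate than the blended price.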

Metadata

Modalities

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	11.4	#370
Artificial Analysis Math Index	math	53.3	#135
GPQA	reasoning	62.5%	#328
Humanity's Last Exam	reasoning	6.4%	#295
LiveCodeBench
