EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Google

Gemini 2.5 Flash-Lite (Non-reasoning)

Gemini 2.5 Flash-Lite (Non-reasoning) is a Google Gemini 2.5 model, part of the Gemini generation that emphasized thinking, long-context work, multimodal reasoning, and developer access through AI Studio and Vertex AI. The profile lets you compare the specific Pro, Flash, or Flash-Lite variant on measured quality, speed, and cost.

Gemini 2.5 thinking models

Operational Metrics

Output Speed239.9 tok/s
First Token0.46s
Blended Price$0.175/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseJun 17, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: Speed

Rank #11 across 293 models.

239.9 tok/s

Strength: Input $

Rank #29 across 325 models.

$0.100/M

Strength: TTFT

Rank #36 across 293 models.

0.46s

Watch Area: HLE

Rank #434 across 474 models.

3.7%

Watch Area: Coding

Rank #345 across 410 models.

7.4

Watch Area: SciCode

Rank #380 across 472 models.

17.7%

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall12.7#367
Artificial Analysis Coding Indexcoding7.4#345
Artificial Analysis Math Indexmath35.3#173
MMLU-Proreasoning72.4%#205
GPQA
reasoning
47.4%
#351
Humanity's Last Examreasoning3.7%#434
LiveCodeBenchcoding40.0%#182
SciCodecoding, reasoning17.7%#380
MATH-500math92.6%#61
AIMEmath50.0%#59
Output Speedspeed239.9 tok/s#11
Time to First Tokenspeed0.46s#36
Blended Pricecost$0.175/M#47
Input Pricecost$0.100/M#29
Output Pricecost$0.400/M#46
Value Indexcost, overall72.6#76