Easy Benchmarks: LLM model index

OpenAI

GPT-5.1 Codex (high)

GPT-5.1 Codex (high) is an OpenAI model in the GPT-5 family, positioned as stronger at reasoning, coding, factuality, and agentic work than earlier GPT generations. This profile keeps the release context separate from the local Artificial Analysis measurements, so that this specific reasoning tier can be compared on benchmark quality, speed, and cost.

Introducing GPT-5

Operational Metrics

Output Speed: 162.7 tok/s
First Token: 3.82s
Blended Price: $3.44/M

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Nov 13, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date

Strength: Math

Rank #8 across 269 models.

95.7

Strength: LCB (LiveCodeBench)

Rank #15 across 343 models.

84.9%

Strength: MMLU-Pro

Rank #23 across 345 models.

86.0%

Watch Area: Output Price

Rank #270 across 325 models.

$10.00/M

Watch Area: Blended Price

Rank #260 across 325 models.

$3.44/M

Watch Area: TTFT (Time to First Token)

Rank #234 across 293 models.

3.82s

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.
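
The page does not say how a rank is converted into a percentile score. The following is a minimal sketch, assuming a simple linear mapping in which rank 1 scores 100 and the last rank scores 0. The function name `rank_to_percentile` and the mapping itself are illustrative assumptions, not the site's confirmed method. Because cost ranks are already inverted (lower prices rank higher), the same mapping would apply to them unchanged. The ranks and model counts are the ones shown in the cards above.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Map a 1-based rank among `total` models to a 0-100 percentile.

    Assumption (not confirmed by the page): a linear mapping where
    rank 1 scores 100 and the last rank scores 0.
    """
    if total < 1 or not 1 <= rank <= total:
        raise ValueError("rank must satisfy 1 <= rank <= total")
    if total == 1:
        return 100.0
    return 100.0 * (1.0 - (rank - 1) / (total - 1))


# (rank, number of ranked models) copied from the cards on this page.
cards = {
    "Math": (8, 269),
    "LCB": (15, 343),
    "MMLU-Pro": (23, 345),
    "Output $": (270, 325),
    "Blended $": (260, 325),
    "TTFT": (234, 293),
}

for name, (rank, total) in cards.items():
    print(f"{name}: {rank_to_percentile(rank, total):.1f}")
```

Under this assumed mapping the three strengths land in the top few percent, while the three watch areas fall in roughly the bottom fifth.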

All Benchmarks

Metric                                  Domain             Value        Rank
Artificial Analysis Intelligence Index  overall            43.1         #52
Artificial Analysis Coding Index        coding             36.6         #67
Artificial Analysis Math Index          math               95.7         #8
MMLU-Pro                                reasoning          86.0%        #23
GPQA                                    reasoning          86.0%        #43
Humanity's Last Exam                    reasoning          23.4%        #53
LiveCodeBench                           coding             84.9%        #15
SciCode                                 coding, reasoning  40.2%        #94
Output Speed                            speed              162.7 tok/s  #43
Time to First Token                     speed              3.82s        #234
Blended Price                           cost               $3.44/M      #260
Input Price                             cost               $1.25/M      #241
Output Price                            cost               $10.00/M     #270
Value Index                             cost, overall      12.5         #224
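
The listed blended price is consistent with a weighted average of the input and output prices at a 3:1 input-to-output ratio. The page does not state that ratio, so treat it as an assumption inferred from the numbers. The sketch below, with a hypothetical helper named `blended_price`, reproduces the figure from the two prices in the table.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,   # assumed ratio, inferred from the listed prices
    output_weight: float = 1.0,
) -> float:
    """Weighted average of input and output prices, in USD per million tokens."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Input and output prices copied from the table on this page.
price = blended_price(1.25, 10.00)
print(f"${price:.2f}/M")  # prints $3.44/M
```

With these weights, the $10.00/M output price contributes $2.50 of the $3.44 blended figure, which is why the model ranks poorly on cost despite a moderate input price.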