EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

OpenAI

GPT-5.3 Codex (xhigh)

GPT-5.3 Codex (xhigh) is an OpenAI GPT-5 family model, positioned around stronger reasoning, coding, factuality, and agentic work than earlier GPT generations. This profile keeps the release context separate from the local Artificial Analysis measurements so you can compare the specific reasoning tier on benchmark quality, speed, and cost.

Introducing GPT-5

Operational Metrics

Output Speed87.1 tok/s
First Token69.23s
Blended Price$4.81/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseFeb 5, 2026
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: GPQA

Rank #6 across 478 models.

91.5%

Strength: HLE

Rank #6 across 474 models.

39.9%

Strength: Coding

Rank #7 across 410 models.

53.1

Watch Area: TTFT

Rank #287 across 293 models.

69.23s

Watch Area: Output $

Rank #283 across 325 models.

$14.00/M

Watch Area: Blended $

Rank #281 across 325 models.

$4.81/M

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall53.6#9
Artificial Analysis Coding Indexcoding53.1#7
GPQAreasoning91.5%#6
Humanity's Last Examreasoning39.9%#6
SciCode
coding, reasoning
53.2%
#10
Output Speedspeed87.1 tok/s#144
Time to First Tokenspeed69.23s#287
Blended Pricecost$4.81/M#281
Input Pricecost$1.75/M#260
Output Pricecost$14.00/M#283
Value Indexcost, overall11.1#236