Easy Benchmarks
Workspace
Overview
Benchmarks
Benchmarks list
Compare
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
LLMs
Audio
Image
Video
Feedback
Log inSign up

Easy Benchmarks

LLM Performance Index

Compare current LLMs by benchmark, latency, speed, and price.

A compact analysis surface for choosing the right model with evidence, clear tradeoffs, and grounded assistant answers.

Source

Artificial Analysis free LLM benchmark API. Attribution required by source terms.

artificialanalysis.ai
Best Overall
61.4
#1
Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
Models Tracked
703
78 creators
Snapshot Jun 8, 2026, 5:36 AM
Best Overall
100
domain
Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
Best Value
525
758.7 tok/s
Qwen3.5 0.8B (Reasoning)

Explore Models

Ranking by Artificial Analysis Intelligence Index. Search narrows this leaderboard.

Percentile
523 matching models on Overall

Top 10 by Overall

Artificial Analysis aggregate LLM intelligence score.

Overall

1 metrics

General leaderboard using the Artificial Analysis Intelligence Index.

Claude Opus 4.8 (Adaptive Reasoning, Max Effort)100

Coding

3 metrics

Coding leaderboard from coding-specific benchmark metrics.

GPT-5.5 (xhigh)100

Math

3 metrics

Math leaderboard from math-specific benchmark metrics.

GPT-5.2 (xhigh)100

Reasoning & Knowledge

4 metrics

Reasoning leaderboard across GPQA, HLE, MMLU-Pro, and related metrics.

Gemini 3.1 Pro Preview100

Speed

2 metrics

Runtime leaderboard using output speed and latency.

Qwen3.5 2B (Non-reasoning)99

Price & Value

4 metrics

Cost leaderboard using input, output, blended price, and value index.

Qwen3.5 0.8B (Non-reasoning)100