Easy Benchmarks: LLM model index

MMLU-Pro

Knowledge and reasoning benchmark score.

MMLU-Pro extends MMLU with more challenging, reasoning-focused questions, removes trivial or noisy items, and expands multiple-choice options from four to ten. It is meant to be more discriminative for advanced language models.

Test type: Multiple-choice reasoning and knowledge benchmark across broad academic domains.
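Scoring such a benchmark reduces to exact-match accuracy over letter answers. A minimal sketch (not the official MMLU-Pro harness) assuming answers are single letters drawn from the ten options A–J:

```python
from string import ascii_uppercase

# MMLU-Pro expands the answer set to ten options, so valid letters are A-J.
OPTIONS = ascii_uppercase[:10]

def accuracy(predictions, gold):
    """Exact-match accuracy over letter answers; invalid letters count as wrong."""
    assert len(predictions) == len(gold)
    correct = sum(
        1 for p, g in zip(predictions, gold)
        if p in OPTIONS and p == g
    )
    return correct / len(gold)

print(accuracy(["A", "J", "C", "B"], ["A", "J", "D", "B"]))  # 0.75
```

Real harnesses also have to extract the letter from free-form model output before this step; that parsing is omitted here.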

Coverage

345 models have this metric.

Current leader: Gemini 3 Pro Preview (high) at 89.8%.

Project links

This app ranks the MMLU-Pro score exposed by the Artificial Analysis snapshot.

Paper · GitHub
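The ranking itself is simple: filter the snapshot to models that report the metric, then sort by score descending. A sketch assuming a hypothetical snapshot shape (the real Artificial Analysis field names may differ):

```python
# Hypothetical entries; "mmlu_pro" is an assumed field name, None = metric absent.
snapshot = [
    {"model": "Gemini 3 Pro Preview (high)", "creator": "Google", "mmlu_pro": 0.898},
    {"model": "Claude Opus 4.5 (Reasoning)", "creator": "Anthropic", "mmlu_pro": 0.895},
    {"model": "Example model without the metric", "creator": "Acme", "mmlu_pro": None},
]

# Keep only models that report the metric, then rank by score descending.
ranked = sorted(
    (m for m in snapshot if m["mmlu_pro"] is not None),
    key=lambda m: m["mmlu_pro"],
    reverse=True,
)
for rank, m in enumerate(ranked, start=1):
    print(f"#{rank} {m['model']} {m['mmlu_pro']:.1%}")
```

Ties (e.g. two models at 89.5%) need an explicit tiebreak rule; `sorted` is stable, so without one the snapshot's original order decides.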

Top MMLU-Pro Models

Top models ranked by MMLU-Pro.

Leaderboard

| Rank | Model | Creator | MMLU-Pro | Speed | Blended Price |
|------|-------|---------|----------|-------|---------------|
| #1 | Gemini 3 Pro Preview (high) | Google | 89.8% | 128.7 tok/s | $4.50/M |
| #2 | Claude Opus 4.5 (Reasoning) | Anthropic | 89.5% | 57 tok/s | $10.00/M |
| #3 | Gemini 3 Pro Preview (low) | Google | 89.5% | n/a | $4.50/M |
| #4 | Gemini 3 Flash Preview (Reasoning) | Google | 89.0% | 193.2 tok/s | $1.13/M |
| #5 | Claude Opus 4.5 (Non-reasoning) | Anthropic | 88.9% | 50.3 tok/s | $10.00/M |
| #6 | Gemini 3 Flash Preview (Non-reasoning) | Google | 88.2% | 178.3 tok/s | $1.13/M |
| #7 | Claude 4.1 Opus (Reasoning) | Anthropic | 88.0% | 35.8 tok/s | $30.00/M |
| #8 | Claude 4.5 Sonnet (Reasoning) | Anthropic | 87.5% | 43.8 tok/s | $6.00/M |
| #9 | MiniMax-M2.1 | MiniMax | 87.5% | 84.8 tok/s | $0.525/M |
| #10 | GPT-5.2 (xhigh) | OpenAI | 87.4% | 71.8 tok/s | $4.81/M |
| #11 | Claude 4 Opus (Reasoning) | Anthropic | 87.3% | 36.8 tok/s | $30.00/M |
| #12 | GPT-5 (high) | OpenAI | 87.1% | 84.2 tok/s | $3.44/M |
| #13 | GPT-5.1 (high) | OpenAI | 87.0% | 123.3 tok/s | $3.44/M |
| #14 | GPT-5 (medium) | OpenAI | 86.7% | 82.3 tok/s | $3.44/M |
| #15 | Grok 4 | xAI | 86.6% | 50.3 tok/s | $6.00/M |
| #16 | GPT-5 Codex (high) | OpenAI | 86.5% | 166.8 tok/s | $3.44/M |
| #17 | DeepSeek V3.2 Speciale | DeepSeek | 86.3% | n/a | - |
| #18 | DeepSeek V3.2 (Reasoning) | DeepSeek | 86.2% | n/a | $0.315/M |
| #19 | Gemini 2.5 Pro | Google | 86.2% | 120.2 tok/s | $3.44/M |
| #20 | Claude 4 Opus (Non-reasoning) | Anthropic | 86.0% | 36.6 tok/s | $30.00/M |
| #21 | Claude 4.5 Sonnet (Non-reasoning) | Anthropic | 86.0% | 44.2 tok/s | $6.00/M |
| #22 | GPT-5 (low) | OpenAI | 86.0% | 65.8 tok/s | $3.44/M |
| #23 | GPT-5.1 Codex (high) | OpenAI | 86.0% | 162.7 tok/s | $3.44/M |
| #24 | GPT-5.2 (medium) | OpenAI | 85.9% | n/a | $4.81/M |
| #25 | Gemini 2.5 Pro Preview (Mar '25) | Google | 85.8% | n/a | - |
| #26 | GLM-4.7 (Reasoning) | Z AI | 85.6% | 90.3 tok/s | $1.00/M |
| #27 | Doubao Seed Code | ByteDance Seed | 85.4% | n/a | - |
| #28 | Grok 4.1 Fast (Reasoning) | xAI | 85.4% | 140.9 tok/s | $0.275/M |
| #29 | o3 | OpenAI | 85.3% | 72.7 tok/s | $3.50/M |
| #30 | DeepSeek V3.1 (Reasoning) | DeepSeek | 85.1% | n/a | $0.865/M |
| #31 | DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | 85.1% | n/a | $1.91/M |
| #32 | DeepSeek V3.2 Exp (Reasoning) | DeepSeek | 85.0% | n/a | $0.315/M |
| #33 | Grok 4 Fast (Reasoning) | xAI | 85.0% | 76.2 tok/s | $0.275/M |
| #34 | Cogito v2.1 (Reasoning) | Deep Cogito | 84.9% | 51.1 tok/s | $1.25/M |
| #35 | DeepSeek R1 0528 (May '25) | DeepSeek | 84.9% | n/a | $2.36/M |
| #36 | Kimi K2 Thinking | Kimi | 84.8% | 99 tok/s | $1.08/M |
| #37 | DeepSeek R1 (Jan '25) | DeepSeek | 84.4% | n/a | $2.36/M |
| #38 | MiMo-V2-Flash (Reasoning) | Xiaomi | 84.3% | 118.8 tok/s | $0.150/M |
| #39 | Qwen3 235B A22B 2507 (Reasoning) | Alibaba | 84.3% | 56 tok/s | $2.63/M |
| #40 | Claude 4 Sonnet (Reasoning) | Anthropic | 84.2% | 50.3 tok/s | $6.00/M |
| #41 | Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | Google | 84.2% | n/a | - |
| #42 | o1 | OpenAI | 84.1% | 103.3 tok/s | $26.25/M |
| #43 | Qwen3 Max | Alibaba | 84.1% | 32.2 tok/s | $2.40/M |
| #44 | K-EXAONE (Reasoning) | LG AI Research | 83.8% | n/a | - |
| #45 | Qwen3 Max (Preview) | Alibaba | 83.8% | 45.1 tok/s | $2.40/M |
| #46 | Claude 3.7 Sonnet (Reasoning) | Anthropic | 83.7% | n/a | $6.00/M |
| #47 | Claude 4 Sonnet (Non-reasoning) | Anthropic | 83.7% | 47.6 tok/s | $6.00/M |
| #48 | DeepSeek V3.2 (Non-reasoning) | DeepSeek | 83.7% | n/a | $0.315/M |
| #49 | Gemini 2.5 Pro Preview (May '25) | Google | 83.7% | n/a | $3.44/M |
| #50 | GPT-5 mini (high) | OpenAI | 83.7% | 85.7 tok/s | $0.688/M |
| #51 | DeepSeek V3.1 Terminus (Non-reasoning) | DeepSeek | 83.6% | n/a | $0.453/M |
| #52 | DeepSeek V3.2 Exp (Non-reasoning) | DeepSeek | 83.6% | n/a | $0.315/M |
| #53 | Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) | Google | 83.6% | n/a | - |
| #54 | Qwen3 VL 235B A22B (Reasoning) | Alibaba | 83.6% | 46.2 tok/s | $2.63/M |
| #55 | GLM-4.5 (Reasoning) | Z AI | 83.5% | 46.4 tok/s | $1.00/M |
| #56 | DeepSeek V3.1 (Non-reasoning) | DeepSeek | 83.3% | n/a | $0.834/M |
| #57 | Gemini 2.5 Flash (Reasoning) | Google | 83.2% | 199.6 tok/s | $0.850/M |
| #58 | o4-mini (high) | OpenAI | 83.2% | 124.5 tok/s | $1.93/M |
| #59 | ERNIE 5.0 Thinking Preview | Baidu | 83.0% | n/a | - |
| #60 | Nova 2.0 Pro Preview (medium) | Amazon | 83.0% | 112.7 tok/s | $3.44/M |
| #61 | GLM-4.6 (Reasoning) | Z AI | 82.9% | 26.3 tok/s | $0.963/M |
| #62 | Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | 82.9% | 34.9 tok/s | $1.50/M |
| #63 | GPT-5 mini (medium) | OpenAI | 82.8% | 77.2 tok/s | $0.688/M |
| #64 | Grok 3 mini Reasoning (high) | xAI | 82.8% | 215.5 tok/s | $0.350/M |
| #65 | Qwen3 235B A22B (Reasoning) | Alibaba | 82.8% | 61.4 tok/s | $2.63/M |
| #66 | Qwen3 235B A22B 2507 Instruct | Alibaba | 82.8% | 64.7 tok/s | $1.23/M |
| #67 | Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | 82.5% | 41 tok/s | $0.900/M |
| #68 | Kimi K2 | Kimi | 82.4% | 33 tok/s | $1.04/M |
| #69 | Qwen3 Max Thinking (Preview) | Alibaba | 82.4% | 40.8 tok/s | $2.40/M |
| #70 | Qwen3 Next 80B A3B (Reasoning) | Alibaba | 82.4% | 172.2 tok/s | $1.88/M |
| #71 | Qwen3 VL 235B A22B Instruct | Alibaba | 82.3% | 49 tok/s | $1.23/M |
| #72 | INTELLECT-3 | Prime Intellect | 82.2% | n/a | - |
| #73 | Ling-1T | InclusionAI | 82.2% | n/a | - |
| #74 | Nova 2.0 Pro Preview (low) | Amazon | 82.2% | 122.6 tok/s | $3.44/M |
| #75 | GPT-5 (ChatGPT) | OpenAI | 82.0% | 149.8 tok/s | $3.44/M |
| #76 | GPT-5.1 Codex mini (high) | OpenAI | 82.0% | 207.2 tok/s | $0.688/M |
| #77 | MiniMax-M2 | MiniMax | 82.0% | 83.5 tok/s | $0.525/M |
| #78 | DeepSeek V3 0324 | DeepSeek | 81.9% | n/a | $1.25/M |
| #79 | Kimi K2 0905 | Kimi | 81.9% | 12.5 tok/s | $1.08/M |
| #80 | Qwen3 Next 80B A3B Instruct | Alibaba | 81.9% | 155.3 tok/s | $0.875/M |
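Working with rows like these requires handling the missing-value cells (`n/a` speed, `-` price). A sketch, using a hypothetical score-per-dollar ratio as an example derived metric (not the app's own "Value" index):

```python
def parse_price(cell):
    """Blended price cell like '$4.50/M' -> USD per million tokens; '-' -> None."""
    if cell == "-":
        return None
    return float(cell.lstrip("$").rstrip("/M"))

# A few rows from the leaderboard above: (model, MMLU-Pro, blended price).
rows = [
    ("Gemini 3 Pro Preview (high)", "89.8%", "$4.50/M"),
    ("DeepSeek V3.2 Speciale", "86.3%", "-"),
    ("MiniMax-M2.1", "87.5%", "$0.525/M"),
]

# Score points per dollar per million tokens; rows without a price are skipped.
for name, score, price in rows:
    p = parse_price(price)
    if p is None:
        continue
    print(f"{name}: {float(score.rstrip('%')) / p:.1f} pts/$")
```

Note that `str.rstrip("/M")` strips a *set* of trailing characters, not a literal suffix; it works here because prices never end in `/` or `M` after the unit is removed.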