DeepSeek

DeepSeek V3.1 (Non-reasoning)

DeepSeek-V3.1 is post-trained on the top of DeepSeek-V3.1-Base, which is built upon the original V3 base checkpoint through a two-phase long context extension approach, following the methodology outlined in the original DeepSeek-V3 report. We have expanded our dataset by collecting additional long documents and substantially extending both training phases. The 32K extension phase has been increased 10-fold to 630B tokens, while the 128K extension phase has been extended by 3.3x to 209B tokens. Additionally, DeepSeek-V3.1 is trained using the UE8M0 FP8 scale data format to ensure compatibility with microscaling data formats.

DeepSeek release notes

TypeLanguage

ReleaseAug 21, 2025

Context163,840

Tagsimplicit-caching, reasoning, tool-use

21.0

n/a

$0.840/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Max Output32,768

Streaming speed is not measured for this model yet.

Pricing

Blended$0.840/M

Input$0.560/M

Output$1.68/M

Catalog$0.250/M in / $0.950/M out

Metadata

Modalities

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	21.0	#236
Artificial Analysis Math Index	math	49.7	#141
GPQA	reasoning	73.5%	#233
Humanity's Last Exam	reasoning	6.3%	#298
LiveCodeBench

DeepSeek V3.1 (Non-reasoning)

Model Snapshot

Overall

Coding

Blended Price

Strength Profile

Benchmark Percentiles

Model Details

Runtime

Pricing

Metadata

All Benchmarks