LLM Releases

Lab release history

Last updated Apr 24, 2026

DeepSeek model releases

Chinese lab shipping permissively licensed frontier-class models. This page collects the lab's model releases, lifecycle events, source links, and model metadata in one crawlable record.

18
Models
1
Labs
17
Open
4
Recent

18 models

DeepSeek V4-Flash

Preview
DeepSeekOpen source

Efficient V4 companion model with 284B total / 13B active parameters and the same one-million-token context window.

MoE284B1M ctxApr 24, 2026

DeepSeek V4-Pro

Preview
DeepSeekFrontierOpen source

Preview-series sparse MoE flagship with a one-million-token context window and 1.6T total / 49B active parameters.

MoE1.6T1M ctxApr 24, 2026

DeepSeek-V3.2

Available
DeepSeekFrontierOpen source

Reasoning-first agent model that adds DeepSeek Sparse Attention and thinking directly inside tool-use workflows.

MoE685B128K ctxDec 1, 2025

DeepSeek-V3.2-Speciale

Available
DeepSeekFrontierOpen source

High-compute reasoning variant of V3.2, positioned for olympiad-level math, programming, and other deep reasoning tasks.

MoE685B128K ctxDec 1, 2025

DeepSeek-V3.2-Exp

Preview
DeepSeekOpen source

Experimental checkpoint that introduced DeepSeek Sparse Attention as an efficiency bridge between V3.1-Terminus and V3.2.

MoE685B128K ctxSep 29, 2025

DeepSeek-V3.1-Terminus

Available
DeepSeekOpen source

Stability update to V3.1 focused on language consistency, code-agent reliability, and search-agent behavior.

MoE685B128K ctxSep 22, 2025

DeepSeek-V3.1

Available
DeepSeekOpen source

Hybrid thinking/non-thinking release that upgraded tool calling, long-context training, and agent task performance.

MoE671B128K ctxAug 21, 2025

DeepSeek-R1-0528

Available
DeepSeekFrontierOpen source

Major R1 reasoning update with stronger math, programming, general logic, function calling, and reduced hallucinations.

MoE671B128K ctxMay 28, 2025

DeepSeek-V3-0324

Available
DeepSeekOpen source

Post-R1 V3 update with improved reasoning, front-end coding, Chinese writing, search, and function calling.

MoE671B128K ctxMar 25, 2025

DeepSeek-R1

Available
DeepSeekFrontierOpen source

Breakout open reasoning model trained with large-scale reinforcement learning and released with weights under MIT.

MoE671B128K ctxJan 20, 2025

DeepSeek-V3

Available
DeepSeekOpen source

The 671B/37B-active MoE release that made DeepSeek a central open-model lab before the R1 breakthrough.

MoE671B128K ctxDec 26, 2024

DeepSeek-R1-Lite-Preview

Retired
DeepSeekProprietary

Reasoning-preview model exposed in DeepSeek Chat ahead of the open DeepSeek-R1 release.

Undisc. ctxNov 20, 2024

DeepSeek-V2.5

Available
DeepSeekOpen source

Unified DeepSeek V2 generation combining general-chat and coding strengths before the V3 series.

MoE236B128K ctxSep 5, 2024

DeepSeek-Coder-V2

Available
DeepSeekOpen source

Open code-focused MoE built from DeepSeek-V2, expanding programming-language coverage and coding benchmark performance.

MoE236B128K ctxJun 17, 2024

DeepSeek-V2

Available
DeepSeekOpen source

DeepSeek's first major MoE general model with Multi-head Latent Attention and low-cost API positioning.

MoE236B128K ctxMay 7, 2024

DeepSeekMoE 16B

Available
DeepSeekOpen source

Early DeepSeek sparse MoE research model that foreshadowed the later V2/V3 architecture direction.

MoE16B4K ctxJan 11, 2024

DeepSeek LLM 67B

Available
DeepSeekOpen source

First general DeepSeek language model family, with 7B and 67B base/chat checkpoints.

Dense67B4K ctxNov 29, 2023

DeepSeek Coder 33B

Available
DeepSeekOpen source

DeepSeek's first public code-model family, released before the general DeepSeek LLM line.

Dense33B16K ctxNov 2, 2023