LLM Releases

The model release tracker

Last updated Jun 17, 2026

Every large language model release, sourced and tracked over time.

Frontier and open-weight. Filter by access, license, size, modality, and country of origin. Each entry links to a primary source and keeps a full history of changes, deprecations, and retractions.

83
Models
28
Frontier
65
Open
8
Countries
7
New · 30d
7
Retired

83 models

GLM-5.2

Available
Z.ai (Zhipu AI)FrontierOpen source

Z.ai's latest open flagship for long-horizon coding, agentic engineering, and million-token workflows, adding IndexShare sparse-attention reuse over GLM-5.1.

MoE753B1M ctxJun 17, 2026

MiniMax-M3

Available
MiniMaxFrontierOpen weights

Native multimodal MiniMax model with a one-million-token context, sparse attention, and agentic coding/cowork positioning.

MoE428B1M ctxJun 16, 2026

GPT-5.6

Preview
OpenAIFrontierProprietary

OpenAI's mid-2026 flagship, headlined by an industry-leading 1.5M-token context window and long-horizon agentic tool use.

MoEUndisc.1.5M ctxJun 9, 2026

Claude Fable 5

Withdrawn
AnthropicFrontierProprietary

The public, guardrailed sibling of Mythos and Anthropic's most capable widely-released model, built for long-horizon agentic work. Launched June 9, 2026 across the Claude API, AWS, and Microsoft Foundry — then pulled three days later under a US government export-control directive barring access by foreign nationals.

Undisc. ctxJun 9, 2026

Nemotron 3 Ultra 550B-A55B

Available
NVIDIAFrontierOpen weights

NVIDIA's largest Nemotron 3 open-weight hybrid Mamba-Transformer MoE, tuned for agentic reasoning, coding, planning, and tool calling.

Hybrid550B1M ctxJun 4, 2026

Claude Opus 4.8

Available
AnthropicFrontierProprietary

Anthropic's most capable model, with strengthened agentic and long-running task performance.

Undisc.500K ctxMay 28, 2026

MiniMax-M2.7

Available
MiniMaxFrontierOpen weights

Open-weight agentic model from MiniMax focused on real-world software engineering, office tasks, tool use, and self-improving training workflows.

MoE229.9B ctxMay 26, 2026

Gemini 3.5 Pro

Preview
Google DeepMindFrontierProprietary

Announced at Google I/O 2026; emphasizes deep multimodal reasoning over a 2M-token context.

MoEUndisc.2M ctxMay 19, 2026

Qwen3.6-27B

Available
Alibaba (Qwen)Open source

Dense 27B that punches far above its weight on agentic coding — easy to self-host on a single GPU node.

Dense27B256K ctxMay 12, 2026

Grok 4.3

Available
xAIFrontierProprietary

xAI's agentic flagship with a 1M-token context and aggressive API pricing.

MoEUndisc.1M ctxMay 6, 2026

DeepSeek V4-Flash

Preview
DeepSeekOpen source

Efficient V4 companion model with 284B total / 13B active parameters and the same one-million-token context window.

MoE284B1M ctxApr 24, 2026

DeepSeek V4-Pro

Preview
DeepSeekFrontierOpen source

Preview-series sparse MoE flagship with a one-million-token context window and 1.6T total / 49B active parameters.

MoE1.6T1M ctxApr 24, 2026

Hunyuan-A13B-Instruct

Available
Tencent HunyuanOpen weights

Tencent Hunyuan open-weight fine-grained MoE model with 80B total parameters and 13B active parameters, optimized for agentic tool use.

MoE80B ctxApr 22, 2026

GLM-5.1

Available
Z.ai (Zhipu AI)FrontierOpen source

Z.ai agentic-engineering follow-up to GLM-5, with stronger coding performance and better long-horizon tool-use behavior.

MoE754B ctxApr 8, 2026

Claude Mythos

Preview
AnthropicFrontierProprietary

A frontier model Anthropic disclosed on April 7, 2026 but declined to release publicly, citing security risk. Shipped only via 'Project Glasswing' to ~50 defensive-security partners, then suspended on June 12, 2026 under a US government directive.

Undisc. ctxApr 7, 2026

Gemma 4 31B

Available
Google DeepMindOpen source

Google DeepMind's Gemma 4 advanced-reasoning open model for personal computers, part of the April 2026 Gemma 4 family.

Dense31B ctxApr 2, 2026

Kimi K2.6

Available
Moonshot AIOpen weights

Moonshot's open native multimodal agentic model for long-horizon coding, visual interface generation, and autonomous tool orchestration.

MoE1T256K ctxMar 30, 2026

Mistral Medium 3.5

Available
Mistral AIOpen weights

Dense 128B open-weight model with a 256k context and strong coding performance for its size.

Dense128B256K ctxMar 18, 2026

Nemotron 3 Super 120B-A12B

Available
NVIDIAFrontierOpen weights

Open-weight hybrid Mamba-Transformer MoE designed for collaborative agents and high-volume enterprise workflows.

Hybrid120B1M ctxMar 16, 2026

Step-3.5-Flash

Available
StepFunOpen source

StepFun's Apache-licensed sparse MoE model for fast agentic execution, coding, math, browsing, and tool-use workflows.

MoE196B256K ctxMar 14, 2026

Sarvam-105B

Available
Sarvam AIOpen source

Apache-licensed Indian-context MoE from Sarvam AI, optimized for reasoning, coding, agentic tasks, and 22 Indian languages.

MoE105B128K ctxMar 6, 2026

GPT-5.4

Available
OpenAIFrontierProprietary

Workhorse GPT-5 release with a dedicated Thinking mode; widely deployed across ChatGPT and the API.

MoEUndisc.400K ctxMar 5, 2026

Qwen3.5-397B

Available
Alibaba (Qwen)FrontierOpen source

Native vision-language MoE supporting 201 languages with a 1M-token context.

MoE397B1M ctxFeb 20, 2026

Gemini 3.1 Pro

Available
Google DeepMindFrontierProprietary

Generally available multimodal flagship with native tool use and a 2M-token context.

MoEUndisc.2M ctxFeb 19, 2026

GLM-5

Available
Z.ai (Zhipu AI)FrontierOpen source

Z.ai flagship for complex systems engineering and long-horizon agentic tasks, scaling the GLM line to 744B total / 40B active parameters.

MoE744B ctxFeb 11, 2026

Claude Opus 4.6

Available
AnthropicFrontierProprietary

Introduced genuinely autonomous multi-file coding and stronger computer use.

Undisc.200K ctxFeb 5, 2026

Qwen3-Coder-Next

Available
Alibaba (Qwen)Open source

Apache-licensed Qwen3-Next coding-agent model with 80B total / 3B active parameters, 256K context, and long-horizon tool-use training.

Hybrid80B262K ctxFeb 3, 2026

GLM-4.7

Available
Z.ai (Zhipu AI)FrontierOpen source

Coding-focused GLM release with improved multilingual agentic coding, terminal tasks, tool use, and interface generation.

MoE358B ctxJan 8, 2026

OLMo 3 Think 32B

Available
Allen Institute for AI (Ai2)Open source

Ai2's fully open thinking model with public weights, code, data, checkpoints, and training details across the OLMo 3 pipeline.

Dense32B ctxDec 15, 2025

Nemotron 3 Nano 30B-A3B

Available
NVIDIAOpen weights

Efficient Nemotron 3 MoE checkpoint for agentic reasoning and coding, activating about 3B parameters while supporting 1M-token contexts.

Hybrid30B1M ctxDec 15, 2025

GLM-4.6V

Available
Z.ai (Zhipu AI)Open source

Open 106B-class vision-language model with native multimodal function calling for visual agents.

MoE106B128K ctxDec 8, 2025

Mistral Large 3

Available
Mistral AIFrontierOpen weights

Mistral's largest open-weight MoE, aimed at frontier reasoning while remaining self-hostable.

MoE675B256K ctxDec 2, 2025

DeepSeek-V3.2

Available
DeepSeekFrontierOpen source

Reasoning-first agent model that adds DeepSeek Sparse Attention and thinking directly inside tool-use workflows.

MoE685B128K ctxDec 1, 2025

DeepSeek-V3.2-Speciale

Available
DeepSeekFrontierOpen source

High-compute reasoning variant of V3.2, positioned for olympiad-level math, programming, and other deep reasoning tasks.

MoE685B128K ctxDec 1, 2025

GLM-4.6

Available
Z.ai (Zhipu AI)FrontierOpen source

Agentic reasoning and coding upgrade over GLM-4.5, expanding the text context window from 128K to 200K tokens.

MoE357B200K ctxSep 30, 2025

DeepSeek-V3.2-Exp

Preview
DeepSeekOpen source

Experimental checkpoint that introduced DeepSeek Sparse Attention as an efficiency bridge between V3.1-Terminus and V3.2.

MoE685B128K ctxSep 29, 2025

DeepSeek-V3.1-Terminus

Available
DeepSeekOpen source

Stability update to V3.1 focused on language consistency, code-agent reliability, and search-agent behavior.

MoE685B128K ctxSep 22, 2025

Gemma 3 27B

Available
Google DeepMindOpen weights

Google's open multimodal model: 128k context, 140+ languages, runs on a single GPU.

Dense27B128K ctxSep 4, 2025

DeepSeek-V3.1

Available
DeepSeekOpen source

Hybrid thinking/non-thinking release that upgraded tool calling, long-context training, and agent task performance.

MoE671B128K ctxAug 21, 2025

Seed-OSS-36B-Instruct

Available
ByteDance SeedOpen source

ByteDance Seed's Apache-licensed long-context reasoning and agent model, with controllable thinking budgets and a native 512K context.

Dense36B512K ctxAug 20, 2025

GLM-4.5V

Available
Z.ai (Zhipu AI)Open source

Vision-language GLM based on GLM-4.5-Air, covering image, video, document, grounding, and GUI-agent tasks.

MoE106B ctxAug 11, 2025

Falcon-H1 34B

Available
Technology Innovation InstituteOpen weights

A hybrid attention + state-space-model (SSM) design that matches 70B-class models with fewer parameters.

Hybrid34B256K ctxJul 31, 2025

GLM-4.5

Available
Z.ai (Zhipu AI)FrontierOpen source

Open agentic, reasoning, and coding foundation model that marked Z.ai international rebrand and MIT-licensed GLM push.

MoE355B128K ctxJul 28, 2025

GLM-4.5-Air

Available
Z.ai (Zhipu AI)Open source

Compact GLM-4.5 companion with 106B total / 12B active parameters for efficient agentic reasoning and coding.

MoE106B128K ctxJul 28, 2025

EXAONE 4.0 32B

Available
LG AI ResearchOpen weights

LG AI Research's unified model with non-reasoning and reasoning modes, agentic tool use, and English, Korean, and Spanish support.

Dense32B ctxJul 15, 2025

ERNIE-4.5-300B-A47B

Available
BaiduOpen source

Baidu's open ERNIE 4.5 language MoE, part of a 10-variant Apache-licensed model family built with heterogeneous multimodal MoE training.

MoE300B128K ctxJun 30, 2025

ERNIE-4.5-VL-424B-A47B

Available
BaiduOpen source

Baidu's largest ERNIE 4.5 vision-language MoE, supporting text, image, and video inputs with thinking and non-thinking modes.

MoE424B128K ctxJun 30, 2025

MiniMax-M1-80k

Available
MiniMaxFrontierOpen source

Open Apache-licensed hybrid-attention reasoning model with 456B total / 45.9B active parameters and a native 1M-token context.

Hybrid456B1M ctxJun 16, 2025

DeepSeek-R1-0528

Available
DeepSeekFrontierOpen source

Major R1 reasoning update with stronger math, programming, general logic, function calling, and reduced hallucinations.

MoE671B128K ctxMay 28, 2025

Llama 4 Maverick

Available
Meta AIFrontierOpen weights

Meta's flagship open-weight MoE; highest MMLU among open models at release.

MoE400B1M ctxApr 5, 2025

Llama 4 Scout

Available
Meta AIOpen weights

Efficient open-weight MoE designed for very long context on modest hardware.

MoE109B10M ctxApr 5, 2025

DeepSeek-V3-0324

Available
DeepSeekOpen source

Post-R1 V3 update with improved reasoning, front-end coding, Chinese writing, search, and function calling.

MoE671B128K ctxMar 25, 2025

OLMo 2 32B

Available
Allen Institute for AI (Ai2)Open source

A fully open model — weights, data, and training code all public — and the first such to beat GPT-3.5 / GPT-4o mini.

Dense32B4K ctxMar 13, 2025

Command A

Available
CohereOpen weights

Enterprise-grade model tuned for RAG, tool use, and multilingual business workloads.

Dense111B256K ctxMar 13, 2025

Claude 3.7 Sonnet

Retired
AnthropicProprietary

Anthropic's first hybrid-reasoning Sonnet. Shut down May 11, 2026 as the 4.x line matured.

Undisc.200K ctxFeb 24, 2025

Mistral Small 3

Available
Mistral AIOpen source

A latency-optimized 24B dense model under Apache-2.0 — a popular local-deployment workhorse.

Dense24B32K ctxJan 30, 2025

DeepSeek-R1

Available
DeepSeekFrontierOpen source

Breakout open reasoning model trained with large-scale reinforcement learning and released with weights under MIT.

MoE671B128K ctxJan 20, 2025

DeepSeek-V3

Available
DeepSeekOpen source

The 671B/37B-active MoE release that made DeepSeek a central open-model lab before the R1 breakthrough.

MoE671B128K ctxDec 26, 2024

Granite 3.1 8B

Available
IBMOpen source

IBM's enterprise-focused open model with a 128k context, Apache-2.0 licensed.

Dense8B128K ctxDec 18, 2024

Falcon 3 10B

Available
Technology Innovation InstituteOpen weights

UAE's TII open model designed to run on light infrastructure, including laptops.

Dense10B32K ctxDec 17, 2024

Command R7B

Available
CohereOpen weights

Cohere's smallest, fastest R-series model, tuned for RAG and tool use on modest hardware.

Dense8B128K ctxDec 13, 2024

Phi-4

Available
MicrosoftOpen source

A 14B dense model that rivals far larger ones on math and reasoning, under a permissive MIT license.

Dense14B16K ctxDec 12, 2024

Amazon Nova Pro

Available
AmazonProprietary

AWS-native multimodal model with a 300k context; size and architecture undisclosed.

Undisc.300K ctxDec 3, 2024

Hunyuan-Large

Available
Tencent HunyuanOpen weights

Tencent's 389B total / 52B active open-weight Transformer MoE, released with a 256K pretraining context and 128K instruct context.

MoE389B128K ctxNov 4, 2024

Yi-Lightning

Available
01.AIProprietary

01.AI's MoE API model that reached the global top-10 on Chatbot Arena, strong in Chinese, math, and coding.

MoEUndisc. ctxOct 16, 2024

Jamba 1.5 Large

Available
AI21 LabsOpen weights

Israel's AI21 hybrid Mamba-Transformer MoE, with a 256k context and strong long-document throughput.

Hybrid398B256K ctxAug 22, 2024

GPT-4o

Retired
OpenAIProprietary

The 2024 omni-modal model that defined a generation of assistants. Deprecated in Feb 2026 and fully retired across ChatGPT on April 3, 2026.

Undisc.128K ctxMay 13, 2024

Mixtral 8x7B

Available
Mistral AIOpen source

The open sparse Mixture-of-Experts that brought MoE efficiency to the open ecosystem.

MoE47B32K ctxDec 11, 2023

Gemini 1.0 Ultra

Deprecated
Google DeepMindProprietary

Google's first natively multimodal Gemini flagship, since superseded by the 1.5/2/3 lines.

Undisc.32K ctxDec 6, 2023

Qwen-72B

Available
Alibaba (Qwen)Open weights

Alibaba's first major open Qwen model and the start of a prolific open-weight line.

Dense72B32K ctxNov 30, 2023

Yi-34B

Available
01.AIOpen weights

01.AI's strong bilingual open model, with a 200k-context variant.

Dense34B200K ctxNov 6, 2023

Mistral 7B

Available
Mistral AIOpen source

The 7B that punched far above its weight and put Mistral on the map.

Dense7B8K ctxSep 27, 2023

Falcon 180B

Available
Technology Innovation InstituteOpen weights

At launch the largest openly available model, from the UAE's TII.

Dense180B2K ctxSep 6, 2023

Llama 2 70B

Available
Meta AIOpen weights

The release that made capable open-weight models genuinely usable for production.

Dense70B4K ctxJul 18, 2023

Claude 2

Retired
AnthropicProprietary

Anthropic's first widely-available Claude, notable for an early 100k-token context window.

Undisc.100K ctxJul 11, 2023

GPT-4

Deprecated
OpenAIProprietary

The model that brought reliable multi-step reasoning to the mainstream; size never disclosed.

Undisc.8K ctxMar 14, 2023

LLaMA

Available
Meta AIOpen weights

Meta's first LLaMA, released to researchers; its leak catalyzed the open-weight movement.

Dense65B2K ctxFeb 24, 2023

Galactica

Withdrawn
Meta AIOpen weights

A science-focused model whose public demo was withdrawn after just three days over confidently wrong outputs — an early, instructive retraction.

Dense120B2K ctxNov 15, 2022

BLOOM

Available
BigScienceOpen weights

An open, multilingual 176B model (46 languages) from a global research collaboration.

Dense176B2K ctxJul 12, 2022

PaLM

Retired
Google DeepMindProprietary

Google's 540B Pathways model; the API was later deprecated in favor of Gemini.

Dense540B ctxApr 4, 2022

GPT-3

Retired
OpenAIProprietary

The 175B model that proved in-context learning at scale; its base API models were retired in 2024.

Dense175B2K ctxJun 11, 2020

GPT-2

Available
OpenAIOpen source

Initially withheld over misuse fears, then fully released in Nov 2019 — an early 'limited release' debate.

Dense1.5B1K ctxNov 5, 2019

BERT

Available
Google DeepMindOpen source

The bidirectional encoder that reshaped NLP and seeded the transformer era.

Dense0.34B512 ctxOct 11, 2018