The model release tracker
Last updated Jun 17, 2026
Every large language model release, sourced and tracked over time.
Frontier and open-weight. Filter by access, license, size, modality, and country of origin. Each entry links to a primary source and keeps a full history of changes, deprecations, and retractions.
83 models
GLM-5.2
Available: Z.ai's latest open flagship for long-horizon coding, agentic engineering, and million-token workflows, adding IndexShare sparse-attention reuse over GLM-5.1.
MiniMax-M3
Available: Native multimodal MiniMax model with a one-million-token context and sparse attention, positioned for agentic coding and cowork use.
GPT-5.6
Preview: OpenAI's mid-2026 flagship, headlined by a 1.5M-token context window and long-horizon agentic tool use.
Claude Fable 5
Withdrawn: The public, guardrailed sibling of Mythos and Anthropic's most capable widely released model, built for long-horizon agentic work. Launched June 9, 2026 across the Claude API, AWS, and Microsoft Foundry, then pulled three days later under a US government export-control directive barring access by foreign nationals.
Nemotron 3 Ultra 550B-A55B
Available: NVIDIA's largest Nemotron 3 open-weight hybrid Mamba-Transformer MoE, tuned for agentic reasoning, coding, planning, and tool calling.
Claude Opus 4.8
Available: Anthropic's most capable model, with strengthened agentic and long-running task performance.
MiniMax-M2.7
Available: Open-weight agentic model from MiniMax focused on real-world software engineering, office tasks, tool use, and self-improving training workflows.
Gemini 3.5 Pro
Preview: Announced at Google I/O 2026; emphasizes deep multimodal reasoning over a 2M-token context.
Qwen3.6-27B
Available: Dense 27B model that performs well above its size class on agentic coding and is easy to self-host on a single GPU node.
Grok 4.3
Available: xAI's agentic flagship with a 1M-token context and aggressive API pricing.
DeepSeek V4-Flash
Preview: Efficient V4 companion model with 284B total / 13B active parameters and the same one-million-token context window.
DeepSeek V4-Pro
Preview: Preview-series sparse MoE flagship with a one-million-token context window and 1.6T total / 49B active parameters.
Hunyuan-A13B-Instruct
Available: Tencent Hunyuan open-weight fine-grained MoE model with 80B total parameters and 13B active parameters, optimized for agentic tool use.
GLM-5.1
Available: Z.ai's agentic-engineering follow-up to GLM-5, with stronger coding performance and better long-horizon tool-use behavior.
Claude Mythos
Preview: A frontier model Anthropic disclosed on April 7, 2026 but declined to release publicly, citing security risk. Shipped only via 'Project Glasswing' to ~50 defensive-security partners, then suspended on June 12, 2026 under a US government directive.
Gemma 4 31B
Available: Google DeepMind's Gemma 4 advanced-reasoning open model for personal computers, part of the April 2026 Gemma 4 family.
Kimi K2.6
Available: Moonshot's open native multimodal agentic model for long-horizon coding, visual interface generation, and autonomous tool orchestration.
Mistral Medium 3.5
Available: Dense 128B open-weight model with a 256K context and strong coding performance for its size.
Nemotron 3 Super 120B-A12B
Available: Open-weight hybrid Mamba-Transformer MoE designed for collaborative agents and high-volume enterprise workflows.
Step-3.5-Flash
Available: StepFun's Apache-licensed sparse MoE model for fast agentic execution, coding, math, browsing, and tool-use workflows.
Sarvam-105B
Available: Apache-licensed Indian-context MoE from Sarvam AI, optimized for reasoning, coding, agentic tasks, and 22 Indian languages.
GPT-5.4
Available: Workhorse GPT-5 release with a dedicated Thinking mode; widely deployed across ChatGPT and the API.
Qwen3.5-397B
Available: Native vision-language MoE supporting 201 languages with a 1M-token context.
Gemini 3.1 Pro
Available: Generally available multimodal flagship with native tool use and a 2M-token context.
GLM-5
Available: Z.ai flagship for complex systems engineering and long-horizon agentic tasks, scaling the GLM line to 744B total / 40B active parameters.
Claude Opus 4.6
Available: Introduced genuinely autonomous multi-file coding and stronger computer use.
Qwen3-Coder-Next
Available: Apache-licensed Qwen3-Next coding-agent model with 80B total / 3B active parameters, 256K context, and long-horizon tool-use training.
GLM-4.7
Available: Coding-focused GLM release with improved multilingual agentic coding, terminal tasks, tool use, and interface generation.
OLMo 3 Think 32B
Available: Ai2's fully open thinking model with public weights, code, data, checkpoints, and training details across the OLMo 3 pipeline.
Nemotron 3 Nano 30B-A3B
Available: Efficient Nemotron 3 MoE checkpoint for agentic reasoning and coding, activating about 3B parameters while supporting 1M-token contexts.
GLM-4.6V
Available: Open 106B-class vision-language model with native multimodal function calling for visual agents.
Mistral Large 3
Available: Mistral's largest open-weight MoE, aimed at frontier reasoning while remaining self-hostable.
DeepSeek-V3.2
Available: Reasoning-first agent model that adds DeepSeek Sparse Attention and thinking directly inside tool-use workflows.
DeepSeek-V3.2-Speciale
Available: High-compute reasoning variant of V3.2, positioned for olympiad-level math, programming, and other deep reasoning tasks.
GLM-4.6
Available: Agentic reasoning and coding upgrade over GLM-4.5, expanding the text context window from 128K to 200K tokens.
DeepSeek-V3.2-Exp
Preview: Experimental checkpoint that introduced DeepSeek Sparse Attention as an efficiency bridge between V3.1-Terminus and V3.2.
DeepSeek-V3.1-Terminus
Available: Stability update to V3.1 focused on language consistency, code-agent reliability, and search-agent behavior.
Gemma 3 27B
Available: Google's open multimodal model, with a 128K context and support for 140+ languages; runs on a single GPU.
DeepSeek-V3.1
Available: Hybrid thinking/non-thinking release that upgraded tool calling, long-context training, and agent task performance.
Seed-OSS-36B-Instruct
Available: ByteDance Seed's Apache-licensed long-context reasoning and agent model, with controllable thinking budgets and a native 512K context.
GLM-4.5V
Available: Vision-language GLM based on GLM-4.5-Air, covering image, video, document, grounding, and GUI-agent tasks.
Falcon-H1 34B
Available: A hybrid attention + state-space-model (SSM) design that matches 70B-class models with fewer parameters.
GLM-4.5
Available: Open agentic, reasoning, and coding foundation model that marked Z.ai's international rebrand and its MIT-licensed GLM push.
GLM-4.5-Air
Available: Compact GLM-4.5 companion with 106B total / 12B active parameters for efficient agentic reasoning and coding.
EXAONE 4.0 32B
Available: LG AI Research's unified model with non-reasoning and reasoning modes, agentic tool use, and English, Korean, and Spanish support.
ERNIE-4.5-300B-A47B
Available: Baidu's open ERNIE 4.5 language MoE, part of a 10-variant Apache-licensed model family built with heterogeneous multimodal MoE training.
ERNIE-4.5-VL-424B-A47B
Available: Baidu's largest ERNIE 4.5 vision-language MoE, supporting text, image, and video inputs with thinking and non-thinking modes.
MiniMax-M1-80k
Available: Open Apache-licensed hybrid-attention reasoning model with 456B total / 45.9B active parameters and a native 1M-token context.
DeepSeek-R1-0528
Available: Major R1 reasoning update with stronger math, programming, general logic, function calling, and reduced hallucinations.
Llama 4 Maverick
Available: Meta's flagship open-weight MoE; highest MMLU among open models at release.
Llama 4 Scout
Available: Efficient open-weight MoE designed for very long context on modest hardware.
DeepSeek-V3-0324
Available: Post-R1 V3 update with improved reasoning, front-end coding, Chinese writing, search, and function calling.
OLMo 2 32B
Available: A fully open model, with weights, data, and training code all public, and the first such model to beat GPT-3.5 and GPT-4o mini.
Command A
Available: Enterprise-grade model tuned for RAG, tool use, and multilingual business workloads.
Claude 3.7 Sonnet
Retired: Anthropic's first hybrid-reasoning Sonnet. Shut down May 11, 2026 as the 4.x line matured.
Mistral Small 3
Available: A latency-optimized 24B dense model under Apache-2.0 and a popular local-deployment workhorse.
DeepSeek-R1
Available: Breakout open reasoning model trained with large-scale reinforcement learning and released with weights under MIT.
DeepSeek-V3
Available: The 671B/37B-active MoE release that made DeepSeek a central open-model lab before the R1 breakthrough.
Granite 3.1 8B
Available: IBM's enterprise-focused open model with a 128K context, Apache-2.0 licensed.
Falcon 3 10B
Available: Open model from the UAE's TII, designed to run on light infrastructure, including laptops.
Command R7B
Available: Cohere's smallest, fastest R-series model, tuned for RAG and tool use on modest hardware.
Phi-4
Available: A 14B dense model that rivals far larger ones on math and reasoning, under a permissive MIT license.
Amazon Nova Pro
Available: AWS-native multimodal model with a 300K context; size and architecture undisclosed.
Hunyuan-Large
Available: Tencent's 389B total / 52B active open-weight Transformer MoE, released with a 256K pretraining context and 128K instruct context.
Yi-Lightning
Available: 01.AI's MoE API model that reached the global top-10 on Chatbot Arena, strong in Chinese, math, and coding.
Jamba 1.5 Large
Available: Hybrid Mamba-Transformer MoE from Israel's AI21, with a 256K context and strong long-document throughput.
GPT-4o
Retired: The 2024 omni-modal model that defined a generation of assistants. Deprecated in Feb 2026 and fully retired across ChatGPT on April 3, 2026.
Mixtral 8x7B
Available: The open sparse Mixture-of-Experts that brought MoE efficiency to the open ecosystem.
Gemini 1.0 Ultra
Deprecated: Google's first natively multimodal Gemini flagship, since superseded by the 1.5/2/3 lines.
Qwen-72B
Available: Alibaba's first major open Qwen model and the start of a prolific open-weight line.
Yi-34B
Available: 01.AI's strong bilingual open model, with a 200K-context variant.
Mistral 7B
Available: The 7B that punched far above its weight and put Mistral on the map.
Falcon 180B
Available: At launch the largest openly available model, from the UAE's TII.
Llama 2 70B
Available: The release that made capable open-weight models genuinely usable for production.
Claude 2
Retired: Anthropic's first widely available Claude, notable for an early 100K-token context window.
GPT-4
Deprecated: The model that brought reliable multi-step reasoning to the mainstream; size never disclosed.
LLaMA
Available: Meta's first LLaMA, released to researchers; its leak catalyzed the open-weight movement.
Galactica
Withdrawn: A science-focused model whose public demo was withdrawn after just three days over confidently wrong outputs; an early, instructive retraction.
BLOOM
Available: An open, multilingual 176B model (46 languages) from a global research collaboration.
PaLM
Retired: Google's 540B Pathways model; the API was later deprecated in favor of Gemini.
GPT-3
Retired: The 175B model that proved in-context learning at scale; its base API models were retired in 2024.
GPT-2
Available: Initially withheld over misuse fears, then fully released in Nov 2019; the episode sparked an early 'limited release' debate.
BERT
Available: The bidirectional encoder that reshaped NLP and seeded the transformer era.