LLM Releases

Laguna XS 2.1

Available
Poolside · Open source

Poolside's open-weight small coding model: a Mixture-of-Experts with 33B total and 3B active parameters, built for agentic coding and long-horizon work on a local machine and served at 256K context. It is an upgrade to Laguna XS.2 on the same architecture that, per Poolside, lifts SWE-bench Multilingual by 5.4 points to 63.1% and improves terminal-style tasks. Each checkpoint ships with an open-weight DFlash speculator (draft) model that roughly doubles local tokens/sec, and checkpoints are offered in BF16, FP8, INT4, and NVFP4 precisions. The model is supported in vLLM, SGLang, TensorRT-LLM, HF transformers, and Ollama, with llama.cpp support coming. It is newly relicensed under the fully permissive OpenMDW-1.1. The weights are free on Hugging Face and the model is reachable through a free OpenRouter tier; paid API pricing is $0.10 / $0.20 / $0.05 per 1M input / output / cache-read tokens. Its predecessor, Laguna XS.2, sunsets on Poolside's API one week after launch.
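The per-token pricing above can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost` is hypothetical, and it assumes cache-read tokens are counted separately from regular input tokens and billed only at the cache-read rate, which the listing does not confirm.

```python
# Vendor-listed prices in USD per 1M tokens, copied from the listing above.
PRICE_PER_MILLION = {"input": 0.10, "output": 0.20, "cache_read": 0.05}


def estimate_cost(input_tokens: int, output_tokens: int, cache_read_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not confirmed by the listing): cache-read tokens are
    passed separately from input_tokens and billed at the cache-read rate.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
    }
    # Each token class costs (tokens / 1,000,000) * price per million.
    return sum(usage[kind] * PRICE_PER_MILLION[kind] for kind in usage) / 1_000_000


if __name__ == "__main__":
    # Example: 200K fresh input tokens, 20K output tokens, 100K cached tokens.
    cost = estimate_cost(200_000, 20_000, cache_read_tokens=100_000)
    print(f"${cost:.4f}")
```

Under these assumptions, an agentic session that reuses a large cached prefix pays half the input rate on the cached portion, so the cache-read share dominates the savings on long contexts.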

Specifications

License
Open source · OpenMDW-1.1
Weights
Downloadable
Architecture
Mixture-of-Experts
Parameters
33B · 3B active
Context window
256K tokens
Max output
Knowledge cutoff
Price (in / out, $/M)
$0.10 / $0.20
Modalities
Text · Code

Benchmarks

No benchmark scores recorded yet.

Vendor-reported figures are claims until independently verified. See methodology.