

SubQ
#2 in LLM Inference & Servingsubquadratic · since 2026-05-05 · 16× · last seen Jun 30, 2026
38
Momentum
SubQ (SubQ 1M-Preview) is the first commercial large language model by Miami-based Subquadratic, built on a fully sub-quadratic architecture called Subquadratic Sparse Attention (SSA). SSA replaces O(n²) dense attention with content-dependent dynamic token selection, achieving compute that scales linearly with context length. The production version offers a 1-million-token context window; the research model has been tested at up to 12 million tokens. At launch on May 5, 2026, three products entered private beta: SubQ API, SubQ Code (CLI coding agent), and SubQ Search.
Momentum trend
04.04.03.07.
Features
| Protocol Compatibility | OpenAI-compatible API endpoints (HTTP). Drop-in replacement for existing OpenAI/Anthropic client libraries without SDK changes. Supports streaming and tool use. |
| Release Date | May 5, 2026 (launch from stealth, private beta via subq.ai) |