

Kimi K2 Thinking
#20 in Reasoning models · moonshot-ai · thinking · since 2025-11-06 · 8× · most recently 30 June 2026
Momentum: 17
Kimi K2 Thinking is Moonshot AI's reasoning model released in November 2025, built on the Kimi K2 base (MoE, 1 trillion total parameters, 32 billion active per forward pass) and post-trained for multi-step reasoning with dynamic tool use. The model employs native INT4 Quantization-Aware Training (QAT), delivering approximately 2× faster inference and ~50% less GPU memory usage compared to FP16. It supports a 256k-token context window and is released as an open-weight model under a modified MIT license.
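The parameter counts above can be turned into quick back-of-the-envelope numbers. The sketch below is illustrative only: it computes the fraction of parameters active per forward pass and a weights-only storage estimate, under the simplifying assumption that every parameter is stored at the stated bit width. That assumption, along with the helper name `weight_bytes` and the constants, is not from the model's documentation. Real deployments also hold KV cache, activations, and possibly some tensors at higher precision, so these figures are not expected to reproduce the end-to-end ~50% memory saving quoted above.

```python
# Rough, weights-only arithmetic for a 1T-parameter MoE with 32B active parameters.
# Assumption (not from the source): every parameter is stored at the same bit width.

TOTAL_PARAMS = 1_000_000_000_000   # 1 trillion total parameters
ACTIVE_PARAMS = 32_000_000_000     # 32 billion active per forward pass


def weight_bytes(n_params: int, bits_per_param: int) -> int:
    """Bytes needed to store n_params weights at bits_per_param bits each."""
    return n_params * bits_per_param // 8


# Share of the model's parameters used for any single forward pass.
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS

# Weights-only footprint in gigabytes (1 GB = 1e9 bytes) at two precisions.
fp16_gb = weight_bytes(TOTAL_PARAMS, 16) / 1e9
int4_gb = weight_bytes(TOTAL_PARAMS, 4) / 1e9

if __name__ == "__main__":
    print(f"active fraction: {active_fraction:.1%}")
    print(f"FP16 weights:    {fp16_gb:.0f} GB")
    print(f"INT4 weights:    {int4_gb:.0f} GB")
```

Only about 3% of the parameters participate in each forward pass, which is why per-token compute is far lower than the total parameter count suggests, while the full set of weights still has to be resident in memory.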
Momentum over time (chart; x-axis from 04.04. to 03.07.)
Properties
| Context Window (Tokens) | 256,000 tokens (256k); Max. output: 16,384 tokens (per Amazon Bedrock). Benchmarks were conducted at 256k context length. |