

DeepSeek-V2
#54 in Open-Source-Spraakmodelledeepseek · v2 · siet Mai 2024 · 2× · tolest 30. Juni 2026
7
Momentum
DeepSeek-V2 is a language model by DeepSeek. The product was developed in May 2024.
Momentum-Verloop
04.04.03.07.
Features
| Benchmark Score (MMLU/Similar) | MMLU (5-shot): 78.5% (DeepSeek-V2 Base); Chat variant: 78.1% MMLU per DeepSeek-Coder-V2 paper |
| Inference Speed | Generation throughput >50,000 tokens/s (on 1 node with 8× H800 GPUs, FP8 precision); prompt input throughput >100,000 tokens/s; equals 5.76× the throughput of DeepSeek 67B |
| Context Window | 128,000 tokens |
| Model Size (Parameters) | 236B total parameters (MoE); 21B activated parameters per token |
| Price Tier | API (at release): approx. $0.14/M input tokens and $0.28/M output tokens; open-weights model available free of charge (DeepSeek License Agreement, commercial use permitted) |
| Memory Requirement | Full model (BF16): at least 8× 80 GB GPUs recommended (e.g., 8× H800/H100); with 4-bit quantization approx. 136 GB VRAM (multi-GPU required) |