DeepSeek-V2

#54 in Open-Source-Spraakmodelle

deepseek · v2 · siet Mai 2024 · 2× · tolest 30. Juni 2026

Momentum

DeepSeek-V2 is a language model by DeepSeek. The product was developed in May 2024.

Momentum-Verloop

04.04.03.07.

Features

Benchmark Score (MMLU/Similar)	MMLU (5-shot): 78.5% (DeepSeek-V2 Base); Chat variant: 78.1% MMLU per DeepSeek-Coder-V2 paper
Inference Speed	Generation throughput >50,000 tokens/s (on 1 node with 8× H800 GPUs, FP8 precision); prompt input throughput >100,000 tokens/s; equals 5.76× the throughput of DeepSeek 67B
Context Window	128,000 tokens
Model Size (Parameters)	236B total parameters (MoE); 21B activated parameters per token
Price Tier	API (at release): approx. $0.14/M input tokens and $0.28/M output tokens; open-weights model available free of charge (DeepSeek License Agreement, commercial use permitted)
Memory Requirement	Full model (BF16): at least 8× 80 GB GPUs recommended (e.g., 8× H800/H100); with 4-bit quantization approx. 136 GB VRAM (multi-GPU required)