Synthszr Charts — die großen AI-Marken im Wettkampf ums Podium
synthszr charts
deepseek

DeepSeek-V2

#54 in Open-Source-Spraakmodelle

deepseek · v2 · siet Mai 2024 · 2× · tolest 30. Juni 2026

7
Momentum

DeepSeek-V2 is a language model by DeepSeek. The product was developed in May 2024.

Momentum-Verloop
04.04.03.07.

Features

Benchmark Score (MMLU/Similar)MMLU (5-shot): 78.5% (DeepSeek-V2 Base); Chat variant: 78.1% MMLU per DeepSeek-Coder-V2 paper
Inference SpeedGeneration throughput >50,000 tokens/s (on 1 node with 8× H800 GPUs, FP8 precision); prompt input throughput >100,000 tokens/s; equals 5.76× the throughput of DeepSeek 67B
Context Window128,000 tokens
Model Size (Parameters)236B total parameters (MoE); 21B activated parameters per token
Price TierAPI (at release): approx. $0.14/M input tokens and $0.28/M output tokens; open-weights model available free of charge (DeepSeek License Agreement, commercial use permitted)
Memory RequirementFull model (BF16): at least 8× 80 GB GPUs recommended (e.g., 8× H800/H100); with 4-bit quantization approx. 136 GB VRAM (multi-GPU required)

Belege (2)

Mehr Produkten in disse Kategorie: Open-Source-Spraakmodelle

Subscribe free. Unsubscribe the second it sucks.

High-signal news across AI, business, UX, and tech. Every morning.