Synthszr Charts — die großen AI-Marken im Wettkampf ums Podium
synthszr charts
deepseek

DeepSeek V3

#3

deepseek · v3 · seit 2024-12-26 · 42× · zuletzt 01. Juli 2026

79
Momentum

DeepSeek V3 is an open-source language model by DeepSeek, released on December 26, 2024. It is based on a Mixture-of-Experts (MoE) architecture with 671 billion total parameters, activating only 37 billion per token. The model was pre-trained on 14.8 trillion tokens and employs Multi-head Latent Attention (MLA) and FP8 training. It achieves benchmark performance comparable to leading proprietary models, particularly in mathematics, coding, and multilingual tasks.

Historique du momentum
04.04.03.07.

Fonctionnalités

Key Benchmark (%)MMLU: 88.5% | MATH-500: 90.2% | GPQA: 59.1% | Codeforces Percentile: 51.6% | SWE-Bench Verified: 42.0%
Context Window (Tokens)128,000 tokens
LicenseMIT License (code repository); DeepSeek Model License for model weights – commercial use allowed
MultimodalityNo native multimodality – text-only. DeepSeek announced multimodal support as a future feature. Separate multimodal models exist as the standalone Janus series.
PlatformDeepSeek API (platform.deepseek.com, OpenAI-compatible endpoint); self-hosting via HuggingFace, SGLang, vLLM, TensorRT-LLM, LMDeploy, AMD GPU, Huawei Ascend NPU
PriceFree (open weights, self-hosting); API access via platform.deepseek.com paid per token
Price per 1M Tokens$0.27 / 1M input tokens (cache miss), $0.07 / 1M input tokens (cache hit), $1.10 / 1M output tokens (original launch pricing)
Release DateDecember 26, 2024

Preuves (42)

Subscribe free. Unsubscribe the second it sucks.

High-signal news across AI, business, UX, and tech. Every morning.