Synthszr Charts — die großen AI-Marken im Wettkampf ums Podium

DeepSeek R1

#2 v Reasoning modely

deepseek · od 2025-01-20 · 81× · naposledy 03. 7. 2026

Momentum

DeepSeek R1 is an open-source reasoning model by DeepSeek, released on January 20, 2025. It is built on a Mixture-of-Experts (MoE) architecture with 671 billion total parameters, of which only approximately 37 billion are active per inference pass. The model uses reinforcement learning (RL) with chain-of-thought training and achieves benchmark performance comparable to OpenAI o1 in mathematics, coding, and reasoning tasks. It is released under the MIT license and is available both via the DeepSeek API and as open-weight model for self-hosting.

Vývoj momenta

04.04.03.07.

Vlastnosti

Key Benchmark (%)	AIME 2024: 79.8% (Pass@1) · MATH-500: 97.3% (Pass@1) · MMLU: 90.8% (Pass@1) — from the official technical paper (arxiv.org/abs/2501.12948). Update R1-0528 (May 2025): AIME 2025: 87.5%
Context Window (Tokens)	128,000 tokens (128K) – per technical report and model card; updated version R1-0528 approx. 130K tokens
License	MIT License (model weights & API outputs; commercial use, modifications, and distillation permitted)
Multimodality	No – DeepSeek R1 supports text input only (no image, audio, or video input). For multimodal tasks, DeepSeek offers a separate model line (Janus series).
Platform	DeepSeek API (api.deepseek.com, model=deepseek-reasoner) · DeepSeek Chat (chat.deepseek.com) · Open-source weights on Hugging Face & GitHub (self-hosting via vLLM, llama.cpp, Ollama, among others)
Price	API (DeepSeek official, at release): $0.14/1M input tokens (cache hit) · $0.55/1M input tokens (cache miss) · $2.19/1M output tokens
Price per 1M Tokens	Input (cache miss): $0.55 · Input (cache hit): $0.14 · Output: $2.19 (official DeepSeek API pricing at launch; deepseek-reasoner alias)
Release Date	January 20, 2025

DeepSeek R1

Vlastnosti

Zdroje (60)

Další produkty v této kategorii: Reasoning modely

Subscribe free. Unsubscribe the second it sucks.