Synthszr Charts — die großen AI-Marken im Wettkampf ums Podium

Qwen2.5-14B-Instruct

#56 in Open-Source-Spraakmodelle

alibaba · v2.5 · 14b instruct · siet 2024-09-19 · 2× · tolest 30. Juni 2026

Momentum

Qwen2.5-14B-Instruct is an instruction-tuned, dense language model from Alibaba Cloud's Qwen team with 14.7 billion parameters, released in September 2024 under the Apache 2.0 license. It natively supports a context length of up to 128,000 tokens and can generate up to 8,000 tokens. The model was pre-trained on 18 trillion tokens and covers over 29 languages. According to the official technical report, it performs comparably to GPT-4o-mini across several benchmarks.

Momentum-Verloop

04.04.03.07.

Features

Benchmark Score (MMLU/Similar)	MMLU: 79.7; BBH: 78.2 (Qwen official blog); MMLU-Redux: 80.0%; GSM8k: 94.8%; MATH: 80.0%; HumanEval: 83.5% (llm-stats.com)
Context Window	128,000 tokens (native support); default config.json set to 32,768 tokens; output up to 8,000 tokens
Model Size (Parameters)	14.7 billion parameters (active and total); trained on 18 trillion tokens
Memory Requirement	BF16 (full precision): approx. 29.6 GB VRAM (model weights); Q4_K_M quantization: approx. 8.7 GB; Q8_0: approx. 14.7 GB (each plus 1–2 GB KV cache overhead)

Qwen2.5-14B-Instruct

Features

Belege (2)

Mehr Produkten in disse Kategorie: Open-Source-Spraakmodelle

Subscribe free. Unsubscribe the second it sucks.