Synthszr Charts — die großen AI-Marken im Wettkampf ums Podium
synthszr charts

Llada Instruct

#41 in Open-Source-Spraakmodelle

unknown · siet 2025-02-14 · 2× · tolest 29. Juni 2026

12
Momentum

LLaDA-8B-Instruct (Large Language Diffusion with mAsking) is an 8-billion-parameter language model developed by the GSAI-ML group (Renmin University of China), trained entirely from scratch without building on any existing autoregressive model. It employs a masked diffusion architecture: during pre-training, tokens are randomly masked and the model learns to iteratively reconstruct them. After supervised fine-tuning (SFT) on 4.5 million pairs, LLaDA-8B-Instruct exhibits instruction-following capabilities comparable to LLaMA3 8B Instruct, but without reinforcement learning. The model is released under the MIT license and available on Hugging Face.

Momentum-Verloop
04.04.03.07.

Features

Benchmark Score (MMLU/Similar)MMLU (5-shot): 65.9 (Base); LLaDA 8B Instruct: GSM8K 69.4 / MATH 31.9 / GPQA 33.3 / HumanEval 49.4 / MBPP 41.0 (per official paper, Tab. 2)
Model Size (Parameters)8 billion parameters (8B), trained on 2.3 trillion tokens; SFT on 4.5 million pairs
Price TierFree / Open Source (MIT license); model weights freely available on Hugging Face (GSAI-ML/LLaDA-8B-Instruct)

Belege (2)

Mehr Produkten in disse Kategorie: Open-Source-Spraakmodelle

Subscribe free. Unsubscribe the second it sucks.

High-signal news across AI, business, UX, and tech. Every morning.