

Qwen-RobotWorld
#9alibaba · 2× · zuletzt 29. Juni 2026
34
Momentum
Qwen-RobotWorld is a language-conditioned video world model by Alibaba that uses natural language as a unified action interface. The model spans over 20 different embodiments and 500+ action categories, and was trained on a dataset of 8.6 million video-text pairs and 200+ million frames.
Historique du momentum
04.04.03.07.
Fonctionnalités
| Multimodality | Video + natural language |