

Starchild-1
#100 v Frontier jazykové modelyodyssey · v1 · od 2026-05-17 · 13× · naposledy 30. 6. 2026
Starchild-1 is the first real-time multimodal world model from Odyssey (odyssey.ml), autoregressively generating synchronized audio and video in real time while continuously responding to streamed user inputs (text, speech, actions). Released as a preview with no public interactive demo yet available. Technically, it relies on a causal distillation pipeline that converts a bidirectional audio-video foundation model (Ovi) into a real-time autoregressive world model, plus an asynchronous KV-cache architecture handling the differing temporal frequencies of audio and video. Target domains include gaming, robotics, education, healthcare, and defense.
Vlastnosti
| License | Closed-weights / proprietary, access only via vendor API or product |
| Multimodality | Audio + video, autoregressively generated in sync, responds to text, voice, and action inputs in real time |
| Platform | Web preview/API at Odyssey (odyssey.ml), technical report at starchild.odyssey.ml |
| Release Date | May 17, 2026 (preview) |