Your brand here — Reach our audience of professional directory owners and boost your sales.

AgiBot GO-1

AgiBot · 智元机器人

Embodied / robotics model — runs on robot hardware, not a token API. No per-1M pricing. For API-priced LLMs see the model catalog.

At a glance

Creator	AgiBot (智元机器人)
Architecture	Vision-Language-Latent-Action (ViLLA)
Embodiment	Generalist (cross-embodiment: arms, humanoid)
Released	2025-03-10

Overview

GO-1 is AgiBot's generalist embodied foundation model, notable for pioneering the ViLLA (Vision-Language-Latent-Action) architecture — an evolution of standard VLA. It combines a Vision-Language Model, a Latent Planner trained on cross-embodiment and human-operation data, and an Action Expert (MoE), with latent action tokens quantized via VQ-VAE. A lighter GO-1-Air variant is also published. Open weights are on Hugging Face.

What it's used for

Cross-embodiment manipulation research; teams adapting a generalist base to their own robot arms or humanoid platforms.

Primary sources

Hugging Face: https://huggingface.co/agibot-world/GO-1
GitHub: https://github.com/OpenDriveLab/AgiBot-World
Official site: https://www.agibot.com/

AgiBot GO-1

At a glance

Overview

What it's used for

Primary sources

Other Chinese embodied models