Your brand here — Reach our audience of professional directory owners and boost your sales.

Unitree UnifoLM-VLA

Unitree Robotics · 宇树科技

Embodied / robotics model — runs on robot hardware, not a token API. No per-1M pricing. For API-priced LLMs see the model catalog.

At a glance

Creator	Unitree Robotics (宇树科技)
Architecture	Vision-Language-Action (VLA)
Embodiment	Humanoid + manipulation
License	CC BY-NC-SA 4.0

Overview

UnifoLM-VLA is Unitree's open vision-language-action model, built to give humanoid robots physically grounded embodied intelligence. Published with a VLM base (UnifoLM-VLM-Base), a VLA base, and a LIBERO-benchmark variant. Released under CC BY-NC-SA 4.0 (non-commercial). Pairs naturally with Unitree's own humanoid hardware but the weights are open for research.

What it's used for

Humanoid robot control research; benchmarking VLA policies on LIBERO; teams already on Unitree hardware.

Primary sources

Hugging Face: https://huggingface.co/unitreerobotics/UnifoLM-VLA-Base
GitHub: https://github.com/unitreerobotics/unifolm-vla
Official site: https://www.unitree.com/

Unitree UnifoLM-VLA

At a glance

Overview

What it's used for

Primary sources

Other Chinese embodied models