Unitree UnifoLM-VLA

Unitree Robotics · 宇树科技

Embodied / robotics model — runs on robot hardware, not a token API. No per-1M pricing. For API-priced LLMs see the model catalog.

At a glance

CreatorUnitree Robotics (宇树科技)
ArchitectureVision-Language-Action (VLA)
EmbodimentHumanoid + manipulation
LicenseCC BY-NC-SA 4.0

Overview

UnifoLM-VLA is Unitree's open vision-language-action model, built to give humanoid robots physically grounded embodied intelligence. Published with a VLM base (UnifoLM-VLM-Base), a VLA base, and a LIBERO-benchmark variant. Released under CC BY-NC-SA 4.0 (non-commercial). Pairs naturally with Unitree's own humanoid hardware but the weights are open for research.

What it's used for

Humanoid robot control research; benchmarking VLA policies on LIBERO; teams already on Unitree hardware.

Primary sources

Other Chinese embodied models