🤖 FlagOS-Robo is built on FlagOS, a unified, open-source AI system software stack that supports various AI chips. It serves as an integrated training and inference framework for the AI models used in robots 🤖, a field known as Embodied Intelligence. It can be deployed across diverse scenarios, from edge to cloud. Because it is portable across chip models, it enables efficient training, inference, and deployment for both Vision Language Models (VLMs) and Vision Language Action (VLA) models. Here, VLMs usually act as the brain 🧠 for task planning, while VLA models act as the cerebellum, outputting actions for robot control 🦾.
FlagOS-Robo supports the full lifecycle of embodied intelligence models: data loading from diverse formats (WebDataset, Megatron-Energon, and LeRobot datasets), supervised fine-tuning (SFT), inference deployment, and integrated testing and evaluation via the FlagEval-Robo platform. Users can reproduce the end-to-end pipeline in their own environment by downloading and running the provided examples.
FlagOS-Robo has been deeply integrated into BAAI’s Embodied Intelligence platform RoboXStudio, which provides one-stop services including real-robot data collection, data annotation, supervised fine-tuning of VLA models, and evaluation. Users without a local setup can directly access RoboXStudio and run experiments without any installation.
FlagOS-Robo provides a computational foundation and systematic support for cutting-edge research and industrial applications in embodied intelligence, accelerating innovation and the real-world deployment of intelligent agents.
- FlagScale serves as the user entry point and supports training and inference for robot-related AI models, including PI0, RoboBrain-2.0, and RoboBrain-X0.
- FlagOS-Robo supports RoboOS-based cross-embodiment collaboration, ensuring compatibility with different data formats, efficient edge-cloud coordination, and real-machine evaluation.

| Model | Type | Checkpoint | Train | Inference | Serve | Evaluate |
|---|---|---|---|---|---|---|
| PI0 | VLA | Hugging Face | ✅ Guide | ✅ Guide | ✅ Guide | ❌ |
| PI0.5 | VLA | Hugging Face | ✅ Guide | ✅ Guide | ✅ Guide | ❌ |
| RoboBrain-2.0 | VLM | Hugging Face | ✅ Guide | ✅ Guide | ✅ Guide | ✅ Guide |
| RoboBrain-X0 | VLA | Hugging Face | ✅ Guide | ❌ | ✅ Guide | ❌ |