This competition marks a significant step forward in embodied AI, moving beyond simulation to real-world validation and practical benchmarking.
- 526 teams from 27 countries participated, with over 100 surpassing the baseline.
- The competition featured two tracks: Reasoning to Action (R2A) and World Model (WM), with a new supermarket benchmark emphasizing real-world interaction.
- PrismBot from vivo won the championship with 43.47 points.
- The NeoVerse-ABot won the WM track, showcasing advancements in AI prediction and interaction modeling.
- The competition's benchmark-driven approach and real-robot validation set a new standard for evaluating embodied AI systems.
Problem: Bridging the Gap Between Simulation and Real-World Deployment
The field of embodied AI has long faced a significant challenge: the disconnect between simulated environments and real-world performance. While simulation scores provide valuable insights, they often fail to account for the complexities of physical interactions, environmental variability, and long-term task reliability. This gap has hindered the practical deployment of AI systems in real-world applications, particularly in dynamic and unpredictable environments.
Solution: A Benchmark-Driven Approach with Real-Robot Validation
AGIBOT Innovation Technology Co., in collaboration with ICRA 2026, addressed this challenge by hosting the AGIBOT World Challenge 2026 in Vienna. The competition introduced a benchmark-driven format that combined online automated evaluation with offline real-robot finals, utilizing AGIBOT's EWMBench and Genie Sim Benchmark. This approach aimed to:
- **Standardize Evaluation Metrics**: By adopting a consistent framework, the competition ensured that all teams were evaluated on the same criteria, enabling fair and reproducible results. - **Incorporate Real-World Testing**: The offline finals, conducted using the AGIBOT G2 humanoid robot, focused on robot stability, real-world adaptability, and long-horizon task reliability. This shift emphasized practical deployment needs over purely theoretical performance. - **Expand Evaluation Scope**: The competition featured two tracks: Reasoning to Action (R2A) and World Model (WM). The R2A track, an evolution of the 2025 Manipulation track, evaluated the full process of environment understanding, task planning, and physical execution. The WM track focused on predicting physical-world changes and modeling interactions based on robot actions and sensor inputs.
Results: A New Benchmark for Embodied AI
The competition attracted 526 teams from 27 countries, including leading research institutions and companies such as the Chinese Academy of Sciences, Tsinghua University, UC San Diego, Sber Robotics Center, Alibaba, and vivo. Key outcomes include:
- **Over 100 teams surpassed the official baseline**, demonstrating the high level of competition and the effectiveness of the benchmarking approach. - **PrismBot from vivo claimed the championship** with 43.47 points, followed by Shanghai RoboParty's RP-VLA (35.66 points) and Russia's GreenVLA (33.19 points). - **Real-World Application Focus**: The introduction of a supermarket benchmark track, developed in partnership with Dexmal, further emphasized practical deployment. This track incorporated non-ideal physical interactions, such as object drops and grasping failures, and required models to complete tasks under physical constraints like shelf height limits and randomized item placement.
"By integrating real-robot validation into the evaluation process, we have shifted the focus from simulation scores to practical, real-world performance," said AGIBOT. "This aligns technical evaluation more closely with the needs of practical deployment."
NeoVerse-ABot's Victory in the WM Track
In the World Model (WM) track, the NeoVerse-ABot, a joint team from the Institute of Automation of the Chinese Academy of Sciences and Amap CV Lab, emerged as the winner. The PAI@IAII team from the Institute of Industrial Artificial Intelligence at the Chinese Academy of Sciences ranked second, highlighting the strong performance of Chinese teams in this category.
##
Is this your company?
This article features your business. Claim it to add your logo, contact details, and a link to your website — or upgrade to reach more buyers.
Did you know 80% of Press Releases trigger AI content warnings? Reach out and the M4S team can assist.
