Originally published by:therobotreport.com
M4S Take

This competition marks a significant step forward in embodied AI, moving beyond simulation to real-world validation and practical benchmarking.

  • 526 teams from 27 countries participated, with over 100 surpassing the baseline.
  • The competition featured two tracks: Reasoning to Action (R2A) and World Model (WM), with a new supermarket benchmark emphasizing real-world interaction.
  • PrismBot from vivo won the championship with 43.47 points.
  • The NeoVerse-ABot won the WM track, showcasing advancements in AI prediction and interaction modeling.
  • The competition's benchmark-driven approach and real-robot validation set a new standard for evaluating embodied AI systems.

Problem: Bridging the Gap Between Simulation and Real-World Deployment

The field of embodied AI has long faced a significant challenge: the disconnect between simulated environments and real-world performance. While simulation scores provide valuable insights, they often fail to account for the complexities of physical interactions, environmental variability, and long-term task reliability. This gap has hindered the practical deployment of AI systems in real-world applications, particularly in dynamic and unpredictable environments.

Solution: A Benchmark-Driven Approach with Real-Robot Validation

AGIBOT Innovation Technology Co., in collaboration with ICRA 2026, addressed this challenge by hosting the AGIBOT World Challenge 2026 in Vienna. The competition introduced a benchmark-driven format that combined online automated evaluation with offline real-robot finals, utilizing AGIBOT's EWMBench and Genie Sim Benchmark. This approach aimed to:

- **Standardize Evaluation Metrics**: By adopting a consistent framework, the competition ensured that all teams were evaluated on the same criteria, enabling fair and reproducible results. - **Incorporate Real-World Testing**: The offline finals, conducted using the AGIBOT G2 humanoid robot, focused on robot stability, real-world adaptability, and long-horizon task reliability. This shift emphasized practical deployment needs over purely theoretical performance. - **Expand Evaluation Scope**: The competition featured two tracks: Reasoning to Action (R2A) and World Model (WM). The R2A track, an evolution of the 2025 Manipulation track, evaluated the full process of environment understanding, task planning, and physical execution. The WM track focused on predicting physical-world changes and modeling interactions based on robot actions and sensor inputs.

Results: A New Benchmark for Embodied AI

The competition attracted 526 teams from 27 countries, including leading research institutions and companies such as the Chinese Academy of Sciences, Tsinghua University, UC San Diego, Sber Robotics Center, Alibaba, and vivo. Key outcomes include:

- **Over 100 teams surpassed the official baseline**, demonstrating the high level of competition and the effectiveness of the benchmarking approach. - **PrismBot from vivo claimed the championship** with 43.47 points, followed by Shanghai RoboParty's RP-VLA (35.66 points) and Russia's GreenVLA (33.19 points). - **Real-World Application Focus**: The introduction of a supermarket benchmark track, developed in partnership with Dexmal, further emphasized practical deployment. This track incorporated non-ideal physical interactions, such as object drops and grasping failures, and required models to complete tasks under physical constraints like shelf height limits and randomized item placement.

"By integrating real-robot validation into the evaluation process, we have shifted the focus from simulation scores to practical, real-world performance," said AGIBOT. "This aligns technical evaluation more closely with the needs of practical deployment."

NeoVerse-ABot's Victory in the WM Track

In the World Model (WM) track, the NeoVerse-ABot, a joint team from the Institute of Automation of the Chinese Academy of Sciences and Amap CV Lab, emerged as the winner. The PAI@IAII team from the Institute of Industrial Artificial Intelligence at the Chinese Academy of Sciences ranked second, highlighting the strong performance of Chinese teams in this category.

##

SM

Simon McLoughlin

Founder & Editor, M4S News

20+ years in manufacturing and engineering. I started M4S News to cut through the noise and deliver real intelligence to the people who actually make things. When I'm not writing or editing, I'm talking to engineers on factory floors.

Is this your company?

This article features your business. Claim it to add your logo, contact details, and a link to your website — or upgrade to reach more buyers.

Did you know 80% of Press Releases trigger AI content warnings? Reach out and the M4S team can assist.