Advertisement
Robotics
Tech

Robotics’ ‘ChatGPT moment’ could come within 2 years, founder of China’s Unitree says

AI could propel advances in embodied intelligence to allow autonomous robots to function in unfamiliar environments, Wang Xingxing says

Reading Time:2 minutes
Why you can trust SCMP
4
A Unitree Robotics robot gives a combat performance at the 2025 World Robot Conference in Beijing on August 8, 2025. Photo: Xinhua
Coco Fengin Guangdong
The “ChatGPT moment” for the robotics industry could arrive in as little as two years if powerful artificial intelligence technology develops to propel robots’ movements, according to the founder of China’s industry leader Unitree Robotics.

Wang Xingxing defined this moment as the first time a robot could perform a task, such as cleaning a room or bringing a bottle of water to a targeted person, in a venue that it had never been to before.

“If things develop fast, it could happen in the next year or two, or maybe two to three years”, he said on Saturday at the World Robot Conference in Beijing.

Advertisement

Although both robot hardware, such as dexterous hands, and training data were good enough to enable the feat, the crucial element of “AI for embodied intelligence is completely inadequate”, he said.

He had “doubts” about whether popular vision language action (VLA) models, which used a rather “dumb” architecture, were up to the task, he said. Although Unitree also used such models, along with reinforcement learning to improve pre-trained VLAs in downstream tasks, the approach required a lot of optimisation, he said.

Advertisement

Another approach, generating a video or interactive model based on text prompts and making robots follow this to perform tasks, could have a “higher probability” of succeeding in robot motion control, Wang said.

He cited Google’s general-purpose Genie 3 “world model”, which was launched on Tuesday and is billed as capable of generating models of dynamic worlds that include information on physical properties, as an example of technology development in this area.

Advertisement
Select Voice
Choose your listening speed
Get through articles 2x faster
1.25x
250 WPM
Slow
Average
Fast
1.25x