US-China tech war: Beijing-funded AI researchers surpass Google and OpenAI with new language processing model

Name: AI instructors teach student drivers in Shanghai how to get behind the wheel
Uploaded: 2021-06-02T09:30:17.000Z
Duration: 1 min 46 s

The WuDao 2.0 natural language processing model had 1.75 trillion parameters, topping the 1.6 trillion that Google unveiled in a similar model in January

China has been pouring money into AI to try to close the gap with the US, which maintains an edge because of its dominance in semiconductors


People view the exhibits at the World Intelligence Congress in Tianjin on May 20. China has been pouring resources into artificial intelligence and other critical technologies in a push to close the gap with the US. Photo: Xinhua

Coco Feng in Beijing

Published: 5:30pm, 2 Jun 2021
Updated: 8:27pm, 2 Jun 2021

A government-funded artificial intelligence (AI) institute in Beijing unveiled on Monday the world’s most sophisticated natural language processing (NLP) model, surpassing those from Google and OpenAI, as China seeks to increase its technological competitiveness on the world stage.

The WuDao 2.0 model is a pre-trained AI model that uses 1.75 trillion parameters to simulate conversational speech, write poems, understand pictures and even generate recipes. The project was led by the non-profit research institute Beijing Academy of Artificial Intelligence (BAAI) and developed with more than 100 scientists from multiple organisations.

Parameters are the internal variables that a machine learning model learns from data. As training progresses, the parameters are refined so that the algorithm gets better at finding the correct outcome. Once a model is trained on a specific data set, such as samples of human speech, it can then be applied to solving similar problems.
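To make the idea of parameters concrete, here is a minimal sketch in Python. It is a toy illustration only, not how WuDao 2.0 or any other large model is built. It fits a model with just two parameters, `w` and `b`, to made-up data using gradient descent. The function name `train`, the learning rate and the sample data are all assumptions chosen for the example.

```python
# Toy illustration: a "model" with two parameters (w, b) that predicts y = w * x + b.
# All names and values here are hypothetical and chosen only for this example.

def train(samples, steps=2000, lr=0.05):
    """Refine the parameters step by step so predictions move closer to the data."""
    w, b = 0.0, 0.0  # parameters start at arbitrary values
    n = len(samples)
    for _ in range(steps):
        # Gradient of the mean squared error with respect to each parameter.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in samples) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in samples) / n
        # Nudge each parameter in the direction that reduces the error.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Made-up training data generated from the rule y = 3x + 1.
samples = [(x, 3.0 * x + 1.0) for x in range(5)]
w, b = train(samples)
print(f"learned w={w:.3f}, b={b:.3f}")
```

A model such as WuDao 2.0 follows the same broad principle of adjusting parameters to reduce error, but with 1.75 trillion of them rather than two.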

In general, the more parameters a model contains, the more sophisticated it is. However, creating a more complex model requires time, money, and research breakthroughs.


In an era of fast-evolving AI models, BAAI researchers claim to have broken the record set in January by Google’s Switch Transformer, which has 1.6 trillion parameters. OpenAI’s GPT-3 model made waves last year when it was released with 175 billion parameters, making it the largest NLP model at the time.
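The relative sizes of the three models can be checked with a few lines of arithmetic. The sketch below uses only the parameter counts reported in this article, taking the 1.6 trillion figure given at the top of the story for Google's model. The dictionary name `parameter_counts` is an assumption made for the example.

```python
# Parameter counts as reported in the article (hypothetical variable names).
parameter_counts = {
    "WuDao 2.0": 1_750_000_000_000,           # 1.75 trillion
    "Switch Transformer": 1_600_000_000_000,  # 1.6 trillion
    "GPT-3": 175_000_000_000,                 # 175 billion
}

wudao = parameter_counts["WuDao 2.0"]
ratio_to_gpt3 = wudao / parameter_counts["GPT-3"]
ratio_to_switch = wudao / parameter_counts["Switch Transformer"]

print(f"WuDao 2.0 vs GPT-3: {ratio_to_gpt3:.1f}x")
print(f"WuDao 2.0 vs Switch Transformer: {ratio_to_switch:.2f}x")
```

On these figures, WuDao 2.0 has ten times as many parameters as GPT-3 and about 9 per cent more than Switch Transformer.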

WuDao 2.0 covers both Chinese and English with skills acquired by studying 4.9 terabytes of images and texts, including 1.2 terabytes each of Chinese and English texts. It already has 22 partners, including smartphone maker Xiaomi, on-demand delivery service provider Meituan and short-video giant Kuaishou.