Chinese scientists’ attack on ChatGPT shows how criminals can use weak spots to exploit AI
- Using the researchers’ attack method, Bard misclassified a giant panda’s face as a woman, and Bing Chat misclassified a bald eagle as a cat and a dog
- Researchers home in on models’ vulnerabilities as world powers sign up to the Bletchley Declaration at the UK summit on AI safety

Zhang Tong in Beijing
An AI model under attack could mistake giant pandas for humans or fail to detect harmful content, according to a research team in Beijing that says it discovered an effective method for attacking ChatGPT and other popular commercial AI models.
The doctored images used by the researchers appeared almost identical to the originals, yet they effectively circumvented the models’ mechanisms designed to filter out toxic information.
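The article does not spell out the team’s exact method, but the description matches the well-studied family of adversarial-example attacks, in which tiny pixel-level changes invisible to humans flip a model’s output. Below is a minimal sketch of one classic technique of this kind, the fast gradient sign method (FGSM); the model, tensors and epsilon value are illustrative assumptions, not the researchers’ actual code.

```python
import torch
import torch.nn.functional as F

def fgsm_perturb(model, image, true_label, epsilon=0.01):
    """Fast gradient sign method (Goodfellow et al., 2015).

    Nudges every pixel by +/- epsilon in the direction that increases
    the classification loss, yielding an image that looks unchanged to
    a human but can change the model's prediction. A generic
    illustration of adversarial perturbation, not the Beijing team's
    attack, which the article does not detail.
    """
    # Track gradients with respect to the input image itself.
    image = image.clone().detach().requires_grad_(True)
    # true_label is a tensor of class indices, e.g. "giant panda".
    loss = F.cross_entropy(model(image), true_label)
    loss.backward()
    # Step each pixel along the sign of the loss gradient.
    adversarial = image + epsilon * image.grad.sign()
    # Keep pixel values in the valid [0, 1] range.
    return adversarial.clamp(0, 1).detach()
```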
The findings highlight significant security concerns within artificial intelligence and help shed light on the vulnerabilities of commercial multimodal large language models (MLLMs) from tech giants including Google, Microsoft and Baidu.
At the inaugural AI Safety Summit, held at Bletchley Park in the UK last week, representatives from the US, Britain, the European Union, China and India signed the Bletchley Declaration, an unprecedented deal to encourage the safe and ethical development and use of AI.
Wu Zhaohui, China’s vice-minister of science and technology, took part in the summit and presented proposals advocating stronger technical risk controls in AI governance.