Alibaba’s new AI model rivals OpenAI’s GPT-4o in coding ability amid open source competition

In nine out of 12 evaluations, Qwen2.5 Coder’s flagship variant performed better than GPT-4o and Claude 3.5 Sonnet, according to a statement

Ben Jiang in Beijing
The Alibaba logo is seen at the company’s headquarters in Hangzhou, Zhejiang province, China, November 11, 2019. Photo: Reuters

Alibaba Group Holding has developed an artificial intelligence (AI) model that rivals leading ones from US peers such as OpenAI and Anthropic in terms of coding capabilities, in a sign that Chinese tech firms are running neck-and-neck with American players in open source models.

Qwen2.5 Coder, the latest open source large language model (LLM) from Alibaba’s cloud computing arm, has matched or surpassed OpenAI’s GPT-4o and Claude 3.5 Sonnet from Amazon.com-backed Anthropic in coding capabilities, according to evaluations that included HumanEval, EvalPlus and Aider, the Qwen team said in a statement on Tuesday. Alibaba owns the South China Morning Post.

Copyright © 2025 South China Morning Post Publishers Ltd. All rights reserved.