In nine out of 12 evaluations, Qwen2.5 Coder’s flagship variant performed better than GPT-4o and Claude 3.5 Sonnet, according to a statement
Alibaba Group Holding has developed an artificial intelligence (AI) model that rivals leading ones from US peers such as OpenAI and Anthropic in terms of coding capabilities, in a sign that Chinese tech firms are running neck-and-neck with American players in open source models.
Qwen2.5 Coder, the latest open source large language model (LLM) from Alibaba’s cloud computing arm, has matched or surpassed OpenAI’s GPT-4o and Claude 3.5 Sonnet from Amazon.com-backed Anthropic in coding capabilities, according to evaluations that included HumanEval, EvalPlus and Aider, the Qwen team said in a statement on Tuesday. Alibaba owns the South China Morning Post.