Alibaba Cloud open-sources two generative AI models based on its ChatGPT-style Tongyi Qianwen

  • The Chinese company has made two of its large language models, Qwen-7B and Qwen-7B-Chat, freely available for commercial and research use
  • Alibaba Cloud, which is set to be spun off from its parent next year, has been doubling down on generative AI amid a global frenzy around ChatGPT

Alibaba Cloud said it would open-source two of its large language models. Photo: Reuters
Ann Cao in Shanghai
Alibaba Cloud, the digital technology backbone of Alibaba Group Holding, has publicly shared two of its large language models (LLMs), as the Chinese tech giant moves to expand its influence in the field of generative artificial intelligence (AI).
The two open-source models, Qwen-7B and Qwen-7B-Chat, each have 7 billion parameters. Qwen-7B-Chat is a fine-tuned version of Qwen-7B capable of conducting humanlike conversations. Both are smaller-scale versions of Tongyi Qianwen, the LLM that Alibaba’s cloud computing unit launched in April.

The models’ inner workings, including their code and documentation, will be made freely accessible to scholars, researchers and commercial institutions worldwide through Alibaba Cloud’s AI model repository ModelScope and the US collaborative AI platform Hugging Face, the Hangzhou-based firm said.

Companies with fewer than 100 million monthly active users will be allowed to deploy the open-source models for commercial use free of charge, while those with more users will need to request a licence from Alibaba Cloud.

Alibaba owns the South China Morning Post.

LLMs are deep-learning models, trained on very large data sets, that can recognise, summarise, translate and generate content. The technology underpins chatbots such as OpenAI’s ChatGPT, which has generated immense global interest since its launch in November 2022.
