
Chinese AI start-up Baichuan claims to beat Anthropic, OpenAI with model that can process 350,000 Chinese characters

  • The Beijing-based company says the latest version of its large language model has a bigger ‘context window’ than its foreign competitors
  • The company also says its model surpasses Anthropic’s Claude 2 in its quality of responses, as well as its understanding of long text

Baichuan founder Wang Xiaochuan. Photo: Weibo
Ben Jiang in Beijing

Chinese artificial intelligence start-up Baichuan has launched an AI model that it says can digest and summarise novels, which the company claims makes it the world’s most powerful model for handling long text prompts.

The Beijing-based company, established by Wang Xiaochuan, founder of Chinese search engine Sogou, on Monday announced Baichuan2-192k, the latest iteration of its large language model (LLM), saying its “context window” can handle around 350,000 Chinese characters.

A context window is the combination of input and output text that a model can process during conversations with users.
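
Because input and output share the same window, a longer prompt leaves less room for the reply. The following is a minimal Python sketch of that trade-off, not Baichuan's or any vendor's actual accounting. The function name and the figures are hypothetical, and it assumes the “192k” in the model name refers to a window of 192,000 tokens, which the article does not state.

```python
def remaining_output_budget(context_window: int, prompt_tokens: int) -> int:
    """Return how many tokens are left for the reply after the prompt is counted.

    Both arguments are illustrative token counts, not values from any real API.
    """
    if prompt_tokens < 0 or context_window < 0:
        raise ValueError("token counts must be non-negative")
    if prompt_tokens > context_window:
        raise ValueError("prompt alone exceeds the context window")
    return context_window - prompt_tokens


# Assumed window size (192,000 tokens) and a hypothetical 150,000-token prompt.
window = 192_000
prompt = 150_000
print(remaining_output_budget(window, prompt))
```

A prompt that fills the whole window would leave a budget of zero, which is why the input and the expected reply are budgeted together.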

For comparison, Claude 2, introduced in July by Amazon.com-backed Anthropic as the world’s most advanced AI model in terms of the number of words that users could include in their chat queries, was said to have a context window of around 75,000 English words, corresponding to hundreds of pages of documents or a book.

The context window of the Baichuan model is 14 times bigger than that of OpenAI’s GPT-4-32k, according to a WeChat post by the Chinese company.

The Baichuan website. Photo: Screenshot

Baichuan also said its model surpassed Claude 2 in the quality of its responses, as well as in its understanding and summarisation of long text, citing test results from LongEval, a project launched by the University of California, Berkeley and other US institutions to evaluate how well LLMs handle large prompts.
