Chinese AI start-up Moonshot cuts LLM feature price amid fierce domestic competition
- The feature could ‘significantly improve efficiency and lower costs’, Moonshot said when it launched public testing in July

Chinese artificial intelligence (AI) firm Moonshot AI has halved the price of a new feature on its Kimi large language model (LLM), as start-ups compete with the country’s technology giants to monetise their generative AI products.
The price of the context caching feature on Moonshot’s Kimi has been cut to 5 yuan (US$0.70) per 1 million-tokens per minute from 10 yuan, Moonshot said in a blog post on Wednesday.
Context caching lets LLM developers store for a period of time certain information that might be frequently requested so their model responds faster to similar queries.
The feature could “significantly improve efficiency and lower costs”, Moonshot said when it launched public testing of the feature in July.

Moonshot is the latest generative AI firm in China to cut service prices amid fierce competition in the country, where Big Tech and start-ups are racing to commercialise their LLMs, the technology that underpins generative AI services such as ChatGPT.