返回精选
AI 精选动态 智能评分 80

美团发布开源编程模型LongCat-2.0,实现全栈国产化训练

来源: twitter关注列表
作者: Rohan Paul (@rohanpaul_ai)
发布于: 2026-06-30
收录于: 2026-06-30
AI 推荐理由
与DeepSeek仅用国产芯片推理不同,LongCat-2.0实现全栈国产训练,展示了国产芯片生态的实质性突破,值得持续跟踪。
核心解读
美团发布LongCat-2.0,一个开源1.6T参数MoE(33B-56B参数)编程模型,支持1M token上下文窗口。该模型在50,000块国产芯片上从零训练,使用华为HCCL通信库,证明大规模模型训练可在国产计算集群完成,区别于DeepSeek仅用国产芯片推理。
全文
🇨🇳China claims a new milestone in locally trained AI, as Meituan rolls out LongCat-2.0. Meituan, China's food delivery giant, just released LongCat-2.0, an open-source 1.6T-parameter MoE (33B–56B parameters) coding model. 1M tokens context window. Open-source: Available on longcat[.]ai and OpenRouter, top 3 globally by call volume. LongCat-2.0 was trained from scratch on 50,000 Chinese domestic chips and Meituan said this proves large-scale model training can now be done on domestic compute clusters. Shows again the rising push for self-reliance in China’s AI market, as DeepSeek, Alibaba, ByteDance, and others try to depend less on U.S. chips for model training after Washington’s export controls since 2022. While DeepSeek-V4-pro relied on home-grown chips only for inference, LongCat-2.0 used domestic hardware for both inference and pre-training, according to Meituan. Meituan did not directly identify its hardware supplier, but said in a WeChat post on Tuesday that it used Huawei Collective Communication Library (HCCL) to make training more stable. HCCL is a chip-to-chip communication system like Nvidia Collective Communication Library (NCCL). This removed doubts that Atlas-950 SuperPoDs could not train large LLMs for Zhipu AI and DeepSeek. ![photo](https://pbs.twimg.com/media/HMEwgnxawAEOEuG.jpg) Rohan Paul (@rohanpaul_ai): https://t.co/VBWCKd55Wb
#模型发布#开源#技术突破