Back to List
Meituan Launches LongCat-2.0: A 1.6 Trillion Parameter Model Trained on Domestic Computing Clusters
Industry NewsMeituanLarge Language ModelsAI Infrastructure

Meituan Launches LongCat-2.0: A 1.6 Trillion Parameter Model Trained on Domestic Computing Clusters

Meituan's technology team has officially announced the release of LongCat-2.0, a massive 1.6 trillion parameter model. This release marks a significant milestone as the industry's first model of this scale to complete its entire training and inference lifecycle on a domestic computing cluster consisting of 50,000 cards. LongCat-2.0 was pre-trained from scratch and features a dynamic activation architecture, with an average of 48B parameters active during operation. Designed with a native 1 million (1M) token ultra-long context window, the model is specifically optimized for Agentic Coding tasks. Its core objective is to provide superior stability and efficiency in code understanding, generation, and execution, addressing the complex needs of modern software development environments.

美团技术团队

Key Takeaways

  • Massive Scale: LongCat-2.0 features 1.6 trillion total parameters with a dynamic activation range of 33B to 56B.
  • Infrastructure Milestone: It is the first trillion-parameter model to complete full-process training and inference on a 50,000-card domestic computing cluster.
  • Ultra-Long Context: The model natively supports a 1 million (1M) token context window for processing extensive datasets.
  • Specialized Focus: Engineered specifically for Agentic Coding to improve code understanding, generation, and execution stability.

In-Depth Analysis

Breakthrough in Domestic Computing Infrastructure

The release of LongCat-2.0 by Meituan represents a pivotal moment for the AI industry, particularly regarding hardware independence. It is the first time a model with 1.6 trillion parameters has successfully navigated the entire pipeline—from initial pre-training to final inference—using a domestic computing cluster of 50,000 cards. This achievement demonstrates that large-scale AI development is no longer strictly dependent on international hardware ecosystems, proving that domestic infrastructure can support the rigorous demands of training and deploying world-class, trillion-parameter models.

Architectural Innovation and Dynamic Efficiency

LongCat-2.0 is not just notable for its size but also for its architectural efficiency. While the total parameter count reaches 1.6T, the model utilizes a dynamic activation mechanism. On average, only 48B parameters are activated during tasks, with a dynamic range fluctuating between 33B and 56B. This approach allows the model to maintain the high-level reasoning capabilities associated with trillion-parameter architectures while optimizing for computational throughput. Furthermore, the native support for a 1 million (1M) token context window allows the model to ingest and analyze massive codebases or long-form documents without losing coherence, a critical requirement for advanced technical tasks.

Optimized for Agentic Coding Tasks

Unlike many general-purpose large language models, LongCat-2.0 was designed from the ground up with a specific application in mind: Agentic Coding. The Meituan technology team focused the model's architecture on the core goal of making code-related workflows more efficient and stable. In real-world Agentic Coding scenarios, models must not only generate snippets but also understand complex project structures and execute code accurately. LongCat-2.0 is tailored to handle these multi-step processes, ensuring that code understanding and execution are handled with a high degree of reliability, which is essential for autonomous coding agents.

Industry Impact

The introduction of LongCat-2.0 signals a shift toward highly specialized, large-scale models that are deeply integrated with specific hardware environments. By successfully utilizing a 50,000-card domestic cluster, Meituan has set a new benchmark for infrastructure utilization in AI. This development is likely to encourage further investment in localized computing clusters and specialized model architectures. Additionally, the focus on 1M context windows and Agentic Coding suggests that the next frontier for AI lies in the ability to manage increasingly complex, long-context technical environments, moving beyond simple chat interfaces toward fully functional digital agents.

Frequently Asked Questions

Question: What makes LongCat-2.0 different from other trillion-parameter models?

LongCat-2.0 is unique because it is the first model of its size (1.6T parameters) to be trained and deployed entirely on a domestic 50,000-card computing cluster. It also features a dynamic activation range (33B-56B) and is specifically optimized for Agentic Coding rather than general-purpose tasks.

Question: How does the 1M context window benefit developers?

A 1 million token context window allows the model to process and "remember" an entire codebase or multiple large technical documents simultaneously. This is crucial for Agentic Coding, where the model needs to understand the relationships between different files and functions across a large project to generate or debug code effectively.

Question: What is meant by "Agentic Coding" in the context of LongCat-2.0?

Agentic Coding refers to the model's ability to act as an autonomous or semi-autonomous agent that can understand, generate, and execute code. LongCat-2.0 is designed to perform these tasks with higher stability and efficiency in real-world programming environments compared to standard models.

Related News

Meituan Technical Team Presents Selected Academic Research at ICML 2026
Industry News

Meituan Technical Team Presents Selected Academic Research at ICML 2026

The Meituan Technical Team has announced its participation in the International Conference on Machine Learning (ICML) 2026, showcasing a selection of academic papers. As one of the most influential international academic conferences in the field, ICML serves as a premier platform for discussing the critical challenges and core issues facing the future of machine learning. Meituan's involvement highlights its commitment to contributing to frontier research that possesses both significant theoretical value and practical impact. By engaging with this global community, the Meituan Technical Team aims to help drive the development of the field and influence future research directions through the evaluation and dissemination of high-impact research results.

Meituan Technical Team Showcases Cutting-Edge AI Research in Search and Recommendation at Top Global Conferences
Industry News

Meituan Technical Team Showcases Cutting-Edge AI Research in Search and Recommendation at Top Global Conferences

Meituan's Business R&D Platform/Search & Recommendation ASX (Agentic System X) team has recently shared insights from their latest research published at premier AI conferences. Focusing on the development of an Agent technology system powered by Large Language Models (LLMs), the team has made significant strides in LLM post-training, Agentic Reinforcement Learning, and multi-modal understanding. With dozens of papers accepted by prestigious venues such as ICLR, NeurIPS, CVPR, and AAAI, Meituan is positioning itself at the forefront of AI innovation. This special feature highlights six selected papers that demonstrate the team's commitment to advancing search and recommendation technologies through sophisticated agentic systems and multi-modal integration, providing valuable insights for the broader AI research community.

Meituan LongCat Launches General 365: New Reasoning Benchmark Reveals AI Performance Gaps
Industry News

Meituan LongCat Launches General 365: New Reasoning Benchmark Reveals AI Performance Gaps

Meituan's LongCat team has officially released General 365, a new evaluation benchmark specifically designed to measure the reasoning capabilities of large language models. In a comprehensive assessment of 26 mainstream AI models, the benchmark revealed a significant struggle across the industry to handle complex reasoning tasks. According to the results, Gemini 3 Pro emerged as the top performer but only managed an accuracy rate of 62.8%. Most notably, the vast majority of the models tested failed to reach the 60% accuracy threshold, which is considered the passing mark. This release by Meituan's technical team establishes a more rigorous standard for AI evaluation, highlighting that even the most advanced models currently available face substantial challenges in logical reasoning.