Back to List
Managing AI Coding with Agent Evaluation: Meituan's 310,000-Line Code Refactoring Practice
Industry NewsAI CodingRefactoringSoftware Engineering

Managing AI Coding with Agent Evaluation: Meituan's 310,000-Line Code Refactoring Practice

Meituan's technical team has detailed a transformative approach to software maintenance by refactoring 310,000 lines of code using AI. As AI now generates over 90% of code in certain environments, the focus has shifted from coding speed to the implementation of strict constraints. The team introduced an 'Agent evaluation' mindset to manage AI-driven development, utilizing technical debt analysis, rule construction, Standard Operating Procedures (SOPs), and a Pre-PR mechanism. This framework successfully transitioned large-scale refactoring from a high-cost, specialized project into a continuous, daily iterative process. By establishing these systematic boundaries, the team ensures that AI enhances system quality rather than amplifying chaos, providing a scalable model for long-term AI-native code management.

美团技术团队

Key Takeaways

  • Scale of Success: Successfully managed the refactoring of 310,000 lines of code using AI-driven methodologies.
  • Governance Over Speed: When AI generates more than 90% of code, the primary challenge shifts from output velocity to the enforcement of architectural constraints.
  • Agent Evaluation Mindset: Applied evaluation logic typically used for AI Agents to manage and audit the quality of AI-generated code.
  • Systematic Framework: Utilized a combination of technical debt assessment, rule-based governance, and Standard Operating Procedures (SOPs).
  • Continuous Refactoring: Implemented a Pre-PR mechanism that integrates code refactoring into daily development cycles, reducing the need for high-cost specialized projects.

In-Depth Analysis

The Shift from Generation to Governance

In the current landscape of software engineering, the bottleneck is no longer how fast code can be written, but how effectively it can be governed. Meituan’s technical team points out that when AI is responsible for over 90% of code generation, the absence of uniform standards can lead to an exponential increase in system chaos. AI, while highly productive, does not inherently understand the long-term architectural goals of a complex system. Therefore, the role of the technical team has evolved from manual coding to defining the constraints and rules that guide AI behavior. The goal is to ensure that AI-generated code adheres to the same quality and consistency standards as human-written code, preventing the accumulation of unmanageable technical debt.

The Agent Evaluation Framework for AI Coding

To manage a massive 310,000-line refactoring effort, Meituan adopted an "Agent evaluation" approach. This methodology treats the AI as an autonomous agent whose outputs must be continuously validated against a set of predefined criteria. The process is structured around several key technical pillars:

  1. Technical Debt Assessment: Before refactoring begins, the system identifies existing debt to prioritize areas for AI intervention.
  2. Rule Construction: Establishing a robust set of coding rules that the AI must follow to maintain system integrity.
  3. Refactoring SOPs: Standard Operating Procedures provide a repeatable, reliable workflow for AI-driven changes, ensuring that the refactoring process is consistent across different modules.
  4. Pre-PR Mechanism: By introducing a verification stage before the Pull Request (PR), the team can catch and correct AI errors early, ensuring that only high-quality, compliant code enters the main branch.

Integrating Refactoring into the Daily Workflow

One of the most significant breakthroughs of this practice is the transformation of refactoring from a "special project" into a "daily habit." Historically, refactoring hundreds of thousands of lines of code would require a dedicated task force and significant downtime. By leveraging AI and the Pre-PR mechanism, Meituan has made it possible to perform continuous refactoring during regular feature iterations. This approach ensures that the codebase remains healthy and modern without the need for periodic, high-risk overhauls. It effectively democratizes code quality, making it a byproduct of the standard development lifecycle rather than an afterthought.

Industry Impact

Meituan's practice sets a significant precedent for the AI-native era of software engineering. It demonstrates that the key to scaling AI in development is not more powerful models, but better management frameworks. By sharing their success in refactoring 310,000 lines of code, they provide a blueprint for other large-scale tech organizations to handle the transition to AI-heavy codebases. This shift toward "AI-managed AI"—where automated systems and evaluation logic oversee the generation of code—marks a critical evolution in how software is maintained and scaled in the age of Large Language Models (LLMs).

Frequently Asked Questions

Question: Why is the 90% AI-generated code threshold significant?

At this level, the volume of code produced by AI exceeds the capacity for manual human review in traditional ways. Without strict constraints and automated governance like the Agent evaluation mindset, the AI can amplify existing system inconsistencies and create massive technical debt very quickly.

Question: What role does the Pre-PR mechanism play in AI coding?

The Pre-PR mechanism acts as a critical quality gate. It allows the system to evaluate AI-generated refactoring against established rules and SOPs before the code is even submitted for human review, ensuring that refactoring becomes a seamless part of the daily development iteration.

Question: How does Meituan's approach reduce the cost of refactoring?

By using AI to handle the bulk of the work and using SOPs to standardize the process, the team moves away from high-cost, manual refactoring projects. This allows for continuous improvement of the codebase, which is much more cost-effective than performing large-scale, disruptive refactoring every few years.

Related News

Project N.O.M.A.D: A Self-Sufficient Offline Survival Computer Integrating AI and Critical Knowledge Tools
Industry News

Project N.O.M.A.D: A Self-Sufficient Offline Survival Computer Integrating AI and Critical Knowledge Tools

Project N.O.M.A.D, developed by Crosstalk Solutions, is a specialized offline survival computer designed for total self-sufficiency. By integrating critical tools, a comprehensive knowledge base, and built-in artificial intelligence, the project aims to provide users with essential information and empowerment in environments where internet connectivity is unavailable or compromised. This initiative addresses the growing demand for resilient, decentralized technology that can function independently of the global cloud infrastructure. As an offline-first platform, Project N.O.M.A.D ensures that vital data and analytical capabilities remain accessible anytime and anywhere, marking a significant development in the intersection of survival technology and edge computing.

iOS 27 Developer Beta 1 First Look: Siri AI Waitlist and Early Testing on iPhone 16 Pro
Industry News

iOS 27 Developer Beta 1 First Look: Siri AI Waitlist and Early Testing on iPhone 16 Pro

Following the WWDC 2026 keynote, Apple has released the first developer beta of iOS 27. Early hands-on testing by industry experts, including Jay Peters from The Verge, highlights a significant shift toward integrated AI. While the update is now available for the iPhone 16 Pro, the most anticipated feature—the revamped Siri AI—is currently restricted by a waitlist. This phased rollout suggests a controlled deployment of Apple's latest intelligence features. Beyond the AI components, testers are beginning to explore a variety of new system features that define the next generation of the iPhone experience. This analysis covers the initial hours of the beta release, the hardware requirements, and the strategic implications of Apple's waitlist approach for its new AI ecosystem.

Sam Altman's Tools for Humanity Faces Layoffs Amid Revenue Struggles as OpenAI Files for IPO
Industry News

Sam Altman's Tools for Humanity Faces Layoffs Amid Revenue Struggles as OpenAI Files for IPO

Tools for Humanity, the identity verification company co-founded by Sam Altman, is reportedly undergoing a workforce reduction due to significant challenges in generating revenue. This development surfaces at a critical juncture as OpenAI, another major entity led by Altman, has officially filed for its Initial Public Offering (IPO). The contrast between these two ventures highlights a divergent path within Altman's portfolio: while OpenAI moves toward the public markets following a period of massive growth, Tools for Humanity is forced to downsize its operations to address financial sustainability. The report, originating from TechCrunch, underscores the difficulties faced by the eye-scanning technology firm in establishing a viable business model despite the high profile of its leadership and the innovative nature of its identity verification mission.