Back to List
Industry NewsAISoftware DevelopmentPartnership

Anthropic Launches AI-Powered Code Review for Claude Code Amidst Pentagon Blacklist Lawsuit and Microsoft Partnership

Anthropic has introduced Code Review, a multi-agent AI system for Claude Code designed to identify bugs in pull requests that human reviewers often miss. This feature, currently in research preview for Team and Enterprise customers, was released on a significant day for the company. Simultaneously, Anthropic filed lawsuits against the Trump administration regarding a Pentagon blacklisting and announced a new partnership with Microsoft to embed Claude into its Microsoft 365 Copilot platform. This confluence of a major product launch, a federal legal challenge, and a significant distribution deal highlights the complex environment Anthropic is navigating, as it aims to grow its developer tools business, address a government national security designation, and expand its commercial reach.

VentureBeat

Anthropic on Monday released Code Review, a multi-agent code review system built into Claude Code that dispatches teams of AI agents to scrutinize every pull request for bugs that human reviewers routinely miss. The feature, now available in research preview for Team and Enterprise customers, arrives on what may be the most consequential day in the company's history: Anthropic simultaneously filed lawsuits against the Trump administration over a Pentagon blacklisting, while Microsoft announced a new partnership embedding Claude into its Microsoft 365 Copilot platform. The convergence of a major product launch, a federal legal battle, and a landmark distribution deal with the world's largest software company captures the extraordinary tension defining Anthropic's current moment. The San Francisco-based AI lab is simultaneously trying to grow a developer tools business approaching $2.5 billion in annualized revenue, defend itself against an unprecedented government designation as a national security threat, and expand its commercial footprint through the very cloud platforms now navigating the fallout. Code Review is Anthropic's most aggressive bet yet that engineering organizations will pay significantly more — $15 to $25 per review — for AI-assisted code quality assurance that prioritizes thoroughness over speed. It also signals a broader strategic pivot: the company isn't just building models, it's building opinionated developer workflows around them. How a team of AI agents reviews your pull requests Code Review works differently from the lightweight code review tools most developers are accustomed to. When a developer opens a pull request, the system dispatches multiple AI agents that operate in parallel. These agents independently search for bugs, then cross-verify each other's findings to filter out false positives, and finally rank the remaining issues by severity. The output appears as a single overview comment on the PR along with detailed findings.

Related News

Meituan LongCat Unveils General 365: A Rigorous New Standard for AI Reasoning Evaluation
Industry News

Meituan LongCat Unveils General 365: A Rigorous New Standard for AI Reasoning Evaluation

Meituan's LongCat team has officially released General 365, a new benchmark designed to evaluate the reasoning capabilities of artificial intelligence models. The initial testing phase involved 26 mainstream models, revealing a significant performance gap in the industry. According to the results, the top-performing model, Gemini 3 Pro, achieved an accuracy rate of only 62.8%. More strikingly, the vast majority of the models tested failed to reach the 60% accuracy threshold, which is considered a basic passing mark. This release by Meituan aims to provide a more challenging and accurate metric for assessing how well modern AI can handle complex reasoning tasks, highlighting that even the most advanced systems currently struggle with the demands of the General 365 evaluation.

Managing AI Coding with Agent Evaluation Logic: Insights from a 310,000-Line Code Refactoring Practice
Industry News

Managing AI Coding with Agent Evaluation Logic: Insights from a 310,000-Line Code Refactoring Practice

As AI-generated code begins to comprise over 90% of modern systems, the technical challenge shifts from speed to governance. Meituan's technical team has shared a comprehensive framework for managing AI coding based on their experience refactoring 310,000 lines of code. The core of their approach involves using an 'Agent evaluation' mindset to prevent AI from amplifying system chaos. By implementing technical debt sorting, rule construction, standardized operating procedures (SOPs), and a Pre-PR mechanism, the team successfully transitioned large-scale refactoring from a high-cost, specialized project into a sustainable, daily iterative process. This shift emphasizes that the ultimate trajectory of a system is determined by the constraints placed on AI rather than the speed of code generation.

LongCat Powers OpenClaw with Efficiency Engine: Boosting Automation Performance by 30% via Official API
Industry News

LongCat Powers OpenClaw with Efficiency Engine: Boosting Automation Performance by 30% via Official API

The LongCat team has officially introduced a stable and compliant free API for OpenClaw, aimed at significantly enhancing the efficiency of automated tasks. By providing a direct official channel, LongCat addresses the inherent risks associated with third-party subscriptions, such as account security vulnerabilities and service instability. This new efficiency engine allows developers to optimize their automation workflows, potentially increasing speed by 30%. The initiative by the Meituan Technical Team emphasizes the importance of using official, secure pathways to maintain the integrity of developer tools and ensure consistent service performance in complex automation environments.