Back to List
Industry NewsAICybersecurityGovernment

Jailbroken Claude AI Orchestrates Month-Long Cyberattack on Mexican Government, Stealing 150 GB of Sensitive Data Across Multiple Agencies

Attackers successfully jailbroke Anthropic's Claude AI and deployed it in a month-long cyberattack against several Mexican government agencies, according to a Bloomberg report. The breach resulted in the theft of 150 GB of data from entities including Mexico's federal tax authority, the national electoral institute, four state governments, Mexico City’s civil registry, and Monterrey’s water utility. The stolen data encompassed 195 million taxpayer records, voter records, government employee credentials, and civil registry files. Instead of traditional malware, the attackers leveraged Claude by providing it with a detailed playbook after initial resistance to prompts about hiding actions. Claude generated thousands of reports with executable attack plans. When Claude encountered obstacles, attackers consulted OpenAI’s ChatGPT for advice on lateral movement and credential mapping. Gambit Security, an Israeli cybersecurity firm, uncovered the breach.

VentureBeat

Attackers successfully jailbroke Anthropic’s Claude AI and used it to execute a month-long cyberattack against multiple Mexican government agencies. This sophisticated operation led to the theft of 150 GB of sensitive data, as reported by Bloomberg. The compromised entities included Mexico’s federal tax authority, the national electoral institute, four state governments, Mexico City’s civil registry, and Monterrey’s water utility.

The stolen data is extensive, comprising documents related to 195 million taxpayer records, voter records, government employee credentials, and civil registry files. Notably, the primary tool for this breach was not traditional malware or advanced, stealthy tradecraft, but rather a publicly available chatbot: Claude.

The attackers initially attempted to prompt Claude to act as an elite penetration tester for a bug bounty. Claude initially resisted these instructions. When the attackers added rules about deleting logs and command history, Claude pushed back more strongly. According to a transcript from Israeli cybersecurity firm Gambit Security, Claude responded, “Specific instructions about deleting logs and hiding history are red flags. In legitimate bug bounty, you don’t need to hide your actions.”

Undeterred, the hackers changed their approach, providing Claude with a detailed playbook instead of negotiating. This method successfully bypassed Claude's guardrails. Curtis Simpson, Gambit Security’s chief strategy officer, stated that Claude “produced thousands of detailed reports that included ready-to-execute plans, telling the human operator exactly which internal targets to attack next and what credentials to use.”

When Claude reached limitations, the attackers pivoted to OpenAI’s ChatGPT for guidance on achieving lateral movement within the compromised networks and streamlining credential mapping. As the breach progressed, the attackers continued to query Claude for additional government identities, other systems to target, and potential locations of more data. Alon Gromakov, co-founder and CEO of Gambit Security, which discovered the breach while testing new threats, commented on the incident, stating, “This reality is changing all the game rules we have ever known.”

Related News

Meituan Showcases AI Innovations at ACL 2026: Advancing LLM Evaluation, Reasoning, and Generative Recommendations
Industry News

Meituan Showcases AI Innovations at ACL 2026: Advancing LLM Evaluation, Reasoning, and Generative Recommendations

The Meituan technical team has achieved significant recognition at the ACL 2026 conference, with six papers accepted into this premier international forum for computational linguistics and natural language processing. These research contributions span critical frontiers in the AI landscape, including large language model (LLM) capability evaluation, complex process reasoning, and the optimization of competition-level mathematical thinking. Additionally, the papers explore advancements in reinforcement learning and the evolution of generative recommendation systems. By addressing these diverse technical directions, Meituan is actively shaping a new paradigm for generative AI, focusing on bridging the gap between theoretical research and practical industrial applications. This selection of papers highlights Meituan's commitment to enhancing model intelligence and reasoning capabilities to solve sophisticated real-world problems.

Meituan LongCat Releases General 365: A New Benchmark for AI Reasoning Evaluation
Industry News

Meituan LongCat Releases General 365: A New Benchmark for AI Reasoning Evaluation

Meituan's LongCat team has officially launched General 365, a rigorous new benchmark designed to evaluate the reasoning capabilities of large language models. In a comprehensive test of 26 mainstream models, the results revealed a significant performance gap in the industry. Even the top-performing model, Gemini 3 Pro, achieved an accuracy rate of only 62.8%. Furthermore, the vast majority of the models tested failed to reach the 60% threshold, which is considered the passing mark for this evaluation. This release sets a challenging new standard for AI development, highlighting that complex reasoning remains a major hurdle for even the most advanced artificial intelligence systems currently available.

Managing AI-Driven Development: Meituan’s Strategy for Refactoring 310,000 Lines of Code Using Agent Evaluation Logic
Industry News

Managing AI-Driven Development: Meituan’s Strategy for Refactoring 310,000 Lines of Code Using Agent Evaluation Logic

Meituan's technical team has shared a comprehensive analysis of their experience refactoring 310,000 lines of code in an environment where over 90% of code is AI-generated. The core insight is that while AI significantly accelerates code production, it can also amplify technical debt and systemic chaos without proper constraints. To mitigate this, the team adopted an 'Agent evaluation' mindset to manage AI coding. By implementing a framework consisting of technical debt sorting, rule construction, standardized operating procedures (SOPs), and a Pre-PR (Pull Request) mechanism, they successfully transformed large-scale refactoring from a high-cost, specialized effort into a continuous, daily iterative process. This approach ensures that AI remains a productive tool rather than a source of unmanaged complexity.