Back to List
System Prompt Leaks: Comprehensive Repository Reveals Internal Instructions for GPT-5.4, Claude 4.6, and Gemini 3.1
Industry NewsAI SecurityLarge Language ModelsGitHub Trending

System Prompt Leaks: Comprehensive Repository Reveals Internal Instructions for GPT-5.4, Claude 4.6, and Gemini 3.1

A significant repository hosted on GitHub by user asgeirtj has surfaced, documenting the leaked system prompts for the industry's most advanced AI models. The collection includes internal instructions for OpenAI's GPT-5.4 and GPT-5.3, Anthropic's Claude Opus 4.6 and Sonnet 4.6, and Google's Gemini 3.1 Pro and 3 Flash. Additionally, the leak covers system prompts for Grok 4.2 and Perplexity. These system prompts serve as the foundational behavioral guidelines for Large Language Models (LLMs), dictating how they interact with users and maintain safety protocols. The repository is reportedly updated on a regular basis, providing a rare look into the backend configurations of next-generation AI systems.

GitHub Trending

Key Takeaways

  • Extensive Model Coverage: The leak includes system prompts for high-profile models including GPT-5.4, Claude 4.6, Gemini 3.1, and Grok 4.2.
  • Centralized Repository: The data is hosted and regularly updated on GitHub under the project 'system_prompts_leaks'.
  • Diverse AI Ecosystem: The collection spans multiple developers, including OpenAI, Anthropic, Google, xAI, and Perplexity.
  • Technical Insight: These prompts reveal the underlying instructions and constraints placed on AI agents and coding tools like Claude Code and Gemini CLI.

In-Depth Analysis

Unveiling the Architecture of AI Behavior

The 'system_prompts_leaks' repository provides a detailed look at the internal directives that govern the behavior of leading AI models. By extracting prompts from versions such as GPT-5.4 and Claude Opus 4.6, the repository highlights the specific personas and operational boundaries set by AI developers. These system prompts are critical because they define the model's identity, its tone of voice, and the safety guardrails it must follow before a user even enters a query.

Comparative Directives Across Platforms

The inclusion of prompts from Gemini 3.1 Pro, Grok 4.2, and Perplexity allows for a comparative study of how different organizations approach AI alignment. For instance, the repository contains specific prompts for specialized tools like 'Claude Code' and 'Gemini CLI,' suggesting that system instructions are becoming increasingly modular and task-specific. The ongoing updates to this repository indicate a persistent effort to track how these instructions evolve as models are patched or upgraded.

Industry Impact

The disclosure of system prompts for flagship models like GPT-5.4 and Claude 4.6 has significant implications for the AI industry. For researchers, it provides transparency into the 'black box' of AI alignment and safety engineering. However, for developers, such leaks represent a potential security challenge, as understanding the system prompt is often the first step in developing 'jailbreak' techniques to bypass model restrictions. This repository underscores the ongoing tension between open-source transparency and the proprietary safety measures of major AI labs.

Frequently Asked Questions

Question: Which specific AI models are included in the leak?

The repository contains system prompts for OpenAI (GPT-5.4, GPT-5.3, Codex), Anthropic (Claude Opus 4.6, Sonnet 4.6, Claude Code), Google (Gemini 3.1 Pro, 3 Flash, CLI), xAI (Grok 4.2, 4), and Perplexity.

Question: What is the purpose of a system prompt?

A system prompt is a set of foundational instructions that tells an AI model how to behave, what rules to follow, and what its specific role or persona should be during a conversation.

Question: Where can this information be found?

The information is maintained in a GitHub repository titled 'system_prompts_leaks' by the author asgeirtj.

Related News

Meituan LongCat Unveils General 365: A Rigorous New Standard for AI Reasoning Evaluation
Industry News

Meituan LongCat Unveils General 365: A Rigorous New Standard for AI Reasoning Evaluation

Meituan's LongCat team has officially released General 365, a new benchmark designed to evaluate the reasoning capabilities of artificial intelligence models. The initial testing phase involved 26 mainstream models, revealing a significant performance gap in the industry. According to the results, the top-performing model, Gemini 3 Pro, achieved an accuracy rate of only 62.8%. More strikingly, the vast majority of the models tested failed to reach the 60% accuracy threshold, which is considered a basic passing mark. This release by Meituan aims to provide a more challenging and accurate metric for assessing how well modern AI can handle complex reasoning tasks, highlighting that even the most advanced systems currently struggle with the demands of the General 365 evaluation.

Managing AI Coding with Agent Evaluation Logic: Insights from a 310,000-Line Code Refactoring Practice
Industry News

Managing AI Coding with Agent Evaluation Logic: Insights from a 310,000-Line Code Refactoring Practice

As AI-generated code begins to comprise over 90% of modern systems, the technical challenge shifts from speed to governance. Meituan's technical team has shared a comprehensive framework for managing AI coding based on their experience refactoring 310,000 lines of code. The core of their approach involves using an 'Agent evaluation' mindset to prevent AI from amplifying system chaos. By implementing technical debt sorting, rule construction, standardized operating procedures (SOPs), and a Pre-PR mechanism, the team successfully transitioned large-scale refactoring from a high-cost, specialized project into a sustainable, daily iterative process. This shift emphasizes that the ultimate trajectory of a system is determined by the constraints placed on AI rather than the speed of code generation.

LongCat Powers OpenClaw with Efficiency Engine: Boosting Automation Performance by 30% via Official API
Industry News

LongCat Powers OpenClaw with Efficiency Engine: Boosting Automation Performance by 30% via Official API

The LongCat team has officially introduced a stable and compliant free API for OpenClaw, aimed at significantly enhancing the efficiency of automated tasks. By providing a direct official channel, LongCat addresses the inherent risks associated with third-party subscriptions, such as account security vulnerabilities and service instability. This new efficiency engine allows developers to optimize their automation workflows, potentially increasing speed by 30%. The initiative by the Meituan Technical Team emphasizes the importance of using official, secure pathways to maintain the integrity of developer tools and ensure consistent service performance in complex automation environments.