Mercury Edit 2: The Fastest Diffusion-Based LLM for Next-Edit Prediction and Real-Time Coding
Mercury Edit 2 is a purpose-built diffusion Large Language Model (dLLM) designed for the most latency-sensitive coding workflows. By generating tokens in parallel, Mercury Edit 2 provides lightning-fast next-edit predictions that feel instantaneous. It improves on previous iterations, delivering higher-quality, more targeted code suggestions with a 48% higher acceptance rate. Optimized for the Inception Platform and integrated with editors like Zed, Mercury Edit 2 offers superior speed and quality across multiple programming languages and development scenarios.
2026-04-06
Mercury Edit 2 Product Information
Introducing Mercury Edit 2: The Fastest Diffusion LLM for Next-Edit Prediction
In the rapidly evolving world of software development, latency is the enemy of flow. Today, we are proud to introduce Mercury Edit 2, a purpose-built diffusion LLM (dLLM) specifically engineered for the most latency-sensitive component of modern development workflows: next-edit prediction.
Mercury Edit 2 represents a significant upgrade to our previous next-edit models. It is designed to complement our existing auto-complete endpoints by providing a predictive layer that anticipates your next move within the codebase. By utilizing cutting-edge diffusion technology, Mercury Edit 2 generates tokens in parallel, ensuring that predictions arrive fast enough to feel like a natural extension of your own thought process.
What is Mercury Edit 2?
Mercury Edit 2 is a specialized Large Language Model that focuses on predicting the very next change a developer will make. Unlike traditional models that suggest code one token at a time, this diffusion LLM allows for high-speed parallel generation.
Using your recent edits and comprehensive codebase context, Mercury Edit 2 analyzes the state of your project to suggest the most logical next step. Whether you are refactoring, renaming variables, or implementing a new feature, you can simply press Tab to accept the suggestion and keep your momentum.
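To make the contrast with one-token-at-a-time decoding concrete, here is a toy Python sketch of the parallel-unmasking idea behind diffusion decoding. It is purely illustrative and not Inception's actual implementation: the toy "denoiser" already knows the target sequence and is simply "confident" about alternating positions on alternating steps, so the whole sequence is filled in a couple of parallel refinement steps instead of one position per step.

```python
MASK = "_"
TARGET = "print(x)"  # the sequence our toy "denoiser" knows how to produce

def propose(step):
    # Toy stand-in for a diffusion denoiser: it proposes the target token
    # at every position at once, but is only "confident" at positions
    # whose index parity matches the current step.
    tokens = list(TARGET)
    confident = [i % 2 == step % 2 for i in range(len(TARGET))]
    return tokens, confident

def diffusion_decode(target_len, steps=2):
    # Start from a fully masked draft and refine all positions in parallel.
    draft = MASK * target_len
    for step in range(steps):
        tokens, confident = propose(step)
        # Commit high-confidence proposals; leave the rest masked.
        draft = "".join(
            tok if old == MASK and keep else old
            for old, tok, keep in zip(draft, tokens, confident)
        )
    return draft

# Two parallel refinement steps fill all 8 positions, versus 8 steps
# for strictly sequential, one-token-at-a-time decoding.
print(diffusion_decode(len(TARGET)))  # → print(x)
```

A real dLLM replaces the oracle with a learned model that predicts and re-scores all masked positions each step, but the latency win comes from the same structure: a few parallel passes instead of one pass per token.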
Key Features of Mercury Edit 2
Diffusion-Based Token Generation
The standout feature of Mercury Edit 2 is its use of diffusion to generate tokens in parallel. This architectural choice drastically reduces latency, making it the fastest diffusion LLM for edit prediction on the market.
High-Quality Training and Alignment
To ensure the utility of Mercury Edit 2, we utilized a sophisticated training pipeline:
- Curated Datasets: The model was trained on high-quality edits across a broad range of programming languages.
- Human Preference Alignment (KTO): We aligned the model with real human preferences using Kahneman-Tversky Optimization (KTO), a method that learns from unpaired (binary) feedback rather than paired comparisons.
- Reduced Distraction: This alignment makes Mercury Edit 2 27% more selective, ensuring suggestions are targeted and less likely to distract the developer.
Superior Quality and Performance
According to our internal and open-source benchmarks (including Instinct, FIM, and NEP), Mercury Edit 2 delivers:
- 48% higher acceptance rate compared to previous models.
- Superior speed when compared to custom next-edit models and speed-optimized frontier models.
- High accuracy across tasks like variable renaming, refactoring, and line completion.
Use Cases for Mercury Edit 2
Mercury Edit 2 is versatile and adapts to various coding scenarios including:
- Next-Edit Prediction: Automatically anticipating the next block of code or modification based on your recent history.
- Refactoring: Streamlining the process of restructuring existing code with high-confidence suggestions.
- Variable Renaming: Quickly updating identifiers across a contextually aware scope.
- Feature Implementation: Accelerating the creation of new functionality by predicting the logical flow of new code.
- Line Completion: Providing instant completion for repetitive or predictable code patterns.
How to Use Mercury Edit 2
Getting started with Mercury Edit 2 is straightforward, whether you are an individual developer or an enterprise team.
- Access via Inception Platform: Mercury Edit 2 is available now on the Inception API Platform.
- Integrate with Zed: Users of the Zed code editor can configure the model to be their primary edit prediction provider.
- Use the API: Developers can integrate Mercury Edit 2 into their own tools using our API. The pricing is highly competitive:
- Input Tokens: $0.25 / 1M tokens
- Output Tokens: $0.75 / 1M tokens
- Cached Input: $0.025 / 1M tokens
- Claim Free Tokens: Every new account on the Inception API Platform is automatically granted 10 million FREE tokens.
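As a quick sanity check on the pricing above, here is a short Python sketch that estimates the cost of a single request. The rates come directly from the list above; the function name and the example token counts are illustrative, not part of the Inception API.

```python
# Per-million-token rates (USD), taken from the pricing list above.
RATES = {"input": 0.25, "output": 0.75, "cached_input": 0.025}

def request_cost(input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate the USD cost of one Mercury Edit 2 API request."""
    return (
        input_tokens * RATES["input"]
        + output_tokens * RATES["output"]
        + cached_input_tokens * RATES["cached_input"]
    ) / 1_000_000

# Example: 4,000 context tokens (half served from cache) plus a
# 200-token edit suggestion.
cost = request_cost(2_000, 200, cached_input_tokens=2_000)
print(f"${cost:.6f}")  # → $0.000700
```

At these rates, even a context-heavy prediction costs a small fraction of a cent, which is why caching repeated context (billed at one-tenth the input rate) matters for editor integrations that resend the surrounding file on every keystroke.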
FAQ
Q: What makes Mercury Edit 2 different from standard auto-complete?
A: While auto-complete suggests the next few characters or words, Mercury Edit 2 is a next-edit prediction model. It uses the context of your recent edits and the entire codebase to predict the next meaningful change you will make, often spanning multiple lines or specific logic shifts.
Q: Why does Mercury Edit 2 use diffusion?
A: Diffusion allows the model to generate tokens in parallel rather than sequentially. This is what makes Mercury Edit 2 exceptionally fast, providing the low latency required for a seamless coding experience.
Q: How was the model's accuracy measured?
A: We used a suite of four benchmarks: Instinct, Fill-in-the-middle (FIM), Next-edit Prediction (NEP), and an internal benchmark. These tests use LLM-as-a-judge and functional test cases to ensure the suggestions match human-written gold standards.
Q: Is there a free trial available?
A: Yes. New accounts on the Inception API Platform receive 10 million free tokens.
Q: How can I provide feedback or get support?
A: You can reach out to the team at [email protected] or join our Discord community for early access and support.








