Back to List
NVIDIA CEO Jensen Huang Highlights Parabolic Demand and Cost Efficiency of Vera Rubin NVL72 at Dell Technologies World
Industry NewsNVIDIADell TechnologiesEnterprise AI

NVIDIA CEO Jensen Huang Highlights Parabolic Demand and Cost Efficiency of Vera Rubin NVL72 at Dell Technologies World

At Dell Technologies World, NVIDIA CEO Jensen Huang described the current surge in AI interest as "utterly parabolic," signaling a massive shift in enterprise adoption. Central to this momentum is the NVIDIA Vera Rubin NVL72, a breakthrough architecture designed to optimize agentic AI inference. The platform reportedly reduces the cost per token to one-tenth of previous levels, while the Vera CPU accelerates enterprise data queries by up to 3x. With over 5,000 enterprises—including global leaders like Lilly, Samsung, and Honeywell—already utilizing Dell AI Factories, the collaboration between NVIDIA and Dell is redefining the infrastructure for large-scale AI workloads. This transition toward agentic AI, supported by faster sandboxes and more efficient processing, marks a significant milestone in the industrialization of artificial intelligence.

NVIDIA Newsroom

Key Takeaways

  • Parabolic Demand: NVIDIA CEO Jensen Huang reports that demand for AI infrastructure is experiencing exponential, "parabolic" growth across the enterprise sector.
  • Drastic Cost Reduction: The NVIDIA Vera Rubin NVL72 architecture enables agentic AI inference at just one-tenth the cost per token compared to traditional methods.
  • Performance Breakthroughs: Agent sandboxes operate 50% faster on NVIDIA Vera hardware, while the Vera CPU delivers a 3x speed increase for enterprise data queries.
  • Massive Enterprise Adoption: Over 5,000 organizations, including Honeywell, Samsung, and Lilly, are now running AI workloads through Dell AI Factories.

In-Depth Analysis

The Economics of Agentic AI Inference

The announcement by Jensen Huang at Dell Technologies World underscores a pivotal shift in the economic landscape of artificial intelligence. By introducing the NVIDIA Vera Rubin NVL72, NVIDIA is addressing the primary barrier to widespread AI deployment: the cost of inference. The claim that agentic AI inference can now be performed at one-tenth the cost per token is a transformative metric for the industry.

As enterprises move beyond simple chatbots toward "agentic AI"—systems capable of autonomous reasoning and multi-step task execution—the volume of tokens processed increases significantly. A 90% reduction in cost per token allows businesses to scale these complex agents without a linear increase in operational expenses. This efficiency is not merely a marginal improvement but a fundamental restructuring of the AI cost-benefit analysis, enabling use cases that were previously financially unviable.

Hardware Acceleration for Enterprise Data and Sandboxing

Beyond cost, the technical performance of the Vera architecture provides a dual advantage in speed and agility. The integration of the Vera CPU has shown a 3x improvement in the speed of enterprise data queries. In a corporate environment where data is often siloed and massive in scale, the ability to query information three times faster directly impacts the responsiveness of AI-driven insights.

Furthermore, the 50% performance boost in agent sandboxes is critical for the development cycle of AI agents. Sandboxes are the controlled environments where AI agents are tested and refined before deployment. By running these environments 50% faster than traditional CPUs, NVIDIA and Dell are effectively shortening the R&D lifecycle for enterprise AI. This speed allows for more rapid iteration, safer testing of autonomous behaviors, and faster time-to-market for AI-driven services.

The Scale of the Dell and NVIDIA Partnership

The collaboration between NVIDIA and Dell Technologies has reached a massive scale, evidenced by the 5,000 enterprises currently leveraging Dell AI Factories. The mention of specific industry titans such as Lilly, Samsung, and Honeywell indicates that the demand is not limited to the tech sector but spans pharmaceuticals, electronics, and industrial manufacturing.

These "Dell AI Factories" represent a standardized approach to AI infrastructure, combining NVIDIA’s specialized hardware like the Vera Rubin NVL72 with Dell’s enterprise deployment capabilities. The fact that demand is described as "utterly parabolic" suggests that the infrastructure build-out is struggling to keep pace with the appetite for generative and agentic AI. This partnership serves as the backbone for this growth, providing the necessary compute power to handle the increasingly complex workloads required by global corporations.

Industry Impact

The transition to the Vera Rubin architecture and the resulting performance gains signal a new era for the AI industry. By focusing on the efficiency of the Vera CPU and the NVL72 system, NVIDIA is moving the conversation from raw training power to the practicalities of enterprise-scale inference. The 3x increase in query speed and 50% faster sandboxing suggest that the next phase of AI competition will be won by those who can integrate AI most deeply into existing data workflows.

For the broader industry, the "parabolic" demand mentioned by Huang implies a sustained investment cycle in hardware. As 5,000 enterprises lead the way, the success of companies like Samsung and Honeywell in utilizing these AI factories will likely serve as a blueprint for the rest of the global market. The focus on reducing the cost per token specifically for "agentic AI" indicates that the industry is moving toward more autonomous, high-token-usage applications, necessitating the high-efficiency hardware NVIDIA is now delivering.

Frequently Asked Questions

Question: What is the primary cost benefit of the NVIDIA Vera Rubin NVL72?

According to the announcement, the Vera Rubin NVL72 allows for agentic AI inference at one-tenth the cost per token compared to previous standards, representing a 90% reduction in inference expenses.

Question: How does the Vera CPU improve enterprise data operations?

The Vera CPU significantly enhances data processing capabilities, enabling enterprise data queries to run up to 3x faster than traditional systems, which accelerates the retrieval of insights from large datasets.

Question: Which major companies are currently using Dell AI Factories for their workloads?

Over 5,000 enterprises are utilizing the platform, with prominent examples including the pharmaceutical leader Lilly, electronics giant Samsung, and industrial conglomerate Honeywell.

Related News

Managing AI Coding with Agent Evaluation Logic: Lessons from a 310,000-Line Code Refactoring Project
Industry News

Managing AI Coding with Agent Evaluation Logic: Lessons from a 310,000-Line Code Refactoring Project

Meituan's technical team has introduced a novel approach to managing AI-driven development by applying Agent evaluation logic to a massive 310,000-line code refactoring initiative. With AI now capable of generating over 90% of code, the primary challenge has shifted from production speed to the management of system complexity and chaos. By implementing a structured framework—including technical debt sorting, rule construction, a standardized refactoring SOP, and a Pre-PR mechanism—the team has successfully transitioned refactoring from a high-cost, periodic task into a continuous, iterative daily action. This methodology ensures that AI's capabilities are constrained by unified standards, preventing the amplification of technical debt and ensuring long-term system stability in an AI-native development environment.

openpilot: The Robotics Operating System Revolutionizing Driver Assistance for 300+ Vehicle Models
Industry News

openpilot: The Robotics Operating System Revolutionizing Driver Assistance for 300+ Vehicle Models

openpilot, developed by commaai, has positioned itself as a pivotal operating system specifically designed for the robotics sector. Its current primary application is the enhancement and upgrading of driver assistance systems across a vast range of automotive hardware. With compatibility extending to over 300 supported car models, openpilot demonstrates a unique approach to scalable automation. By functioning as a foundational operating system rather than a standalone application, it provides the necessary infrastructure to bridge complex robotic software with diverse vehicle hardware. This development signifies a major step in the democratization of advanced driving technologies, offering a standardized platform for robotic control that can be integrated into a wide variety of existing consumer vehicles, thereby extending their functional capabilities through software-driven innovation.

Asia’s Most Active AI Investors: A Comprehensive Analysis of Regional Capital Inflow
Industry News

Asia’s Most Active AI Investors: A Comprehensive Analysis of Regional Capital Inflow

Tech in Asia has released a significant report identifying the most active investors currently directing capital toward the artificial intelligence sector within Asia. The report highlights a major trend where substantial financial resources are being poured into AI startups across the continent. This compilation serves as a critical guide for understanding which entities are driving the growth of the Asian AI ecosystem. By focusing on the most active participants, the list provides a clear picture of the investment landscape, emphasizing the high level of interest and financial commitment from the investment community toward Asian AI innovation. This influx of capital is a defining characteristic of the current technological and financial environment in the region.