AI News on June 24, 2026

Meituan Open Sources LongCat-Next: A Native Multimodal Model for Real-World AI Perception and Interaction
Open Source

Meituan Open Sources LongCat-Next: A Native Multimodal Model for Real-World AI Perception and Interaction

Meituan's technical team has officially released and open-sourced LongCat-Next, a native multimodal model designed to bridge the gap between AI and the physical world. By treating vision and voice as "native languages," this model aims to enhance how AI perceives and interacts with its environment. The release includes the core LongCat-Next model and its discrete tokenizer, providing developers with the tools to build systems capable of understanding and acting within real-world scenarios. This move marks a significant step in Meituan's exploration of physical-world AI applications, offering the global developer community a foundation for creating AI that can truly sense and respond to the complexities of the physical realm.

美团技术团队
Meituan Open Sources AIGC Poster Generation Framework: Analyzing the Generation-Editing-Evaluation Technical Loop
Open Source

Meituan Open Sources AIGC Poster Generation Framework: Analyzing the Generation-Editing-Evaluation Technical Loop

Meituan's Intelligent Creation Team has officially unveiled and open-sourced its comprehensive technical system for AIGC-driven poster generation. The framework is built upon a sophisticated "Generation-Editing-Evaluation" closed loop, designed to bridge the gap between raw AI output and production-ready commercial assets. Currently deployed within Meituan Waimai and various Brand IP scenarios, this system addresses the practical challenges of automated design by integrating creative generation with precise editing tools and automated quality assessment. By open-sourcing the entire technical stack, Meituan aims to provide the developer community with a proven, industrial-grade solution for scalable visual content creation. This move signifies a major step in the practical application of AIGC within the food delivery and digital branding sectors, offering a structured approach to maintaining design quality at scale.

美团技术团队
Meituan LongCat Team Unveils WBench: The First Systematic Multi-Round Benchmark for Interactive Video World Models
Industry News

Meituan LongCat Team Unveils WBench: The First Systematic Multi-Round Benchmark for Interactive Video World Models

The Meituan LongCat team has announced the release and open-sourcing of WBench, a pioneering systematic multi-round evaluation benchmark specifically designed for interactive video world models. Positioned as a diagnostic "CT scanner" for AI, WBench aims to provide precise insights into the technical bottlenecks that occur during the transition from passive video generation to active user interaction. By evaluating models across diverse scenarios—ranging from lunar walks to futuristic cyber cities—WBench addresses the critical need for standardized metrics in the evolving field of world models. This benchmark represents a significant step in identifying where current AI systems struggle to maintain consistency and logic during complex, multi-stage interactive sequences, offering a roadmap for future development in the industry.

美团技术团队
Meituan at ACL 2026: Advancing Generative AI Through Evaluation, Reasoning, and Optimization
Industry News

Meituan at ACL 2026: Advancing Generative AI Through Evaluation, Reasoning, and Optimization

The Meituan Technical Team has announced that six of its research papers have been accepted for ACL 2026, a premier international conference in computational linguistics and natural language processing (NLP). These papers represent a significant contribution to the field, covering a diverse range of cutting-edge topics including large language model (LLM) evaluation, complex process reasoning, and competition-level mathematical thinking optimization. Furthermore, the research explores advancements in reinforcement learning and the emerging field of generative recommendation systems. By focusing on these critical areas, Meituan aims to establish a new paradigm for generative AI, bridging the gap between theoretical research and practical industry applications. This selection underscores Meituan's growing influence in the global AI research community and its commitment to solving complex technical challenges in the NLP domain.

美团技术团队
Meituan LongCat Open Sources General 365: A New Benchmark Revealing AI Reasoning Challenges
Industry News

Meituan LongCat Open Sources General 365: A New Benchmark Revealing AI Reasoning Challenges

Meituan's LongCat team has officially released General 365, an open-source benchmark designed to evaluate the reasoning capabilities of modern AI models. Through a rigorous assessment of 26 mainstream models, the team discovered a significant performance gap in the industry. Gemini 3 Pro emerged as the top performer with an accuracy rate of 62.8%, yet it remains one of the few to surpass the 60% mark. The majority of the models tested failed to reach this basic competency level, highlighting the ongoing challenges in developing advanced reasoning within artificial intelligence. This benchmark serves as a critical new tool for the AI community to measure and improve logical processing, setting a high bar for future model development.

美团技术团队
Meituan Technical Team Launches LARYBench to Standardize Latent Action Representation Learning from Human Video Data
Research Breakthrough

Meituan Technical Team Launches LARYBench to Standardize Latent Action Representation Learning from Human Video Data

The Meituan Technical Team has unveiled LARYBench (Latent Action Representation Yielding Benchmark), a systematic framework for evaluating general latent action representations derived from large-scale visual datasets. The benchmark's initial findings challenge the status quo of embodied AI development, showing that general-purpose vision models significantly surpass specialized action expert models in both generalization and control precision. Crucially, the research demonstrates that embodied action representations can emerge spontaneously from large-scale human video data, providing a new pathway for training robots and autonomous systems using existing non-robotic visual information. This breakthrough suggests that the future of embodied intelligence may lie in leveraging massive, diverse human video datasets rather than relying solely on specialized, task-specific robotic data.

美团技术团队
Meituan Open-Sources LongCat-Video-Avatar 1.5: Advancing Digital Human Video Generation for Commercial Use
Open Source

Meituan Open-Sources LongCat-Video-Avatar 1.5: Advancing Digital Human Video Generation for Commercial Use

Meituan's technical team has officially open-sourced LongCat-Video-Avatar 1.5, marking a significant transition from experimental state-of-the-art (SOTA) research to practical, commercial-grade digital human video generation. This major update introduces comprehensive improvements in lip-sync accuracy, physical plausibility, and long-video stability. Furthermore, the model now supports multi-person interactions and features optimized inference efficiency. Designed to handle complex commercial environments, LongCat-Video-Avatar 1.5 aims to provide stable, natural, and high-quality content, effectively moving digital human technology from controlled laboratory settings to diverse, real-world applications. The release emphasizes a shift toward "thousand people, thousand faces" personalization in the digital human landscape.

美团技术团队
Meituan LongCat Team Unveils LongCat-AudioDiT: Redefining the Limits of Zero-Shot Voice Cloning Technology
Research Breakthrough

Meituan LongCat Team Unveils LongCat-AudioDiT: Redefining the Limits of Zero-Shot Voice Cloning Technology

The Meituan LongCat team has officially announced the release of LongCat-AudioDiT, a groundbreaking Text-to-Speech (TTS) model designed to push the boundaries of zero-shot voice cloning. By fundamentally reimagining the audio synthesis pipeline, the model abandons traditional intermediate representations such as Mel-spectrograms. Instead, LongCat-AudioDiT operates directly within the waveform latent space using a diffusion-based architecture. This strategic shift is engineered to eliminate the cascade errors typically caused by multi-stage data conversions, allowing the AI to learn the inherent laws of sound directly. This development marks a significant milestone in the pursuit of high-fidelity, seamless voice mimicry without the need for extensive fine-tuning, potentially setting a new technical standard for the AI audio industry.

美团技术团队
LongCat-Flash-Prover: Meituan Open-Sources AI Model for Rigorous Mathematical Theorem Proving and Formalization
Open Source

LongCat-Flash-Prover: Meituan Open-Sources AI Model for Rigorous Mathematical Theorem Proving and Formalization

The Meituan technical team has announced the open-source release of LongCat-Flash-Prover, a specialized AI model designed to tackle the complexities of mathematical formalization and theorem proving. Unlike conventional AI models that focus primarily on achieving correct numerical outputs, LongCat-Flash-Prover is built to maintain rigorous logical chains required for formal verification. The project addresses a fundamental challenge in AI reasoning: the inherent ambiguity of natural language, which can lead to the failure of complex mathematical proofs. By prioritizing formalization over simple answer-guessing, Meituan aims to provide a tool that ensures every step of a mathematical argument is logically sound. This release marks a significant contribution to the open-source community, specifically targeting the transition from intuitive AI responses to verifiable mathematical rigor.

美团技术团队
Anthropic-Cybersecurity-Skills: 817 Structured AI Agent Capabilities Mapped to Global Security Frameworks
Industry News

Anthropic-Cybersecurity-Skills: 817 Structured AI Agent Capabilities Mapped to Global Security Frameworks

A significant new repository titled 'Anthropic-Cybersecurity-Skills' has been released, providing a comprehensive library of 817 structured cybersecurity skills specifically designed for AI agents. This initiative utilizes the agentskills.io standard to ensure interoperability across more than 20 major platforms, including Claude Code, GitHub Copilot, and Gemini CLI. The skills are meticulously mapped to six essential industry frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND, NIST AI RMF, and MITRE F3 (Fight Fraud). By bridging the gap between AI automation and standardized security protocols, this project offers a structured roadmap for deploying AI agents in complex security environments, focusing on threat detection, risk management, and fraud prevention.

GitHub Trending
Palmier Pro: A New AI-Centric Video Editing Solution Debuts for macOS Users
Product Launch

Palmier Pro: A New AI-Centric Video Editing Solution Debuts for macOS Users

Palmier Pro, a specialized video editing application designed specifically for artificial intelligence workflows on macOS, has been introduced by the developer palmier-io. Hosted on GitHub, this project distinguishes itself by being built from the ground up for AI integration rather than simply adding AI features to an existing framework. While the initial release information focuses on its core identity as an AI-native tool for the Apple ecosystem, it signals a growing trend of platform-specific creative software optimized for modern machine learning capabilities. The project's presence on GitHub suggests an accessible approach to distribution for macOS users looking for AI-driven video manipulation tools.

GitHub Trending
Garry Tan Introduces gstack: A Specialized Claude Code Configuration Featuring 23 Opinionated Tools for Multi-Role AI Orchestration
Open Source

Garry Tan Introduces gstack: A Specialized Claude Code Configuration Featuring 23 Opinionated Tools for Multi-Role AI Orchestration

Garry Tan has unveiled "gstack," a highly curated and "opinionated" setup designed for Claude Code. This configuration integrates 23 specific tools that enable the AI to function across various professional capacities, including CEO, Designer, Engineering Manager, Release Manager, Documentation Engineer, and Quality Assurance (QA). The project reflects a significant shift in the software development paradigm, where AI agents are no longer just coding assistants but are capable of managing complex, multi-disciplinary tasks. Tan notes that this advanced setup has fundamentally changed his approach to development, suggesting a transition away from manual coding toward high-level AI orchestration. By providing a structured framework for these diverse roles, gstack aims to streamline the entire development lifecycle through specialized AI personas.

GitHub Trending
Penpot: An Open Source Design Tool Redefining Collaboration Between Designers and Developers
Open Source

Penpot: An Open Source Design Tool Redefining Collaboration Between Designers and Developers

Penpot has emerged as a significant open-source design tool specifically engineered to bridge the gap between design and code collaboration. By providing a platform that caters to both designers and developers, Penpot facilitates a more integrated workflow. As an open-source alternative in the design space, it emphasizes transparency and community-driven development. The tool focuses on streamlining the transition from visual concepts to functional code, addressing a long-standing friction point in the product development lifecycle. This analysis explores its role as a collaborative bridge and its position within the open-source ecosystem, highlighting how it fosters a shared environment for creative and technical teams to work together effectively.

GitHub Trending
HeyGen Launches Hyperframes: A New Framework to Write HTML and Render Video Built Specifically for AI Agents
Open Source

HeyGen Launches Hyperframes: A New Framework to Write HTML and Render Video Built Specifically for AI Agents

HeyGen, a prominent leader in AI-driven video generation, has introduced a new project titled 'Hyperframes' on GitHub. The framework is designed with a clear and concise mission: to allow developers to write HTML and render it directly into video content. Distinctively positioned as being 'built for agents,' Hyperframes aims to streamline the process of programmatic video creation, enabling autonomous AI systems to generate visual media through standard web coding languages. This development represents a significant shift in the video production landscape, moving away from traditional manual editing toward a code-centric, automated approach. By leveraging the ubiquity of HTML, Hyperframes lowers the barrier for integrating dynamic video rendering into AI-driven workflows, potentially transforming how digital content is synthesized and delivered by intelligent agents.

GitHub Trending
OpenMontage: The World’s First Open-Source Agent-Based Video Production System for AI Assistants
Open Source

OpenMontage: The World’s First Open-Source Agent-Based Video Production System for AI Assistants

OpenMontage has officially launched as the world's first open-source agent-based video production system, marking a significant milestone in the intersection of artificial intelligence and multimedia creation. Developed by calesthio and hosted on GitHub, the project introduces a massive framework consisting of 12 specialized pipelines, 52 integrated tools, and over 500 distinct agent skills. The system is designed to transform standard AI programming assistants into comprehensive video production studios, allowing for automated and highly sophisticated content creation. By leveraging an agentic architecture, OpenMontage provides a modular and scalable solution for developers and creators looking to automate the complexities of video editing, rendering, and assembly through the power of open-source AI agents.

GitHub Trending
Alibaba Files Lawsuit Against US Pentagon Over Inclusion in Chinese Military-Linked Companies Blacklist
Industry News

Alibaba Files Lawsuit Against US Pentagon Over Inclusion in Chinese Military-Linked Companies Blacklist

Alibaba Group has initiated legal action against the United States Department of Defense (Pentagon) following its designation on the Section 1260H list. This list identifies entities allegedly supporting the Chinese military. The lawsuit challenges the Pentagon's decision to include the e-commerce and technology giant on this specific blacklist, which labels companies as "Chinese military companies" operating in the United States. This move highlights the escalating legal and regulatory tensions between major Chinese technology firms and U.S. defense authorities regarding allegations of military-civil fusion and national security concerns. The outcome of this legal challenge could have significant implications for how international tech entities are categorized and regulated under U.S. law.

Tech in Asia
Grammarly-Owned Superhuman Acquires AI Detection Platform GPTZero to Strengthen Writing Ecosystem
Industry News

Grammarly-Owned Superhuman Acquires AI Detection Platform GPTZero to Strengthen Writing Ecosystem

In a significant move within the artificial intelligence sector, Superhuman—a company owned by the writing assistance giant Grammarly—has officially acquired GPTZero. GPTZero, which has become a household name in AI content verification, originally started as a senior thesis project. Since its inception, the platform has experienced exponential growth, now boasting a user base of over 19 million registered individuals. This acquisition represents a strategic consolidation of writing enhancement and AI detection technologies, signaling a new phase in how digital content is created and verified. The deal highlights the immense value of transparency tools in the current generative AI era and underscores the successful transition of academic innovation into a massive commercial enterprise.

Tech in Asia
South Korean Smilegate Investment Secures $40 Million Initial Close for New Artificial Intelligence Fund
Funding

South Korean Smilegate Investment Secures $40 Million Initial Close for New Artificial Intelligence Fund

Smilegate Investment, a prominent South Korean venture capital firm established in 1999, has successfully reached an initial close of $40 million for its newly launched artificial intelligence (AI) fund. With a long-standing history in the investment sector spanning over two decades, the firm currently manages assets totaling approximately 900 billion won, which translates to roughly US$585 million. This strategic move to secure $40 million for AI-focused initiatives highlights the firm's commitment to the evolving technology landscape. The initial close marks a significant step in the firm's capital deployment strategy, leveraging its substantial management experience to support the growth of the AI industry. The fund aims to capitalize on emerging opportunities within the artificial intelligence sector, backed by Smilegate's robust financial foundation and historical expertise in the South Korean market.

Tech in Asia
MoEngage Acquires Technology to Deploy Individual AI Agents for Personalized Marketing Future
Industry News

MoEngage Acquires Technology to Deploy Individual AI Agents for Personalized Marketing Future

MoEngage, a prominent player in the marketing automation space, has completed an all-cash acquisition to integrate advanced technology capable of assigning dedicated AI agents to individual customers. This strategic move underscores the company's belief that the future of marketing lies in the deployment of millions of autonomous agents. By leveraging this new technology, MoEngage aims to transform customer engagement through hyper-personalization at an unprecedented scale. The deal highlights a significant shift in the marketing industry toward agentic AI solutions, focusing on one-to-one interactions rather than broad segments. While specific financial details remain undisclosed beyond the all-cash nature of the transaction, the acquisition positions MoEngage as a leader in the evolving landscape of AI-driven customer relationship management.

TechCrunch AI
Google Home Enhances Familiar Faces Recognition to Identify Users Even When Facing Away
Product Launch

Google Home Enhances Familiar Faces Recognition to Identify Users Even When Facing Away

Google has launched a significant update to its Google Home ecosystem, specifically improving the 'Familiar Faces' recognition feature. Starting June 23rd, 2026, the system is being expanded to better identify individuals who have already been tagged in a user's library, even in scenarios where they are not directly looking at the camera. This update addresses a common limitation in smart home security by allowing cameras to maintain identification when a person is facing away. By refining how the system recognizes known individuals, Google aims to reduce the frequency of misidentifications and 'unknown person' alerts, providing a more accurate and seamless monitoring experience for smart home users. The rollout marks a technical step forward in how ambient computing handles identity and presence within the home environment.

The Verge
Hollywood Distribution Giants Pass on Sam Altman Biopic 'Artificial' Directed by Luca Guadagnino
Industry News

Hollywood Distribution Giants Pass on Sam Altman Biopic 'Artificial' Directed by Luca Guadagnino

In a surprising turn for the film industry, major distribution powerhouses including Netflix, A24, Focus Features, and Warner Bros.' Clockwork have reportedly declined to pick up 'Artificial,' the upcoming biographical drama centered on OpenAI CEO Sam Altman. Directed by the acclaimed Luca Guadagnino, the film explores the life and influence of one of the tech industry's most pivotal figures. While the industry's largest players are distancing themselves from the project, smaller prestige distributors such as Neon and Mubi are still reportedly showing interest. This collective rejection by mainstream studios suggests a complex tension between Hollywood's creative output and the growing influence of artificial intelligence leaders, raising questions about the industry's willingness to scrutinize the architects of the AI revolution.

The Verge
Nationwide Train Services in Germany Halted Following Major Communication System Failure
Industry News

Nationwide Train Services in Germany Halted Following Major Communication System Failure

On June 23, 2026, the German rail network experienced a significant disruption as train services were halted across the country. The stoppage was officially attributed to a technical problem within the communication system essential for rail operations. This incident led to a total standstill of traffic on the national network, affecting thousands of passengers and highlighting the vulnerability of critical transportation infrastructure. While specific technical details regarding the nature of the communication error were not immediately disclosed, the scale of the disruption suggests a systemic failure. Authorities and rail operators are working to resolve the issue, which has caused widespread travel delays throughout Germany.

Hacker News
Prime Day Deal: Roborock Saros 20 Hits New Record Low Price with $240 Discount
Industry News

Prime Day Deal: Roborock Saros 20 Hits New Record Low Price with $240 Discount

The Roborock Saros 20, a highly-regarded robot vacuum and mop hybrid, has reached a significant pricing milestone during the Prime Day sales event. Currently available for $1,359.99, the device features a $240 reduction from its standard retail price, marking a new all-time low. This deal is accessible through both Amazon and Roborock’s official online storefront. Recognized for its high level of automation, the Saros 20 is described as a device that users 'barely have to think about,' positioning it as a top-tier choice in the competitive hybrid cleaning market. This analysis explores the implications of this price drop for consumers looking to invest in premium home maintenance technology and the strategic timing of this discount during one of the year's largest retail events.

The Verge
Open Source

FUTO Releases Comprehensive Open-Source Dataset of One Million English Swipes for Mobile Input Development

FUTO has announced the release of a significant dataset containing over one million QWERTY English swipes, now available on HuggingFace under the MIT license. The collection process began in August 2024, utilizing a voluntary mobile-based platform where users swiped Wikipedia-sourced sentences word-by-word. After filtering for quality, the final dataset was released in March 2025. This initiative aims to improve swipe typing models and provide a robust benchmark for evaluating different typing systems. FUTO utilized this data extensively to refine its own models, marking a major contribution to open-source mobile input technology and linguistic data accessibility. By providing this data under a permissive license, FUTO enables developers to enhance mobile keyboard accuracy and performance.

Hacker News
GPT-5 Pro Solves Three-Year Immunology Mystery Regarding T Cell Behavior
Industry News

GPT-5 Pro Solves Three-Year Immunology Mystery Regarding T Cell Behavior

In a significant advancement for both artificial intelligence and biological science, GPT-5 Pro has assisted immunologist Derya Unutmaz in resolving a scientific mystery that had remained unsolved for three years. The breakthrough specifically concerns the behavior of T cells, which are fundamental components of the human immune system. By utilizing the analytical capabilities of OpenAI's latest model, researchers were able to gain critical insights that had previously eluded the scientific community. This development is expected to have far-reaching implications for medical science, particularly in the fields of oncology and autoimmune disease research. The successful application of GPT-5 Pro in this context underscores the growing role of advanced AI models in accelerating complex scientific discoveries and providing solutions to long-standing biological puzzles.

OpenAI Blog
Anthropic Launches Claude Tag for Slack to Capture Organizational Context and Institutional Knowledge in Enterprise Workflows
Product Launch

Anthropic Launches Claude Tag for Slack to Capture Organizational Context and Institutional Knowledge in Enterprise Workflows

Anthropic has officially introduced Claude Tag, a new AI-driven feature designed to function as an always-on teammate within the Slack communication platform. Moving beyond basic productivity enhancements, Claude Tag is a strategic initiative aimed at capturing and internalizing a company's unique organizational context, institutional knowledge, and specific enterprise workflows. By integrating directly into the flow of Slack messages, the tool learns the nuances of how a business operates in real-time. This development marks a significant step for Anthropic in providing deeper, context-aware AI solutions for the enterprise sector, ensuring that the AI understands the specific environment in which it operates rather than relying solely on general data.

TechCrunch AI