
Meituan Open Sources LongCat-Next: A Native Multimodal Model for Real-World AI Perception and Interaction
Meituan's technical team has released and open-sourced LongCat-Next, a native multimodal model designed to narrow the gap between AI and the physical world. The model treats vision and voice as "native languages," with the aim of improving how AI perceives and interacts with its environment. The release includes the core LongCat-Next model and its discrete tokenizer, giving developers the components needed to build systems that understand and act in real-world scenarios. Meituan presents the release as a significant step in its exploration of physical-world AI applications and as a foundation on which the global developer community can build AI that senses and responds to the physical world.
