Back to List
TechnologyAIVideoInnovation

Google's Gemini Veo 3.1 Launches 'Ingredients to Video' Mode for Pro/Ultra Subscribers: Create 8-Second 1080p Videos from Three Reference Images with Consistent Characters and SynthID Watermarks

Google has rolled out the Veo 3.1 video model to Gemini Pro/Ultra subscribers, introducing a new 'Ingredients to Video' mode. This feature allows users to upload three reference images simultaneously to extract character, scene, and style characteristics, which are then merged into an 8-second, 1080p video. The generated content includes an invisible SynthID watermark. Users can create videos via text prompts on web or mobile, with the system maintaining cross-frame character consistency and lighting coherence. Google demonstrated this by combining three selfies, a cyber city background, and an oil painting style image to produce a 'futuristic impressionist street walk' short film with no facial or clothing deformation. Veo 3.1 also outputs native environmental sound and supports first/last frame control and video extension. The multi-image reference feature is fully available, utilizing existing subscription quotas without additional payment plans announced.

AI新闻资讯 - AI Base

Google has today pushed the Veo 3.1 video model to its Gemini Pro/Ultra subscribers, introducing a new and innovative 'Ingredients to Video' mode. This feature empowers users to upload up to three reference images concurrently. From these images, the system intelligently extracts distinct characteristics: one for the character, another for the scene, and a third for the artistic style. These extracted elements are then seamlessly fused together to generate an 8-second, 1080p video.

A key security and authenticity feature of Veo 3.1 is the automatic inclusion of an invisible SynthID watermark within all generated content. Users can initiate video creation by simply inputting a text prompt, accessible through both web and mobile interfaces. The system is designed to maintain high fidelity across frames, ensuring consistent character appearance and coherent lighting throughout the generated video.

Google provided a compelling demonstration of Veo 3.1's capabilities. By combining three selfies taken from different angles, a cyber city background image, and an oil painting style reference, the model successfully outputted a short film depicting an 'impressionist futuristic street walk.' Notably, the demonstration highlighted the model's ability to render faces and clothing without any deformation, showcasing its advanced consistency.

Beyond visual generation, Veo 3.1 also outputs native environmental sound, enhancing the immersive quality of the videos. Furthermore, the model offers functionalities for controlling the first and last frames of the video, along with a video extension feature. Google has confirmed that the multi-image reference capability is now fully rolled out to all eligible subscribers. The generation quota for this new feature aligns with existing subscription allowances, and no additional payment plans have been announced at this time.

Related News

Technology

Open-Mercato: AI-Powered CRM/ERP Framework for R&D, Operations, and Growth – Enterprise-Grade, Modular, and Highly Customizable

Open-Mercato is an AI-supported CRM/ERP foundational framework designed to empower research and development, new processes, operations, and growth. It boasts a modular and scalable architecture, specifically tailored for teams seeking robust default functionalities alongside extensive customization options. The framework positions itself as a superior enterprise-grade alternative to solutions like Django and Retool, offering a powerful platform for businesses.

Technology

Heretic: Fully Automated Censorship Removal for Language Models Trending on GitHub

Heretic, a new project by p-e-w, has recently gained traction on GitHub Trending. Published on February 21, 2026, this tool focuses on the fully automated removal of censorship from language models. The project's primary aim is to provide a solution for users seeking to bypass restrictions within these AI systems, as indicated by its brief description and prominent GitHub presence.

Technology

Superpowers: A Comprehensive Software Development Workflow and Skill Framework for Coding Agents on GitHub Trending

Superpowers, recently featured on GitHub Trending, introduces an effective agent skill framework and a complete software development methodology. Designed for coding agents, this workflow is built upon a foundation of composable 'skills' and includes an initial set of these skills. It aims to streamline the development process for AI-driven coding agents by providing a structured and modular approach to their capabilities.