Google's Gemini Veo 3.1 Launches 'Ingredients to Video' Mode for Pro/Ultra Subscribers: Create 8-Second 1080p Videos from Three Reference Images with Consistent Characters and SynthID Watermarks

Google has rolled out the Veo 3.1 video model to Gemini Pro/Ultra subscribers, introducing a new 'Ingredients to Video' mode. The mode lets users upload up to three reference images at once; the system extracts character, scene, and style characteristics from them and merges these into an 8-second, 1080p video. The generated content carries an invisible SynthID watermark. Users create videos from text prompts on web or mobile, and the system maintains character consistency and lighting coherence across frames. Google demonstrated the mode by combining three selfies, a cyber city background, and an oil painting style image to produce an 'impressionist futuristic street walk' short film with no facial or clothing deformation. Veo 3.1 also outputs native environmental sound and supports first/last frame control and video extension. The multi-image reference feature is fully available and draws on existing subscription quotas; no additional payment plans have been announced.

AI News - AI Base

Google has today rolled out the Veo 3.1 video model to its Gemini Pro/Ultra subscribers, adding a new 'Ingredients to Video' mode. Users can upload up to three reference images at once. The system extracts a different characteristic from each: the character from one, the scene from another, and the artistic style from the third. It then fuses these elements into an 8-second, 1080p video.
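The constraints described above (up to three reference images, one role each, a fixed 8-second 1080p output) can be sketched as a small data structure. This is a minimal illustration only: `IngredientsRequest`, `build_request`, the role names, and the rule that at least one image is required are all hypothetical and do not reflect the actual Gemini or Veo API.

```python
from dataclasses import dataclass
from typing import Dict

# Hypothetical names throughout: this models the article's description,
# not the real Gemini or Veo API.
ROLES = ("character", "scene", "style")
MAX_REFERENCES = 3


@dataclass(frozen=True)
class IngredientsRequest:
    prompt: str
    references: Dict[str, str]  # role -> image path
    duration_seconds: int = 8   # output length stated in the article
    resolution: str = "1080p"   # output resolution stated in the article


def build_request(prompt: str, references: Dict[str, str]) -> IngredientsRequest:
    """Validate the inputs and return a request object."""
    if not prompt.strip():
        raise ValueError("a text prompt is required")
    if not references:
        # Assumption: the article does not state a minimum image count.
        raise ValueError("at least one reference image is required")
    if len(references) > MAX_REFERENCES:
        raise ValueError(f"at most {MAX_REFERENCES} reference images are allowed")
    unknown = set(references) - set(ROLES)
    if unknown:
        raise ValueError(f"unknown reference roles: {sorted(unknown)}")
    return IngredientsRequest(prompt=prompt, references=dict(references))


request = build_request(
    "a walk down a futuristic street, impressionist style",
    {"character": "selfie.png", "scene": "cyber_city.png", "style": "oil_painting.png"},
)
```

Keying the references by role makes the one-image-per-characteristic rule explicit. A real client would also need upload handling and quota checks, which are omitted here.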

Every video generated by Veo 3.1 automatically carries an invisible SynthID watermark, which serves as an authenticity marker. Users start a generation by entering a text prompt on either the web or the mobile interface. The system is designed to keep the character's appearance consistent and the lighting coherent from frame to frame.

Google demonstrated the mode by combining three selfies taken from different angles, a cyber city background image, and an oil painting style reference to produce a short film depicting an 'impressionist futuristic street walk.' In the demonstration, faces and clothing were rendered without deformation.

Veo 3.1 also outputs native environmental sound. In addition, it supports first and last frame control as well as video extension. Google has confirmed that the multi-image reference capability is now fully rolled out to all eligible subscribers. Generations in this mode draw on existing subscription quotas, and no additional payment plans have been announced.
