AI News on November 19, 2025

← Previous Day View All Dates Next Day →

Product

Manus Launches Browser Operator Chrome Extension: Transforms Any Browser into an AI-Powered Tool for Automated Tasks and Secure Access

Manus has released the Manus Browser Operator, a Chrome extension designed to convert any standard browser into an AI-enabled one. This tool automates complex browser operations, allowing access to protected websites and systems like research platforms and CRM tools without triggering additional login verifications. Currently in a phased rollout for advanced users, the extension aims to significantly boost daily work efficiency. Key features include secure local access, session reuse, and the ability to perform tasks such as data retrieval from databases (Crunchbase, PitchBook), CRM updates, and data extraction from paid platforms. The system operates with a dual-layer architecture, combining cloud-based browsing for general tasks with local browser access for authenticated systems, ensuring secure and efficient task execution. It is currently in beta for Pro, Plus, and Team users, supporting Chrome and Edge, with ongoing optimization for complex interactions.

Xiaohu.AI 日报•November 19, 2025

Read Analysis Source

Technology

Google Unveils Antigravity: A New AI-Powered Autonomous Platform for End-to-End Software Development, Integrating with Gemini 3 for Agentic Coding

Google has launched Antigravity, a novel platform designed for "AI agent-led development," moving beyond traditional IDEs. This autonomous agent collaboration system enables AI to independently plan, execute, and verify complete software development tasks. Deeply integrated with the Gemini 3 model, Antigravity represents Google's key product in "Agentic Coding." It addresses limitations of previous AI tools, which were primarily assistive and required manual operation and step-by-step human prompts. Antigravity allows AI to work across editors, terminals, and browsers, plan complex multi-step tasks, automatically execute actions via tool calls, and self-check results. It shifts the development paradigm from human-operated tools to AI-operated tools with human supervision and collaboration. The platform's core philosophy revolves around Trust, Autonomy, Feedback, and Self-Improvement, providing transparency into AI's decision-making, enabling autonomous cross-environment operations, facilitating real-time human feedback, and allowing AI to learn from past experiences.

Xiaohu.AI 日报•November 19, 2025

Read Analysis Source

Technology

Google Vids Unlocks Advanced AI Features for All Gmail Users: Free Access to AI Voiceovers, Redundancy Removal, and Image Editing

Google has made several advanced AI features in its Vids video editing platform available to all users with a Gmail account, previously exclusive to paid subscribers. These newly accessible tools include AI voiceovers, automatic removal of redundant speech, and AI image editing. The transcription trimming feature automatically eliminates filler words like "um" and "ah," along with long pauses, significantly enhancing video quality. Users can also generate professional-grade voiceovers from text scripts, choosing from seven different voice options, many of which sound natural. Additionally, the AI image editing tool allows for easy modifications such as background removal, descriptive editing, and transforming static photos into dynamic videos. Google aims to empower both beginners and experienced creators to produce high-quality video content, anticipating significant growth in the video editing market despite Vids being in its early stages.

AI新闻资讯 - AI Base•November 19, 2025

Read Analysis Source

Technology

Quora's Poe AI Platform Launches Group Chat Feature Supporting Up to 200 Users for Enhanced Collaborative AI Interactions

Quora has introduced a new group chat feature for its AI platform, Poe, allowing up to 200 users to collaborate with various AI models and bots in a single conversation. This innovation supports multi-modal interactions including text, image, video, and audio generation. The launch coincides with OpenAI's ChatGPT piloting similar group chat functionalities in select markets, signaling a shift in AI interaction methods. Quora highlights that this feature will offer new interactive experiences for AI users, such as family trip planning using Gemini 2.5 and o3Deep Research, or team brainstorming with image models to create mood boards. Users can also engage in intellectual games with Q&A bots. Group chats can be created from Poe's homepage, with real-time synchronization across devices, ensuring seamless transitions between desktop and mobile. Quora developed this feature over six months and plans to optimize it based on user feedback, emphasizing the unexplored potential for group interaction and collaboration in AI mediums. Poe also enables users to create and share custom bots.

AI新闻资讯 - AI Base•November 19, 2025

Read Analysis Source

Product

Google AI Developers Announce Immediate Availability of Gemini 3 for Builders

Google AI Developers have announced that Gemini 3 is now available for immediate use by developers. The announcement, made on November 19, 2025, encourages users to 'Start building with Gemini 3 today.' This brief update signifies the release of the new version of Gemini, making it accessible for development projects.

Google AI Developers(@googleaidevs) - Google AI Developers (@googleaidevs)•November 19, 2025

Read Analysis Source

Technology

Google Research Unveils Generative UI: AI Now Creates Interactive Interfaces from Simple Prompts, Transforming User Experience in Gemini and Search

Google Research has introduced Generative UI, a groundbreaking interactive technology that enables AI models to generate complete, visual, and interactive user interfaces, including web pages, tools, games, and applications, from natural language prompts. This innovation expands AI's capability beyond mere content generation to full interactive experience creation. Integrated into Gemini App's 'Dynamic View' and Google Search's AI Mode, Generative UI addresses the limitations of traditional AI's linear text output, which struggles with complex knowledge and interactive tasks. The system allows AI to instantly design and implement functional interfaces, such as animated DNA explanations or social media galleries, rather than just providing textual descriptions. This feature is currently experimental in Gemini and available to Google AI Pro and Ultra users in the US for Search's AI Mode, leveraging tool access, system-level instructions, and post-processing for robust and safe interface generation.

Xiaohu.AI 日报•November 19, 2025

Read Analysis Source

Technology

Google Unveils Gemini 3: A Leap in AI Reasoning, Multimodal Integration, and Agentic Behavior for Complex Understanding and Autonomous Task Execution

Google has officially launched Gemini 3, marking a significant advancement in AI capabilities. Defined by Google as a qualitative leap in higher-level reasoning, multimodal integration, and agentic behavior, Gemini 3 empowers AI with comprehensive abilities to understand complex scenarios, perform cross-modal analysis, and autonomously execute tasks. Key features include enhanced reasoning depth and problem decomposition, allowing it to understand the logic behind questions and break down complex tasks. Its 'Deep Think' mode achieved a 41% accuracy in human doctoral-level exams without tools, outperforming other public AI models. Gemini 3 also demonstrates significant progress in multimodal understanding across images, video, audio, and code. A major breakthrough is its agentic capabilities, supported by the new Google Antigravity platform, enabling AI to plan, code, execute, and verify tasks autonomously. Furthermore, Gemini 3 boasts scalable learning and long-horizon planning with million-token context understanding, capable of managing multi-step scenarios consistently. These advancements position Gemini 3 for applications in learning, building, and planning across various domains.

Xiaohu.AI 日报•November 18, 2025

Read Analysis Source

Technology

xAI Launches Grok 4.1 with Enhanced Performance and Reduced Hallucinations on Web and Apps, Lacks API Access for Enterprise

Elon Musk's xAI has released Grok 4.1, its newest large language model, now available for consumer use on Grok.com, X, and its mobile apps. This launch, preceding Google's Gemini 3, introduces significant architectural and usability improvements, including faster reasoning, improved emotional intelligence, and notably lower hallucination rates. Grok 4.1 has achieved top rankings in public benchmarks, surpassing models from Anthropic, OpenAI, and Google's pre-Gemini 3 models. A white paper detailing its evaluations and training process has also been published. However, a key limitation for enterprise developers is the current absence of API access for Grok 4.1, restricting its integration into production environments. Only older xAI models are presently available via the developer API, supporting up to 2 million tokens of context.

VentureBeat•November 18, 2025

Read Analysis Source

Technology

Google Unveils Gemini 3: Claims Global AI Leadership in Math, Science, Multimodal, and Agentic Benchmarks, Surpassing Competitors

Google has officially launched Gemini 3, its latest proprietary frontier model family, marking its most comprehensive AI release since the Gemini line debuted in 2023. Available exclusively through Google products and developer platforms, Gemini 3 includes the flagship Gemini 3 Pro, Gemini 3 Deep Think for enhanced reasoning, generative interface models, and Gemini Agent for multi-step tasks. Independent AI benchmarking organization Artificial Analysis has crowned Gemini 3 Pro the "new leader in AI" globally, achieving a top score of 73 on its index, a significant leap from Gemini 2.5 Pro's 9th place. LMArena also reported Gemini 3 Pro as the world's top model across text reasoning, vision, coding, and web development, outperforming Grok-4.1, Claude 4.5, and GPT-5-class systems in various categories.

VentureBeat•November 18, 2025

Read Analysis Source