At this year's Mobile World Congress (MWC), Google announced new real-time video analysis and screen sharing capabilities for its Gemini assistant. Google One AI Premium subscribers with Gemini Advanced will be the first to experience these features later this month.

This update gives Gemini Live two core abilities: real-time analysis of video captured by the phone's camera, and screen sharing, which lets the assistant directly interpret what is on the user's screen and respond to it. Together, these features enable more interactive visual exchanges with the AI, such as identifying objects, explaining on-screen content, or offering suggestions in real time.

[Image: Google's Gemini large language model]

The new features will initially launch on Android devices and support multiple languages. At MWC, Google demonstrated them across a range of Android devices, strengthening its position in the AI assistant space.

This update also marks a key step toward real-world interaction for AI assistants. Google's long-term goal is Project Astra – a general-purpose multimodal AI assistant capable of processing text, video, and audio in real time, with short-term memory. Astra is expected to integrate deeply with Google Search, Lens, and Maps in the future.

With the launch of these Gemini Live features, the competition between Google and OpenAI is intensifying. ChatGPT has supported real-time video and screen sharing through its Advanced Voice Mode since December of last year, and Google's update is clearly a direct response. Whether Gemini can use this new functionality to cement a lead in AI assistants remains to be seen.