Google AI Video Strikes Again! The Universal Visual Encoder VideoPrism Sets 30 New SOTA Results

新智元

Published in AI News · 1 min read · Feb 26, 2024

Google's team has introduced VideoPrism, a new universal visual encoder. After extensive pre-training on massive amounts of video and paired text, it has set 30 new state-of-the-art records. The model handles a range of video understanding tasks, including classification, localization, retrieval, captioning, and question answering. VideoPrism shows strong versatility and generalization across these tasks, marking a notable advance in video understanding.

Video Encoder AI Model Google

This article is from AIbase Daily


Created by the AIbase Daily Team


