Waymo Launches Next-Generation AI Model EMMA to Advance Autonomous Driving Technology

AIbase基地

Published inAI News · 4 min read · Nov 12, 2024

274

Recently, Waymo officially released an AI research model named "End-to-End Multimodal Autonomous Driving Model" (EMMA). This model has been specifically trained and fine-tuned for autonomous driving technology, leveraging the extensive knowledge of Gemini to better understand complex road scenarios. Waymo detailed the design philosophy and technical advantages of the model in their published research paper, and discussed the pros and cons of a purely end-to-end approach.

Artificial Intelligence Driving (Image Source: AI Synthesis)

Image source note: The image was generated by AI, authorized by service provider Midjourney

Waymo stated that the EMMA model is built on the foundation of Gemini, fully utilizing its capabilities, and focusing on tasks specific to autonomous driving, such as motion planning and 3D object detection. The model has demonstrated excellent task transfer capabilities in several key autonomous driving tasks. Waymo noted that compared to training separate models for each task, EMMA significantly improves performance in path prediction, object detection, and road map understanding.

Waymo's research results indicate that the construction of EMMA provides a promising research direction for the combination of future core autonomous driving tasks. Drago Anguelov, Waymo's Vice President and Head of Research, said: "EMMA showcases the powerful capabilities and importance of multimodal models in the field of autonomous driving. We look forward to further exploring how multimodal methods and components can help build more universal and adaptable driving systems."

EMMA also performs well in handling raw camera inputs and text data. It can generate various driving outputs and, by establishing a unified language space, fully utilizes Gemini's world knowledge and reasoning abilities to enhance the decision-making process and improve the efficiency of end-to-end planning.

Waymo emphasizes that the significance of this research extends beyond the application in autonomous vehicles, also expanding the capabilities of AI in complex dynamic environments by applying advanced AI technologies to real-world tasks.

Key Points:
🚗 EMMA model is specifically trained for autonomous driving, utilizing Gemini's knowledge to understand complex road scenarios.
📈 Compared to traditional models, EMMA shows more efficient performance in key tasks.
🌍 The research outcomes are not only applied to autonomous driving but also expand the potential applications of AI in dynamic environments.

Robotaxi Industry Sees New Opportunities! JPMorgan Optimistic About Baidu's Market Capitalization Exceeding $79.6 Billion

Recently, the Robotaxi industry has become increasingly popular. A new research report from JPMorgan stated that Baidu's market capitalization should reach $79.6 billion. This news has attracted widespread attention in the market, especially as major players in the autonomous driving industry are beginning to emerge. According to predictions from overseas research institutions, the valuation of the renowned autonomous driving company Waymo is expected to surge to $75 billion next year, prompting the industry to re-evaluate the value of Robotaxi. Analysts believe that within the country, the company known as Apollo Go...

Google Launches Gemini for Education! Free AI Tools Sweep the Global Education Sector

Google recently announced the launch of a new AI tool suite called Gemini for Education, based on its latest generation Gemini 2.5 Pro model and the LearnLM learning large model specifically optimized for education, providing free, powerful, and efficient learning and teaching support for teachers and students around the world. This move marks another major breakthrough for Google in the field of educational technology, aiming to empower educators and students through AI technology, creating a more personalized and efficient learning experience. Gemini for Educa

Gemini Live Makes a Major Upgrade! Seamless Integration with Google Apps, Smart Life Within Reach

With the rapid development of artificial intelligence technology, Google's AI assistant Gemini Live has undergone a major upgrade. According to the latest information obtained by AIbase, Gemini Live is about to achieve deep integration with multiple Google apps, providing users with a more intelligent and efficient interaction experience. This feature not only enhances productivity but will also completely change the way users interact with the Google ecosystem. Seamless connection with Google apps, smarter operations are now more convenient. Latest news shows

Gemini2.5Pro API Returns Free, Developer Community Responds Enthusiastically

Recently, Google announced that the API of its flagship AI model, Gemini2.5Pro, has been reintroduced to the free tier of Google AI Studio. This news has triggered widespread attention and enthusiastic discussions within the developer community. According to AIbase, this move marks another important advancement in Google's efforts to popularize AI technology, offering developers lower barriers to innovation. As the most advanced AI model from Google so far, Gemini2.5Pro is known for its exceptional multimodal capabilities and strong reasoning power.

Gemini Will Replace Google Assistant, Android Users Welcome New Experience

Recently, Google announced that the upcoming Gemini feature will replace Google Assistant on Android devices. According to an internal email obtained by Android Police, the Gemini update will start rolling out on July 7th. This update will allow users to still control phone calls, messages, WhatsApp, and other apps through this AI assistant even when the Gemini app is closed. This change aims to enhance user experience.

"AI Daily Report - June 26"; Doubao AI Programming Launches Major Upgrade; Google Opensources AI Agent Gemini CLI

Welcome to the AIbase [AI Daily Report] section! Spend three minutes a day to learn about the latest AI events, helping you understand AI industry trends and innovative AI product applications. For more AI news, visit: https://www.aibase.com/zh1. Doubao AI Programming Launches Major Upgrade! No-code beginners can easily create their own web pages, with real-time editing that is very convenient! Doubao AI Programming has been upgraded to Application Creation 1.0, featuring visual editing, real-time preview, and multi-version management functions, lowering the barrier to web and application development for beginners.

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Waymo Launches Next-Generation AI Model EMMA to Advance Autonomous Driving Technology

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Robotaxi Industry Sees New Opportunities! JPMorgan Optimistic About Baidu's Market Capitalization Exceeding $79.6 Billion

Google Launches Gemini for Education! Free AI Tools Sweep the Global Education Sector

Gemini Live Will Be Fully Integrated into Google Apps, Making the AI Assistant Smarter!

Gemini Live Makes a Major Upgrade! Seamless Integration with Google Apps, Smart Life Within Reach

The Revolution of Large Models! How Gemini 2.5 Pro is Transforming the Way We Process Information

Google's New Gemini Education Program Empowers AI Applications in Schools, Benefiting Both Teachers and Students!

Gemini2.5Pro API Returns Free, Developer Community Responds Enthusiastically

Gemini Will Replace Google Assistant, Android Users Welcome New Experience

Gemini Will Replace Google Assistant, New Privacy Protection Model is Coming!

"AI Daily Report - June 26"; Doubao AI Programming Launches Major Upgrade; Google Opensources AI Agent Gemini CLI