Researchers from UCLA and other institutions have recently introduced MultiPLY, an embodied multimodal large language model. The model perceives its surroundings through multiple senses, including touch, vision, and hearing, enabling an AI agent to interact with 3D environments far more comprehensively. Through these interactions between the agent and the 3D environment, MultiPLY achieves strong experimental performance on tasks such as object retrieval, tool use, multisensory captioning, and task decomposition. The researchers also constructed Multisensory-Universe, a large-scale multisensory dataset containing 500,000 entries. This work offers new insights for building large models with multisensory capabilities and suggests a new direction toward AGI.