2024-10-28 16:13:01.AIbase.
ZhiYuan Launches Hour-Level Ultra-Long Video Understanding Model Video-XL
2024-10-28 14:42:03.AIbase.
Meta Open Sources Long Video LLM Project LongVU: Filters Duplicate Frames for Efficient and Accurate Understanding of Long Video Content
2024-10-25 11:16:59.AIbase.
Salesforce AI Research Unveils New Multimodal Model BLIP-3-Video: Cost-Effective Video Understanding
2024-10-23 09:40:35.AIbase.
Cohere Launches Multimodal Search Model Embed3, Allowing Text and Image-Based File Retrieval
2024-10-21 14:55:38.AIbase.
Zhiyuan Releases Native Multimodal World Model Emu3: Achieving Text, Image, and Video Understanding and Generation Solely Through Next Token Prediction
2024-10-21 13:52:45.AIbase.
Redefining Multimodal AI! Zhiyuan Releases the Native Multimodal World Model Emu3
2024-10-14 10:56:21.AIbase.
Apple's 'Multimodal Alchemy Furnace' Upgraded Again! MM1.5 Enhances Text Density and Multimodal Understanding
2024-10-11 10:03:00.AIbase.
Rhymes AI Launches First Open Source Multimodal AI Model Aria, Outperforming GPT-4o Mini and Other Leading AI Models
2024-10-10 11:08:05.AIbase.
vivo Launches New Blue Heart Large Model Matrix: Upgraded Language, Voice, Image, and Multimodal Capabilities
2024-10-10 11:03:28.AIbase.
iFlytek: To Release Multimodal Visual Interaction Technology on October 24
2024-10-08 11:18:05.AIbase.
Apple Introduces MM1.5: A Revolution in Multimodal AI Models Redefining Intelligent Understanding?
2024-10-07 11:00:18.AIbase.
Meta's Privacy Policy Sparks Controversy: Ray-Ban Meta Users Unknowingly Contributing Data for AI Models
2024-09-27 17:37:02.AIbase.
Super Powerful Multimodal Model Emu3: Understanding Images and Videos Through Next Word Prediction
2024-09-27 14:28:20.AIbase.
OpenAI Launches New Multimodal Content Moderation Model: Based on GPT-4o, Capable of Detecting Text and Images
2024-09-26 18:01:01.AIbase.
The Brand New Open Source AI Model Molmo Sweeps Industry Giants, Surpassing GPT-4o and Claude 3.5
2024-09-26 14:34:11.AIbase.
The Open Source Multimodal Model Molmo Can Recognize Objects in Images and Generate Accurate Descriptions
2024-09-25 13:58:28.AIbase.
Snapchat Collaborates with Google, Integrating Gemini Multimodal Capabilities into My AI Chatbot
2024-09-20 09:06:14.AIbase.
Alibaba International Launches Latest Multimodal Large Model Ovis, Providing Cooking Steps by Analyzing Dishes
2024-09-17 10:04:47.AIbase.
OPPO's Large Model Upgraded to AndesGPT-2.0, Supporting Multimodal Capabilities
2024-09-14 15:42:34.AIbase.