Welcome to the 【AI Daily】 column, your daily guide to the world of artificial intelligence. Every day we cover the hottest AI topics with a focus on developers, helping you follow technology trends and innovative AI product applications.

Check out the latest AI products. Learn more: https://top.aibase.com/

1. Tencent HunYuan Launches 5 Open-Source 3D Models: 30-Second Generation, Compatible with Multiple Platforms

Tencent HunYuan announced five new open-source 3D generation models based on Hunyuan3D-2.0, offering faster generation and richer detail. The Turbo series models use the FlashVDM framework to accelerate generation, completing the process within 30 seconds. The upgraded 3D AI creation engine supports multi-view input, so users can upload a small number of images to quickly generate high-quality 3D models, reducing production costs. The new models target UGC, product material synthesis, and game asset generation, and meet game 3D asset standards.

【AiBase Summary:】

⚡ The Turbo series models achieve tens of times faster generation through the FlashVDM framework, reducing generation time to 30 seconds.

🖼️ The Hunyuan3D-2-MV model better captures details, generating 3D assets that meet user expectations.

🛠️ The upgraded engine supports multi-view input; users only need to upload 2-4 images to quickly generate high-quality 3D models.

2. Anthropic Releases Major MCP Transport Upgrade: Farewell to Long-Lived Connections, Hello to More Flexible Streamable HTTP

Anthropic has made a significant update to its Model Context Protocol (MCP), introducing a Streamable HTTP transport to replace the existing HTTP+SSE transport. The change addresses key limitations of remote MCP transport, improving flexibility and compatibility. The new mechanism allows more efficient two-way communication between client and server and supports stateless servers, which simplifies deployment and improves scalability.

【AiBase Summary:】

🚀 Removes dedicated /sse endpoints; all messages are transmitted through a unified /message endpoint, simplifying the communication process.

🔄 Servers can dynamically upgrade HTTP requests to SSE streams, supporting flexible two-way communication and addressing the unidirectional limitations of SSE.

🌐 Significantly improved compatibility, suitable for various network infrastructures, supporting stateless mode and reducing resource consumption.

Details: https://github.com/modelcontextprotocol/specification/pull/206
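The "upgrade to SSE" behavior in the summary above can be sketched from the client side. This is a minimal illustration, not the official SDK: the function names `parse_sse` and `read_messages` are hypothetical, and the assumption that the server signals its choice through the `Content-Type` header (`application/json` for a single reply, `text/event-stream` for a stream) comes from our reading of the linked proposal rather than from the summary itself.

```python
import json


def parse_sse(body: str) -> list[dict]:
    """Parse an SSE body into JSON messages, one per event (hypothetical helper)."""
    messages = []
    data_lines = []
    for line in body.splitlines():
        if line == "":
            # A blank line terminates the current event.
            if data_lines:
                messages.append(json.loads("\n".join(data_lines)))
                data_lines = []
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # SSE strips one optional space after the colon.
            if value.startswith(" "):
                value = value[1:]
            data_lines.append(value)
        # Other SSE fields (event:, id:, comments) are ignored in this sketch.
    if data_lines:
        messages.append(json.loads("\n".join(data_lines)))
    return messages


def read_messages(content_type: str, body: str) -> list[dict]:
    """Return JSON-RPC messages from either a plain JSON reply or an SSE stream."""
    media_type = content_type.split(";")[0].strip().lower()
    if media_type == "application/json":
        payload = json.loads(body)
        return payload if isinstance(payload, list) else [payload]
    if media_type == "text/event-stream":
        return parse_sse(body)
    raise ValueError(f"unexpected content type: {content_type}")


# Illustrative payloads only; the message contents are made up for the example.
single = read_messages("application/json", '{"jsonrpc": "2.0", "id": 1, "result": {}}')
stream = read_messages(
    "text/event-stream; charset=utf-8",
    'data: {"jsonrpc": "2.0", "method": "notifications/progress"}\n\n'
    'data: {"jsonrpc": "2.0", "id": 1, "result": {}}\n\n',
)
```

Because the same request can come back in either form, a client written this way does not need a separate long-lived /sse connection, which is the simplification the update is aiming for.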

3. Shengshu Technology's Vidu to Create the First Overseas AI Original Sci-Fi Anime Series

Shengshu Technology Co., Ltd. and Aura Productions have formed a strategic partnership to launch the first overseas AI-original sci-fi anime series. The two parties will jointly produce a 50-episode series of sci-fi anime shorts, using Vidu's video generation technology to improve production efficiency and quality. The collaboration brings AI video generation into anime production and points to a more efficient future for anime creation.

【AiBase Summary:】

🚀 A 50-episode short sci-fi anime series will be launched and released on major global social media platforms.

🤖 Vidu's multi-subject consistency function ensures seamless integration of characters and scenes, achieving high-quality animation storytelling.

⏱️ Vidu 2.0 significantly improves video generation efficiency, generating high-quality videos in 10 seconds.

4. Google Cloud Launches High-Definition Speech Model Chirp 3, Supporting 248 Voices

Google Cloud launched the high-definition speech model Chirp 3 at its DeepMind headquarters in London, aiming to provide developers with powerful speech synthesis tools. The model supports 248 different voices and 31 languages, enabling developers to create applications such as intelligent voice assistants, audiobooks, and video dubbing. To ensure responsible use, Google has restricted access to voice cloning functionality and reiterated its commitment to data privacy.

【AiBase Summary:】

🌟 Google Cloud launches the Chirp 3 speech model, supporting 248 voices and 31 languages, helping developers build intelligent applications.

🔒 Google restricts access to voice cloning functionality to ensure ethical AI practices and prevent misuse.

💼 Google launches initiatives to enhance UK AI skills and provides cloud infrastructure support to startups, fostering innovation.

Details: https://cloud.google.com/text-to-speech/docs/chirp3-hd

5. Musk's xAI Acquires Video Generation Startup Hotshot, Intensifying Competition in the AI Video Field

Elon Musk's xAI has acquired the AI video generation startup Hotshot, marking its further expansion into multi-modal AI. Hotshot is dedicated to improving video generation capabilities.

【AiBase Summary:】

🤖 Hotshot focuses on AI video generation, using 6 million video clips for training to improve the model's understanding of video content.

⚙️ After the acquisition, Hotshot will continue to expand the development of its video generator, utilizing the powerful computing power of xAI's Colossus supercomputer.

💼 This acquisition marks Musk's further expansion in AI and suggests that competition in AI video generation will intensify.

6. Roblox Open-Sources Cube3D: Its First Foundation AI Model for 3D Object Generation

Roblox recently launched and open-sourced Cube3D, its first foundation AI model for generating 3D objects, aimed at improving 3D creation efficiency. The model tokenizes 3D objects during training, which lets it quickly generate complete 3D shapes. Roblox plans to develop Cube3D into a multi-modal model that accepts text, images, and video as input, and to integrate it more closely with its existing AI creation tools.

【AiBase Summary:】

🛠️ Cube3D is Roblox's first open-source AI model for 3D object generation, designed to improve developer creation efficiency.

🔍 Through innovative training methods, the model can tokenize 3D objects and predict the next shape, quickly building complete 3D objects.

🌐 Roblox plans to develop Cube3D into a multi-modal model, which will support text, image, and video input in the future, enhancing the functionality of creation tools.
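The "tokenize, then predict the next shape" idea can be illustrated with a toy autoregressive loop. This is only a sketch of the general technique, not Roblox's model: the token names, the `NEXT_TOKEN` lookup table, and the functions `predict_next` and `generate_shape` are all hypothetical, and a real model would use a learned network conditioned on the whole sequence rather than a table keyed on the last token.

```python
END = "<end>"

# Hypothetical stand-in for a trained model: maps the last token to the next one.
NEXT_TOKEN = {
    "<start>": "base",
    "base": "wall",
    "wall": "roof",
    "roof": END,
}


def predict_next(tokens: list[str]) -> str:
    """Predict the next shape token from the sequence so far (toy lookup)."""
    return NEXT_TOKEN.get(tokens[-1], END)


def generate_shape(max_tokens: int = 16) -> list[str]:
    """Generate shape tokens one at a time until the end token appears."""
    sequence = ["<start>"]
    for _ in range(max_tokens):
        token = predict_next(sequence)
        if token == END:
            break
        sequence.append(token)
    return sequence[1:]  # drop the start marker


shape_tokens = generate_shape()
```

In a real system each generated token would then be decoded back into geometry; the loop structure, predicting one token and feeding it back in, is the part this sketch is meant to show.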

7. Zoom Upgrades Its AI Assistant, AI Companion

Zoom recently announced a new round of feature upgrades for its AI assistant, Zoom AI Companion, aimed at improving user interaction and work efficiency in video conferencing. New features include Zoom Tasks, which automatically identifies and completes to-do items, a new voice recorder for transcribing offline conversations, and a customizable AI assistant.

【AiBase Summary:】

🌟 The Zoom Tasks feature automatically identifies to-do items in meetings and completes related tasks.

🗣️ The new voice recorder transcribes offline conversations and provides real-time meeting notes.

📅 The customizable AI assistant feature will be launched in April, allowing users to customize functions according to their needs.

8. 128K Ultra-Long Context! Mistral's Latest Open-Source Model Mistral Small 3.1 Arrives, Outperforming GPT-4o Mini

Mistral AI released the open-source model Mistral Small 3.1. With 24 billion parameters, it rivals comparable models from Google and OpenAI in performance. The model shows significant improvements in text processing and multi-modal understanding, supporting a 128k-token context window and a processing speed of 150 tokens per second.

【AiBase Summary:】

🌟 Mistral Small 3.1 has 24 billion parameters, with performance comparable to similar products from Google and OpenAI, driving competition in the AI market.

📈 The model supports a 128k token context window and a processing speed of up to 150 tokens per second, suitable for long documents and quick response scenarios.

🌍 Mistral adopts an open-source strategy, releasing under the Apache 2.0 license, emphasizing European digital sovereignty and attracting global developers to participate in innovation.

Details: https://top.aibase.com/tool/mistral-small-3-1
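To put the two headline numbers in perspective, here is a small back-of-the-envelope sketch. It assumes "128k" means 128,000 tokens and that the 150 tokens per second figure applies to output generation; both are assumptions, and the helper names are made up for the example.

```python
CONTEXT_WINDOW_TOKENS = 128_000  # assumption: "128k" read as 128,000 tokens
TOKENS_PER_SECOND = 150          # throughput figure quoted in the article


def generation_seconds(output_tokens: int) -> float:
    """Estimated time to generate a reply of the given length."""
    return output_tokens / TOKENS_PER_SECOND


def fits_in_context(prompt_tokens: int, output_tokens: int) -> bool:
    """Check whether prompt plus reply stay within the context window."""
    return prompt_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS


# A 1,500-token reply at the quoted speed.
reply_time = generation_seconds(1_500)

# A long document plus a short summary, and one that overflows the window.
long_doc_ok = fits_in_context(prompt_tokens=120_000, output_tokens=8_000)
too_long = fits_in_context(prompt_tokens=127_000, output_tokens=2_000)
```

Real throughput depends on hardware and batch size, so these figures are best read as rough upper bounds rather than guarantees.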

9. Who Says Videos Can Only Be “One-Shot”? ByteDance's Innovative LCT Technology Lets AI Direct Blockbuster Films!

Long Context Tuning (LCT) greatly enhances the narrative capabilities of AI-generated video, allowing a model to switch shots freely, like a film director, and build more coherent multi-shot scenes. By introducing a full attention mechanism, interleaved 3D positional embeddings, and an asynchronous noise strategy, LCT addresses the problems of visual consistency and temporal dynamics in multi-shot generation.

【AiBase Summary:】

🎥 LCT technology enables AI video generation models to direct multi-shot narrative videos, enhancing narrative capabilities.

🔍 Through a full attention mechanism and interleaved 3D positional embeddings, LCT keeps visuals consistent and temporal dynamics coherent across shots.

🚀 LCT supports autoregressive shot expansion, facilitating long video creation and interactive modification.

Details: https://top.aibase.com/tool/zhangshangxiawentiaoyoulct

10. The "Comeback" of a 32B Parameter Model! OLMo 2 32B Emerges, Challenging GPT-3.5 Turbo

OLMo 2 32B is the latest large language model released by the Allen Institute for Artificial Intelligence. With 32 billion parameters and a fully open-source release, it challenges many proprietary models. Thanks to a refined training process, OLMo 2 32B surpasses GPT-3.5 Turbo and GPT-4o mini on several benchmarks while training more efficiently.

【AiBase Summary:】

🌐 OLMo 2 32B is a fully open-source language model, disclosing all data, code, and training processes to promote global research collaboration.

📈 The model has 32 billion parameters and surpasses GPT-3.5 Turbo in several benchmark tests, proving the powerful capabilities of open-source models.

⚡ OLMo 2 32B trains efficiently, reportedly using only about one-third of the compute of comparable open-weight models such as Qwen 2.5 32B, showcasing the potential for efficient AI development.

Details: https://github.com/allenai/OLMo-core