Welcome to the AI Daily section! This is your daily guide to exploring the world of artificial intelligence. Every day, we bring you the hottest topics in the AI field, focusing on developers, helping you understand technological trends and discover innovative AI product applications.

Explore fresh AI products by clicking here: https://top.aibase.com/

1. Say goodbye to expensive motion capture! Runway introduces Act-One, a generative character performance tool, turning videos into animations with seamless style switching!

I was deeply impressed by Runway's latest Act-One tool! This revolutionary technology uses generative AI models to effortlessly create lifelike character animations from just video and voice inputs, completely overturning traditional animation production processes. No expensive equipment or complicated post-production is needed; anyone can produce high-quality animation works. The operation is simple and allows for the generation of various styles of character animations, providing creators with great creative freedom.

image.png

AiBase Summary:

🎬 Revolutionary technology uses generative AI models to create lifelike character animations from actor videos and voice inputs, overturning traditional production processes.

💡 Simple operation allows for multiple styles of character animations, providing creators with great creative freedom.

🌟 Handles complex multi-round dialogue scenes, with a wide range of applications, bringing a new era to the animation industry.

Details link: https://top.aibase.com/tool/runway

2. Ideogram launches Canvas feature: achieve magical image filling and seamless expansion

Ideogram's latest Canvas feature offers powerful image generation and editing options, allowing users to freely expand, compare, adjust image sizes and order, and even combine multiple images into new works. It's especially suitable for marketers and content creators, enhancing creativity efficiency and flexibility.

image.png

AiBase Summary:

🖼️ New Canvas feature: Ideogram's new feature supports image generation and various editing options.

✂️ Infinite creativity: Users can generate four images from prompts and modify them freely.

📈 Efficient creation: Especially suitable for marketers and content creators, enhancing creation efficiency and flexibility.

3. Stability AI releases Stable Diffusion 3.5 series text-to-image models

Stability AI has released the most powerful model, Stable Diffusion 3.5, which includes a family bucket of three versions to meet diverse needs. The model offers high customization, efficient performance, and diverse outputs, capable of running on consumer-grade hardware, supporting global image generation.

image.png

AiBase Summary:

🔑 High customization, efficient performance, and diverse outputs

🔑 The model can run on consumer-grade hardware, supporting global image generation

🔑 Generous community license, allowing free commercial use

Details link: https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-large

4. Claude 3.5 gets a major upgrade: Sonnet's coding capabilities surpass o1, Haiku offers unbeatable cost-effectiveness, and it can even use a computer!

Anthropic has released the upgraded Claude 3.5 Sonnet and a new model Claude 3.5 Haiku, making significant progress in reasoning, coding, and visual processing. Sonnet leads the industry, performing excellently, even surpassing public models like OpenAI o1-preview. Haiku is Anthropic's fastest model, with performance comparable to Claude 3 Opus but at a lower cost and faster speed. Both models have the ability to use computers, opening up new possibilities for automated processes and personalized experiences.

image.png

AiBase Summary:

🚀 Sonnet's coding capabilities lead the industry, surpassing public models like OpenAI o1-preview.

💡 Haiku is Anthropic's fastest model, offering high cost-effectiveness, suitable for personalized experience generation.

💻 The models have the ability to use computers, opening up new possibilities for automated processes and tasks.

5. Canva launches new text-to-image tool Dream Lab, generating 3D illustrations with a single click!

Canva's latest AI feature, the most eye-catching being the Dream Lab tool, uses Leonardo.ai's Phoenix model, allowing users to generate various styles of images through descriptions. In addition, Canva's Magic AI tool suite has also been updated, improving text generation accuracy and adding new features for whiteboard and video editing. However, Canva has announced an increase in subscription prices for some commercial customers, causing mixed reactions from users regarding the gradual improvements and new features.

image.png

AiBase Summary:

🎨 Canva's new image generation tool "Dream Lab" uses Leonardo.ai's Phoenix model to generate multiple styles of images based on descriptions.

✏️ Canva's "Magic" AI tool suite has been updated, improving text generation accuracy and adding new features for whiteboard and video editing.

💰 Canva has announced an increase in subscription prices for some commercial customers, causing mixed reactions from users regarding the value of gradual improvements and new features.

6. Volcano Engine launches template mall to lower the threshold for AI applications

Volcano Engine's template mall provides users with a simple and quick way to easily use AI capabilities, significantly improving work efficiency and quality. The mall features multiple high-quality templates from AI best practices, covering various business scenarios, allowing users to copy and customize applications with one click. Additionally, the mall offers clear categorization and popular recommendations, bringing more possibilities and inspiration to users.

image.png

AiBase Summary:

⚙️ Template mall lowers the threshold for AI applications, allowing more users to easily use AI capabilities and improve work efficiency and quality.

💡 The mall features multiple high-quality templates from AI best practices, covering intelligent customer service, content marketing, and other business scenarios.

🚀 Users can copy templates with one click and customize applications, shortening preparation time and improving efficiency.

7. Genmo opens source video generation model Mochi 1: high quality, ultra-smooth, even home computers can create Hollywood-level blockbusters!

Genmo's open-source latest video generation model, Mochi 1, has caused a sensation in the video generation field with its high quality and ultra-smooth characteristics, allowing home computers to create Hollywood-level blockbusters. Mochi 1 adopts an innovative Asymmetric Diffusion Transformer (AsymmDiT) architecture with 10 billion parameters, fully trained from scratch, providing great convenience for developers.

image.png

AiBase Summary:

💡 Mochi 1 adopts an innovative Asymmetric Diffusion Transformer (AsymmDiT) architecture with 10 billion parameters, the largest video generation model ever publicly released.

💡 Mochi 1 has excellent motion quality and precise adherence to text prompts, capable of generating smooth videos up to 5.4 seconds long at 30 frames per second.

💡 Mochi 1 can simulate various physical phenomena, generating naturally smooth human actions, providing new possibilities for video generation for developers.

Details link: https://huggingface.co/genmo/mochi-1-preview

8. Tencent launches ima.copilot intelligent workbench product

Tencent's latest ima.copilot intelligent workbench product, powered by the Hunyuan large model, aims to provide users with a new search, read, and write experience. The product features core functions such as knowledge acquisition, personal knowledge base building, and intelligent writing assistance, allowing easy management and acquisition of knowledge, providing customized answers, and assisting with writing tasks. Tencent plans to launch more versions to meet user needs, showcasing in-depth exploration and continuous innovation in the field of artificial intelligence, enhancing work and learning efficiency, and providing intelligent support tools for users.

image.png

AiBase Summary:

🔍 Knowledge acquisition: Users can ask questions based on the entire web through ima.copilot, integrating high-quality content into their personal knowledge base, easily obtaining knowledge.

📚 Personal knowledge base building: The product supports users in building their own knowledge base, providing customized answers, inspiring work and learning ideas.

✍️ Intelligent writing assistance: ima.copilot can understand user needs and assist in completing writing tasks such as papers, essays, and copywriting.

Details link: https://ima.qq.com/

9. PodCastLM is here!

PodCastLM is a newly launched tool designed to help users convert PDF document content into audio podcasts, enhancing the efficiency and fun of information dissemination. By combining modern technology, users can easily generate original audio content, saving time and effort.

image.png

AiBase Summary:

🔊 User-friendly interface and smooth conversion process

🎙️ Convert PDF documents into audio podcasts

📚 Suitable for various users, such as podcast hosts, content creators, educators

Details link: https://github.com/YOYZHANG/PodCastLM

10. Cohere releases multi-modal search model Embed 3

Cohere's latest multi-modal AI search model, Embed 3, supports enterprise-level retrieval through text and images, significantly improving image search performance, helping companies tap into data value. The updated API simplifies the process for customers switching from other models, providing a more flexible search experience.

image.png

AiBase Summary:

🌟 Users can perform multi-modal searches through images and text

📈 The updated model significantly improves image search performance, helping companies tap into data value

🔄 The updated API simplifies the process for customers switching from other models

Details link: https://cohere.com/blog/multimodal-embed-3

11. ChatGPT Advanced Voice Mode lands in Europe!

OpenAI has recently expanded its ChatGPT Advanced Voice Mode to regions including the EU, achieving response speeds comparable to human dialogue. The feature is not only available to users in the US and UK but has also undergone multiple improvements, including the addition of five new voices, custom instruction functions, and memory of dialogue content. Compared to competitor Google's Gemini Live, ChatGPT offers a more natural conversational experience and more efficient information interaction.

image.png

AiBase Summary:

🚀 ChatGPT Advanced Voice Mode has been expanded to regions including the EU, with fast response speeds comparable to human dialogue.

🔊 Added five new voices and custom instruction functions, allowing users to choose different voice responses and control ChatGPT's behavior.

💡 OpenAI has made further breakthroughs in the field of artificial intelligence, allowing users to enjoy a more natural conversational experience and efficient information interaction.

12. French AI startup Les Ministraux releases a new lightweight model, outperforming Llama 3!

Les Ministraux's Ministral 3B and Ministral 8B models perform excellently on edge devices, matching the performance of open-source models, providing users with high computational efficiency and low-latency solutions. However, Mistral company has recently been embroiled in controversy and is no longer as open as before, possibly being acquired by Microsoft.

image.png

AiBase Summary:

🚀 Ministral 3B and Ministral 8B outperform Llama 38B and Mistral 7B, with Ministral 8B superior in all aspects except coding capabilities.

💡 Ministral 3B and Ministral 8B support up to 128k context, setting a new benchmark for models under 10B parameters, with Ministral 8B equipped with a sliding window attention mechanism.

⚙️ Les Ministraux models can be applied to manage AI agent workflows, create task assistants, and other scenarios, with Ministral 8B priced at $0.1 per million tokens and Ministral 3B at $0.04 per million tokens.