Welcome to the AI Daily column, your daily guide to the world of artificial intelligence. Each day we cover the hottest topics in AI with a focus on developers, helping you track technology trends and discover innovative AI product applications.
Fresh AI Products Click to Learn More: https://top.aibase.com/
1. Digital Humans Take Flight! ByteDance's Loopy Lip-Sync Feature Launches on Jimeng
ByteDance's new project, Loopy, has launched on Jimeng, closely synchronizing a digital human's voice with its visuals, expressions, and emotions. In AIbase's hands-on testing the results were excellent, with Chinese currently the best-supported language. Loopy addresses the disjointed look of lip-sync videos by automatically adding tone, emotion, and expressions when characters speak and by precisely directing the subtle movements of virtual avatars.
AiBase Highlights:
👄 Characters automatically convey tone, emotion, and expressions, solving the problem of disjointedness in lip-sync videos.
🎤 Supports text-to-speech and local voice uploads, enabling characters to speak and sing.
👁 Lip-syncing accounts for subtle movements of features such as the Adam's apple and eyebrows, making the overall video more realistic.
Details Link: https://top.aibase.com/tool/jimeng
2. Tencent Yuanqi AI Agent Platform Now Supports Publishing to WeChat Official Accounts: Create Digital Avatars and 24/7 Intelligent Customer Service
Tencent's AI agent product, Tencent Yuanqi (腾讯元器), now supports publishing to WeChat Official Accounts, bringing several new features to account operators. Operators can create an agent in three simple steps to improve user engagement and experience. Developers can quickly build high-quality agents and publish them to platforms such as QQ and WeChat or expose them through API calls.
AiBase Highlights:
🤖 Real-time interactive digital avatars: Enhance user engagement and experience.
🕒 24/7 intelligent customer service: Provide round-the-clock customer service, improving service efficiency.
📝 Article insertion feature: Make content more interactive and informative, giving readers a question-and-answer assistant and more useful articles.
3. Alibaba Cloud's ModelScope (MoDa) Community Launches AIGC Zone: 157 Multimodal Models Available
Alibaba announced several technological innovations and business developments at the Apsara Conference (云栖大会) on September 21, 2024, including the official launch of the AIGC zone in the MoDa community, which gives developers a comprehensive platform for AI creation and development. Alibaba Cloud also introduced significant upgrades in security and data management, as well as a new family of elastic computing products.
AiBase Highlights:
🚀 The MoDa community launches the AIGC zone, a comprehensive AI creation and development platform with free functional modules and GPU computing power.
🔒 Alibaba Cloud has fully upgraded its native security capabilities: it launched NDR, a cloud-native Network Detection and Response product, expanded its free security protections, and is helping small and medium-sized enterprises manage cloud security risks.
💻 Alibaba Cloud releases a family of elastic computing products, including ninth-generation enterprise-level ECS instances with performance improved by up to 30%.
4. Aishi Technology's Video Generation Large Model PixVerse Launches New UI: Smoother Operation
Aishi Technology's large video generation model, PixVerse, has introduced a new user interface (UI) with a series of features meant to improve the creative experience. The update adds a universal floating creation panel, a homepage inspiration library, and a creative workbench, and it streamlines the generation steps and functional layout to suit different devices. PixVerse V2.5 has also launched globally, improving the motion quality, speed, and output quality of video generation and strengthening the model's ability to understand prompts. New features such as a high-performance "Performance" mode, motion brush, camera control, and text content generation make video creation more professional and vivid and the experience smoother.
AiBase Highlights:
⚙️ The update adds a universal floating creation panel, a homepage inspiration library, and a creative workbench, improving the user experience.
🚀 PixVerse V2.5 launches globally, improving video generation quality and speed as well as accuracy and aesthetics.
🎨 New features such as a high-performance "Performance" mode, motion brush, camera control, and text content generation make creation more professional and vivid and the experience smoother.
Details Link: https://pixverse.ai/
5. CNKI's Huazhi Large Model 5.0 Released: More Comprehensive Application Scenarios, More Powerful Reasoning Capabilities
Version 5.0 of the Huazhi Large Model was released at a seminar co-hosted by Tongfang CNKI and Huawei Cloud. The release upgrades application scenarios, reasoning capabilities, and the credibility of generated content, and it introduces new applications such as intelligent PPT, AI technology novelty search, the Huazhi app, and 3D holographic interactive digital humans. Huazhi 5.0 is described as a full-series, multimodal, knowledge-rich, high-credibility model. It has been applied in education and research, industry and agriculture, government and finance, and the medical and legal fields, and the CNKI AI learning and research assistant has been adopted by thousands of institutional users.
AiBase Highlights:
🌟 More comprehensive application scenarios
🚀 More powerful reasoning capabilities
💡 Introduces intelligent PPT, AI technology novelty search, Huazhi APP, 3D holographic interactive digital humans, and other new applications
6. Shocking Resource Consumption of ChatGPT! Writing an Email Equals Drinking a Bottle of Water
Recent research reports that using ChatGPT to write emails consumes a significant amount of water and electricity, potentially exacerbating drought problems. The resource consumption of AI is a growing concern and calls for sustainability policies.
AiBase Highlights:
💧 Generating a 100-word email with ChatGPT consumes about 519 milliliters of water, roughly one bottle of mineral water.
⚡ Training GPT-3 consumed 700,000 liters of water, and generating one such email consumes 0.14 kilowatt-hours of electricity.
🌱 Over-reliance on AI may strain resources, and businesses need to adopt sustainability policies.
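To get a feel for how these per-email figures add up, here is a minimal Python sketch that scales them linearly. It simply reuses the two numbers quoted above (519 milliliters and 0.14 kilowatt-hours per 100-word email) without verifying them; the function name `footprint` and the 52-email usage are illustrative assumptions, and real consumption is unlikely to scale this neatly.

```python
# Per-email figures as quoted in the item above (not independently verified).
WATER_ML_PER_EMAIL = 519      # milliliters of water per 100-word email
ENERGY_KWH_PER_EMAIL = 0.14   # kilowatt-hours per 100-word email


def footprint(num_emails: int) -> tuple[float, float]:
    """Return (liters of water, kWh of electricity) for num_emails emails.

    Assumes simple linear scaling, which is only a rough approximation.
    """
    if num_emails < 0:
        raise ValueError("num_emails must be non-negative")
    water_liters = num_emails * WATER_ML_PER_EMAIL / 1000
    energy_kwh = num_emails * ENERGY_KWH_PER_EMAIL
    return water_liters, energy_kwh


# Hypothetical usage: one AI-written email per week for a year (52 emails).
water, energy = footprint(52)
print(f"{water:.1f} L of water, {energy:.2f} kWh")  # 27.0 L of water, 7.28 kWh
```

Under these assumptions, a single weekly email works out to about 27 liters of water and a little over 7 kilowatt-hours per year.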
7. Deepgram Launches Real-Time Intelligent Dialogue API, Revolutionizing Human-Machine Interaction Experience
Deepgram's latest AI voice agent API aims to make human-machine interaction feel more natural for businesses and developers. The API integrates speech recognition and speech synthesis and supports real-time dialogue understanding and generation, making it easier to build efficient voice assistants.
AiBase Highlights:
🚀 The API integrates speech recognition and synthesis and supports real-time dialogue understanding and generation, making interaction more natural.
💡 An end-of-thought detection model handles pauses and interruptions gracefully, making conversations smoother and more natural.
🔧 The API is flexible: it supports integration with multiple large language models, keeps response latency within 1 second, and suits applications in many fields.
Details Link: https://deepgram.com/agent/
8. StoryMaker: Maintaining Character Consistency in Multi-Character Scenes Made Easy
StoryMaker is a personalization solution that brings consistency and coherence to sequences of AI-generated images, allowing creators to build engaging visual narratives. Its core advantage is maintaining character consistency: it preserves facial features, clothing, hairstyles, and body postures across multi-character scenes. Its flexibility opens up new creative options for AI-assisted creation in digital art and entertainment.
AiBase Highlights:
🔑 Strong character consistency: facial features, clothing, hairstyles, and postures stay highly consistent.
🌟 Wide range of applications: users control the background, character poses, and style of generated images through simple text commands, creating image sequences that fit specific narrative needs.
🎨 Strong flexibility: supports advanced features such as clothing exchange and character interpolation and integrates with other generation plugins, offering diverse creative possibilities.
Details Link: https://top.aibase.com/tool/storymaker
9. Former Apple Design Director Jony Ive Confirms Collaboration with OpenAI to Create Mysterious AI Device
Jony Ive is collaborating with OpenAI to develop a mysterious AI hardware device aimed at creating a computing experience less socially disruptive than the iPhone. The project team is strong, with several members having participated in the design of classic Apple products. The market is watching the collaboration closely, hoping it will produce a fresh kind of AI device.
AiBase Highlights:
🌟 Jony Ive collaborates with OpenAI to develop a mysterious AI hardware device.
🤖 The goal of the new device is to create a computing experience less socially disruptive than the iPhone.
🛠️ The project team is strong, with several members having participated in the design of classic Apple products.
10. Apple's New Siri Powered by Apple Intelligence May Launch Earlier
According to Mark Gurman's Power On newsletter, Apple may release a completely rebuilt Siri based on Apple Intelligence earlier than expected. Users may get some features sooner, although not the full experience. Gurman also shared new details about the release timeline of Apple Intelligence features.
AiBase Highlights:
🚀 Apple may release a new Siri based on Apple Intelligence earlier than expected, giving users some features sooner.
💡 The new Siri features are expected to arrive in iOS 18.3, earlier than the previously expected iOS 18.4.
📅 The development timeline and release dates for the iOS 18 series have also been detailed, including the release schedule for iOS 18.1 through iOS 18.4.
11. Google Invests $120 Million to Establish Global AI Opportunity Fund
Google announced a $120 million investment to establish the Global AI Opportunity Fund, which aims to promote AI education worldwide. The fund will collaborate with non-profit organizations to provide multilingual AI training, narrowing the digital divide between countries. CEO Sundar Pichai called for policies that promote AI innovation and emphasized the importance of AI in achieving sustainable development goals.
AiBase Highlights:
🌐 Google invests $120 million to establish the "Global AI Opportunity Fund," promoting global AI education.
🤝 The fund will collaborate with non-profit organizations to provide multilingual AI training, narrowing the digital divide between countries.
📈 CEO Sundar Pichai calls for policies to promote AI innovation, emphasizing the importance of AI in achieving sustainable development goals.
12. Perplexity AI Plans to Launch New "Sponsored Q&A" Advertising System
Perplexity AI plans to launch a new "Sponsored Q&A" advertising system and is in talks with Nike and Marriott about cooperation, challenging Google's dominance in the digital advertising market. The system is reportedly priced significantly lower than Google's, which could attract more brands. Perplexity is a unicorn company with a valuation exceeding $1 billion. However, the company also faces accusations of plagiarism and has taken measures to address them.