Welcome to the "AI Daily" column! Here is your daily guide to exploring the world of artificial intelligence. Every day, we present you with the hottest topics in the AI field, focusing on developers to help you gain insights into technological trends and understand innovative AI product applications.

Fresh AI products: click to learn more at https://top.aibase.com/

1. iFLYTEK Launches Spark Multimodal Interaction Model, Achieving “Voice, Vision, and Digital Human Interaction” Integration

The newly launched iFLYTEK Spark multimodal interaction model marks a new stage of development, expanding from voice-only interaction to real-time multimodal interaction over audio and video. The model integrates voice, vision, and digital human interaction in a single system and introduces hyper-realistic digital human technology to make the AI more vivid and lifelike. It achieves more authentic and coherent emotional expression through cross-modal semantic consistency, supports hyper-realistic, rapid interaction, and provides personalized interaction experiences. Multimodal visual interaction lets the model perceive background scenes and logistical states, giving users a richer and more precise interaction experience.

[AiBase Highlights:]

🌟 iFLYTEK launches the Spark multimodal interaction model, which integrates voice, vision, and digital human interaction in a single system.

🔥 Introduces hyper-realistic digital human technology, ensuring precise alignment of digital human actions with voice content, enhancing the vividness and realism of AI.

💡 Supports hyper-realistic, rapid interaction through end-to-end voice-to-voice modeling, providing personalized interaction experiences.

Details link: https://www.xfyun.cn/solutions/Multimodel

2. Anthropic Releases New Prompt Optimization Feature

Anthropic's new prompt optimization feature, the prompt improver, is designed to make AI applications more reliable and efficient for developers. It automatically refines prompts using advanced prompt engineering techniques; in Anthropic's testing it raised accuracy by 30% on a multilabel classification test and brought word count adherence to 100% on a summarization task. A companion example management feature lets developers manage examples in a structured format. Kapa.ai, which has migrated multiple AI workflows to the Claude platform, credits the improver with speeding up its path to production.

[AiBase Highlights:]

🔍 The new prompt optimization feature enhances the reliability and efficiency of AI applications.

🚀 The optimizer automatically refines prompts, improving accuracy by 30% on a multilabel classification test and bringing word count adherence to 100% on a summarization task.

💡 The example management feature lets developers manage examples in a structured way, and Kapa.ai has migrated multiple AI workflows to the Claude platform.

Details link: https://www.anthropic.com/news/prompt-improver

3. Major Upgrade! ChatGPT Desktop Version for Windows Fully Launched, macOS Version Adds Application Collaboration Features

OpenAI has released an important update that improves the usability of ChatGPT on Windows and macOS. The Windows desktop app has officially launched, and the macOS beta now integrates with popular coding applications, so ChatGPT can act as a real-time assistant for developers.

[AiBase Highlights:]

🚀 The ChatGPT desktop application for Windows is fully launched, enhancing user experience.

💻 The macOS version adds application collaboration features, allowing developers to analyze code and receive intelligent suggestions directly.

📈 OpenAI plans to expand support for more applications, enhancing the practicality of AI tools in desktop work.

Details link: https://openai.com/chatgpt/desktop/?ref=maginative.com

4. Tencent's AI Smart Workbench ima.copilot Launches Windows Version

Tencent has launched the Windows version of ima.copilot, which goes beyond search to answer questions, write text, and generate images, reflecting Tencent's continued investment in artificial intelligence. ima's search function draws on WeChat public account articles, which enriches search results and improves the efficiency and quality of information retrieval. ima also handles local files, supports multilingual translation, provides a personal knowledge base, and offers a 24-hour online personal assistant, giving users a convenient and efficient work and learning experience.

[AiBase Highlights:]

🔍 The search function integrates resources from WeChat public account articles, enriching search results and improving information acquisition efficiency and quality.

📄 Handles local files, automatically summarizes content, generates mind maps, and supports multilingual translation for user convenience.

📚 Provides a personal knowledge base and 24-hour online personal assistant services, creating a dedicated library for users and offering a convenient and efficient work and learning experience.

Details link: https://ima.qq.com/

5. Generate Applications with a Single Sentence! Alibaba Tongyi Launches Code Mode

Alibaba Tongyi Lab has launched a code mode that lets users generate a range of applications, including mini-games, data charts, websites, and resumes, from simple everyday-language instructions. Users can visit the Tongyi web version and click "Code Mode" to try it. The mode is especially useful for non-programmers because it ships with preset templates for popular applications, such as a personal resume and the 2048 mini-game. It is built on Qwen2.5-Coder, which improves its programming performance and efficiency.

[AiBase Highlights:]

👩‍💻 Tongyi's code mode allows users to generate various applications through simple commands, including mini-games and data charts.

🌐 Users can visit the Tongyi web version and click on "Code Mode" to start experiencing this new interaction method.

🚀 The code mode, developed based on Qwen2.5-Coder, enhances AI programming performance and efficiency.

6. Boston Dynamics Spot Robot Gains New Skills to Easily Avoid Obstacles Like Wires and Ladders!

Boston Dynamics' robotic dog Spot has received a significant software update that greatly improves its mobility in complex environments. The update strengthens Spot's autonomous navigation and lays the groundwork for deploying it in more demanding settings.

[AiBase Highlights:]

🐶 The Spot robot can now automatically identify and avoid obstacles like wires and ladders.

🤖 The latest video features a mysterious dinosaur-headed robot, sparking curiosity among viewers.

📈 Software updates enhance Spot's navigation capabilities, expanding its application prospects.

7. Google Gemini Exp1114 Debuts! Dominates GPT-4 in Initial Battle, Tops Multiple Capability Evaluations, Shocking the Industry

Google's latest experimental version of Gemini (Exp1114) has achieved remarkable results on the Chatbot Arena platform. Gemini-Exp-1114 ties for first place overall with GPT-4-latest after a score jump of more than 40 points over the previous Gemini version, and it leads in core areas such as mathematics, complex prompts, and creative writing. Industry analysts believe this result indicates that Google's long-term investment in AI is beginning to pay off.

[AiBase Highlights:]

🚀 Gemini-Exp-1114 ties for first place with GPT-4-latest in overall scoring, showcasing strong comprehensive capabilities.

💡 Gemini-Exp-1114 ranks first in core areas such as mathematics, complex prompts, and creative writing.

🔗 The breakthrough of Gemini-Exp-1114 shows that Google's long-term investment in AI is beginning to bear fruit, sparking industry discussions and attention.

8. TikTok Launches Powerful AI Video Creation Tool Symphony, Empowering the Entire Process of Commercial Advertising Creation

TikTok has announced the full launch of Symphony Creative Studio, which gives advertisers and content creators a simpler and more efficient way to produce video at no additional cost. The launch intensifies competition among social media platforms in AI creative tools and signals TikTok's commitment to AI video creation and to commercialization.

[AiBase Highlights:]

🚀 Symphony Creative Studio integrates video generation, transformation, and expansion functions, helping advertisers and creators overcome creative bottlenecks and providing rapid video content generation capabilities.

👥 Supports AI virtual character video creation: users choose a ready-made or customized avatar, and the system automatically generates videos that advertisers can then refine.

🎨 Provides video translation and dubbing, editing of existing videos, and other functions, and automatically generates video content based on advertisers' historical activity, giving brand advertisers a more efficient and creative production experience.

9. AI Competes in Minecraft! Claude's New Version Amazes the Internet with Its Building Skills

Recently, a unique AI capability evaluation took place in Minecraft, attracting widespread attention. The old and new versions of Claude 3.5 Sonnet competed in building tasks, revealing significant differences in capability, with the new version (informally called Sonnet 3.6) performing exceptionally well. The evaluation has been jokingly called the only reliable benchmark, and it has been released on GitHub with support from the open-source community. The AI construction process does not rely on visual understanding; it works through contextual operation instructions expressed as text.

[AiBase Highlights:]

🌟 Sonnet 3.6 excelled in creativity, receiving support from over 2000 voters.

🧠 The AI construction process does not depend on visual understanding; it works through contextual operation instructions expressed as text.

🔧 The project team plans to refine the evaluation mechanism further, building a scoring system similar to the LMSYS Chatbot Arena that uses the Elo algorithm to rank models based on human votes.

Details link: https://x.com/mckaywrigley/status/1849613686098506064
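The Elo-style ranking mentioned in the highlights above can be illustrated with a minimal sketch. This is not the project's code: the function names, the K-factor of 32, the starting rating of 1000, and the sample votes are all assumptions chosen for illustration. Each human vote is treated as one head-to-head match between two builds.

```python
from collections import defaultdict

K_FACTOR = 32.0          # assumed update step; the project's value is not stated
START_RATING = 1000.0    # assumed initial rating for every model


def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def apply_vote(ratings: dict, winner: str, loser: str, k: float = K_FACTOR) -> None:
    """Update both ratings in place after one human vote for `winner`."""
    delta = k * (1.0 - expected_score(ratings[winner], ratings[loser]))
    ratings[winner] += delta
    ratings[loser] -= delta  # zero-sum: total rating is conserved


def rank_models(votes: list) -> list:
    """Replay (winner, loser) votes in order and return models sorted by rating."""
    ratings = defaultdict(lambda: START_RATING)
    for winner, loser in votes:
        apply_vote(ratings, winner, loser)
    return sorted(ratings.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical votes; the model labels are placeholders, not real results.
    sample_votes = [("new-sonnet", "old-sonnet"),
                    ("new-sonnet", "old-sonnet"),
                    ("old-sonnet", "new-sonnet")]
    for model, rating in rank_models(sample_votes):
        print(f"{model}: {rating:.1f}")
```

Because each update depends on the current ratings, an upset win against a higher-rated model moves the scores more than an expected win does, which is why Elo suits rankings built from many small pairwise votes.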

10. Pony.ai Officially Launches IPO, Expected to Raise Up to $378 Million

Pony.ai has officially launched its IPO, planning to list on NASDAQ and raise up to $378 million. Several automotive manufacturers are participating in the subscription, which supports Robotaxi technology cooperation and global expansion.

[AiBase Highlights:]

🌟 Pony.ai initiates IPO, planning to list on NASDAQ and raise up to $378 million.

🚗 The proceeds will mainly be used for the commercialization of autonomous driving services and for technology research and development.

🤝 Several automotive manufacturers are participating in the subscription, supporting Robotaxi technology cooperation and global expansion.