Welcome to the [AI Daily] column! This is your daily guide to the world of artificial intelligence. Every day we round up the hot topics in AI with a focus on developers, helping you keep up with technical trends and innovative AI product applications.

Discover new AI products: https://top.aibase.com/

1. Graduate-level reasoning! Anthropic releases the Claude 3.5 Sonnet model, which can run code in the chat window

Anthropic today announced Claude 3.5 Sonnet, the first model in the Claude 3.5 series. It outperforms competing models and its predecessor, Claude 3 Opus, in multiple evaluations while keeping the speed and cost of a mid-range model. Claude 3.5 Sonnet sets new industry benchmarks in graduate-level reasoning, undergraduate-level knowledge, and coding.

【AiBase Summary:】

⭐ Performance: Strong results in multiple evaluations of reasoning, knowledge, and coding.

⭐ Speed and cost: Runs twice as fast as Claude 3 Opus at one-fifth of the cost.

⭐ Visual understanding: Particularly strong in tasks that require visual reasoning, such as interpreting charts and graphs, and can accurately transcribe text from imperfect images.

⭐ New feature: Artifacts lets users ask Claude to generate code snippets, text documents, or website designs.

Learn more: https://mp.weixin.qq.com/s/GIh5YZwIyw2qIj2Mtjej4g

2. First AI face-swapping software infringement case in Beijing concluded

In the first AI face-swapping software infringement case heard by the Beijing Internet Court, the court found that the defendant infringed the plaintiff's personal information rights but did not infringe the plaintiff's portrait rights. The case involves deep synthesis technology and the Personal Information Protection Law, and it has sparked discussion about the legality of face-swapping technology and privacy protection.

【AiBase Summary:】

🔍 The court found that the defendant infringed the plaintiff's personal information rights, but not the plaintiff's portrait rights.

💡 The face-swapping template videos do not make the plaintiff identifiable, so they do not infringe the plaintiff's portrait rights.

💻 The defendant's actions involved processing personal information and infringed the plaintiff's personal information rights.

Details: https://www.chinaz.com/ainews/9700.shtml

3. Tencent Yuanbao releases a new version, integrating with WeChat search

Tencent Yuanbao recently released a new version that improves ultra-long text processing and AI search and analysis, and adds WeChat search integration. The update makes processing ultra-long documents more efficient and broadens support for file formats, chart generation, and image analysis. Search now draws on WeChat search as well as other search engines to provide more comprehensive results.

【AiBase Summary:】

🚀 Improved ultra-long text processing: Handles a single document of up to 10 million words.

📊 Multi-file parsing: Parses up to 50 files at once and supports multiple file formats.

🔍 Enhanced search: Integrates with WeChat search and other search engines to provide intelligent search results.

Details link: https://top.aibase.com/tool/tengxunyuanbao

4. CNKI announces the launch of the CNKI AI Academic Research Assistant 4.0

CNKI has recently launched version 4.0 of its AI Academic Research Assistant, which combines large AI models with high-quality data to make literature retrieval, reading, and academic writing more efficient. New features include controllable generation, document expansion, scholar search, full-text translation, and academic expansion services to meet users' individual needs. The most notable upgrade is enhanced search and scholar search in a question-and-answer format. Try it at: https://top.aibase.com/tool/zhiwangcnki-ai-xueshuyanjiuzhushou

【AiBase Summary:】

🔍 Version 4.0 of the AI Academic Research Assistant combines large AI models with high-quality data to make literature retrieval, reading, and academic writing more efficient.

🔄 New features include controllable generation, document expansion, scholar search, full-text translation, and academic expansion services.

🔗 The most notable upgrade is enhanced search and scholar search in a question-and-answer format, which provides more accurate answers and detailed scholar information.

Details link: https://top.aibase.com/tool/zhiwangcnki-ai-xueshuyanjiuzhushou

5. Groq launches the whisper-large-v3 model, supporting speech transcription and translation, available for free

Groq now offers the Whisper Large-V3 model for speech transcription and translation. It can be used in the Playground or called from local projects through the API. Transcription is fast, and speech in multiple languages can be translated into English. The API is compatible with the OpenAI standard and provides speech-to-text and translation functions, so it is easy to integrate into applications. The model ID is "whisper-large-v3".

【AiBase Summary:】

🔊 High-speed transcription: A 4-minute 30-second video takes about 3 seconds to transcribe.

🌐 Multilingual support: Supports transcription and translation into English from multiple languages.

🛠️ API: Provides speech-to-text and translation functions that can be integrated into applications.

Details link: https://console.groq.com/playground
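For developers who want to call the API from a local project, the sketch below shows roughly what an OpenAI-style transcription request looks like. It only builds the HTTP request and does not send it. The endpoint URL, the form field names (`model`, `file`), and the helper name `build_transcription_request` are assumptions based on the OpenAI audio API convention, not details taken from Groq's announcement, so check the current Groq documentation before relying on them.

```python
import uuid
import urllib.request

# Assumption: Groq exposes an OpenAI-compatible transcription endpoint at this URL.
GROQ_TRANSCRIBE_URL = "https://api.groq.com/openai/v1/audio/transcriptions"


def build_transcription_request(audio_bytes, filename, api_key,
                                model="whisper-large-v3"):
    """Build (but do not send) a multipart/form-data POST for speech-to-text.

    Hypothetical helper for illustration; field names follow the OpenAI
    audio API convention and are assumptions here.
    """
    boundary = uuid.uuid4().hex

    # Plain form field carrying the model ID.
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode("utf-8")

    # File field carrying the raw audio bytes.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")

    closing = f"--{boundary}--\r\n".encode("utf-8")
    body = model_part + file_header + audio_bytes + b"\r\n" + closing

    return urllib.request.Request(
        GROQ_TRANSCRIBE_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To actually send the request, pass the returned object to `urllib.request.urlopen` with a real API key and audio file. Because the API follows the OpenAI standard, an OpenAI-compatible client library pointed at Groq's base URL should also work.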

6. Fudan open-source project Hallo is now available as a ComfyUI plugin

Hallo is an open-source project that generates talking videos from audio and images. It adopts an end-to-end diffusion paradigm and introduces a hierarchical audio-driven visual synthesis module to improve the alignment between audio input and visual output, producing natural-looking talking videos. The installation barrier is high, but the plugin opens up more possibilities for retouching and similar workflows and brings new vitality to the open-source ecosystem.

【AiBase Summary:】

🔊 The Hallo project is now available as a ComfyUI plugin and generates natural talking videos from audio and images.

🎤 Adopts an end-to-end diffusion paradigm and introduces a hierarchical audio-driven visual synthesis module to improve alignment accuracy.

😊 The hierarchical audio-driven module allows diverse, personalized control over expressions and poses, producing natural results.

Details link: https://github.com/AIFSH/ComfyUI-Hallo

7. AI tool Perplexity accused of continuing to scrape website content despite being banned

Perplexity is an AI startup that aims to reshape online reading by reinventing how people interact with information on the web. However, it has been accused of bypassing the Robots Exclusion Protocol (robots.txt) to obtain web content that sites had restricted, which has sparked controversy. The CEO defended the company's actions, but the company still faces criticism over copyright. The dispute raises ethical and legal dilemmas for the digital media field.

【AiBase Summary:】

🤖 Perplexity is accused of bypassing the Robots Exclusion Protocol to obtain restricted web content, sparking controversy.

💼 The CEO defended the company's actions, but the company faces criticism over copyright issues.

⚖️ Perplexity's actions raise ethical and legal dilemmas in the digital media field.
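For readers unfamiliar with the Robots Exclusion Protocol: a site publishes a robots.txt file stating which crawlers may fetch which paths, and compliant crawlers check it before requesting a page. The protocol is voluntary, which is why bypassing it is possible at all. The sketch below uses Python's standard `urllib.robotparser` to show what such a check looks like. The crawler name `ExampleAIBot`, the sample rules, and the helper `is_allowed` are made up for illustration and do not represent any real site's or company's configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one named crawler, allow everyone else.
EXAMPLE_ROBOTS = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file content as a list of lines; no network access needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


blocked = is_allowed(EXAMPLE_ROBOTS, "ExampleAIBot", "https://example.com/article")
permitted = is_allowed(EXAMPLE_ROBOTS, "OtherBot", "https://example.com/article")
print(blocked, permitted)
```

A well-behaved crawler would skip any URL for which this check returns False.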

8. Ukrainian influencer's face stolen by AI: After complaint, the company using her image deleted the related images

Olga Loiek, a 21-year-old Ukrainian YouTuber, recently discovered that someone had used AI to clone her likeness and that the cloned images were being abused online. The people behind the clones used AI to produce a large number of videos and claimed that the characters in them were Russian, which is false. The incident has sparked discussion of the legal and ethical issues around AI use, especially the protection of personal privacy and image rights.

【AiBase Summary:】

🔍 Abuse of cloning: More than 4,900 cloned videos spread false information and were involved in fraudulent behavior.

🛡 Image rights infringement: Repeated infringements of image rights could mislead the public and have sparked legal discussion.

⚖ AI technology challenges: The risk of abuse calls for vigilance and for protection of personal rights, and it raises ethical questions.

Details: https://www.chinaz.com/ainews/9707.shtml

9. Strong alliance! Universal Music partners with AI music startup SoundLabs to create voice clone models for singers

Universal Music Group has partnered with AI music technology company SoundLabs to launch the MicDrop feature, which lets artists create personalized voice models that they fully control. The feature helps artists break language barriers while protecting their rights, and it advances the application of AI in music creation.

【AiBase Summary:】

🎤 Artists create their own voice models and keep full control, with clear ownership and usage rights.

🎸 MicDrop can convert voice to instruments, giving music creation more flexibility and creative space.

🌍 A language conversion feature helps artists break language barriers, distribute music globally, and expand their audience.

10. Cure for regret? AI agents may become the remedy for impulse shopping after the 618 shopping festival

AI agents are being pitched as the rescue for impulsive shoppers during the 618 shopping festival, changing how consumers make purchase decisions. Agent platforms have sprung up rapidly, letting users combine different agents with a single click to build personalized assistants. AI shopping assistant agents combined with e-commerce are creating a new business model.

【AiBase Summary:】

🤖 AI agents have become a new tool for purchase decisions, changing the behavior of impulsive shoppers.

🔍 Agent platforms have emerged that let different agents collaborate with a single click, providing personalized assistant services.

💡 AI shopping assistant agents combined with e-commerce offer more accurate purchase suggestions and create a new business model.

11. GaussianCube: High-quality 3D generative modeling, performance leap of 74%!

GaussianCube is being presented as a breakthrough in 3D generative modeling that surpasses traditional NeRF-based approaches. It uses a density-constrained Gaussian fitting algorithm that simplifies the modeling process while achieving high-accuracy fitting. Experimental results show a performance improvement of up to 74%, which points to considerable potential.
