AI Daily: Doubao Launches Ola Friend Smart Earbuds; Vivo Releases New Bluemind Large Model Matrix; Open Source Version of NotebookLM is Here

站长之家

Published inAI News · 14 min read · Oct 10, 2024

167

Welcome to the AI Daily section! This is your daily guide to exploring the world of artificial intelligence. Each day, we bring you the hottest topics in the AI field, focusing on developers to help you understand technological trends and discover innovative AI product applications.

Discover fresh AI products by clicking here: https://top.aibase.com/

1. Doubao Launches Ola Friend AI-Enabled Headphones: Priced at 1199 Yuan

Doubao has introduced the Ola Friend AI-enabled headphones, designed to provide users with a companion AI friend always by their side. These headphones, integrated with AI technology, offer functions such as listening to music, learning English, and receiving tour guide services. Users need to download the Doubao app to fully utilize these features.

AiBase Highlights:
🎧 Ola Friend AI-enabled headphones are priced at 1199 yuan and support intelligent conversation functions.
🤖 The Ola Friend headphones serve as a companion AI friend, offering functions like listening to music, learning English, and tour guide services.
📱 Users need to download the Doubao app to fully utilize the features of the Ola Friend headphones, which supports functions like searching for information, answering questions, and inspiring creativity.

2. vivo Unveils the New BlueCore Large Model Matrix

At the 2024 vivo Developer Conference, vivo unveiled the new BlueCore large model matrix, upgrading language, voice, image, and multimodal capabilities to provide more powerful performance and functions. The new BlueCore large model matrix has set a new industry standard, offering users a superior experience.

AiBase Highlights:
🚀 The BlueCore large model matrix has been comprehensively upgraded, including language, edge-side, voice, image, and multimodal large models.
💡 The 3 billion BlueCore edge-side large model 3B has been introduced, with a 300% performance improvement, a 46% reduction in power consumption, a 63% reduction in memory usage, and a typing speed of 80 words per second.
🔊 The newly developed BlueCore voice large model supports natural semantic understanding, emotional expression, and simultaneous interpretation, while the image & multimodal large model enhances the generation capabilities of Chinese characteristics and Oriental aesthetics.

3. Open-Source Version of NotebookLM is Here! Podcastfy: Convert PDFs, Text, URLs, and More into Podcasts

In the digital age, creating engaging multilingual audio content has become a hot topic. Google's NotebookLM has received high praise, and the open-source Python package Podcastfy has garnered widespread attention. Podcastfy is an open-source version of NotebookLM, utilizing advanced generative AI technology to enable users to create more personalized and scalable podcasts.

AiBase Highlights:
🌟 Podcastfy is an open-source Python package that converts text and web content into multilingual audio dialogues.
🎧 Users can experience Podcastfy through the Gradio demo application or HuggingFace, with simple and easy-to-use operations.
⚠️ When using external content, ensure you have the copyright and permissions, as the generated audio content is AI-generated and does not mimic real individuals.
Detailed link: https://github.com/souzatharsis/podcastfy-demo?tab=readme-ov-file

4. New Magic in Image Restoration! Breakthrough Algorithm PMRF

The PMRF (Posterior Mean Correction Flow) algorithm is an innovative technology in the field of image processing, resolving the contradiction between distortion and perceptual quality in image restoration, opening up new possibilities for high-quality image reconstruction. Its uniqueness lies in its exceptional performance in various image restoration tasks, achieving remarkable results and balancing distortion and perceptual quality.

AiBase Highlights:
✨ The PMRF algorithm cleverly combines posterior mean prediction and correction flow models to create a new image restoration framework, minimizing distortion and enhancing perceptual quality.
🌟 Widely applicable, covering denoising, super-resolution, repairing damaged areas, and color restoration, producing natural and realistic images.
💡 In benchmark and real dataset tests, PMRF performs excellently, balancing distortion and perceptual quality, setting a new standard for image restoration.
Detailed link: https://huggingface.co/spaces/ohayonguy/PMRF

5. Walmart Introduces New AI Model Wallaby

Walmart recently launched the Wallaby large language model, focusing on retail industry data to enhance customer service experiences. They adopt a multi-model combination approach to flexibly address different application needs. The upgraded customer support assistant can more accurately understand customer intentions and provide personalized services.

AiBase Highlights:
✨ Walmart introduces the Wallaby large language model, focusing on retail industry data to enhance customer service experiences.
🤖 Walmart adopts a multi-model combination approach to flexibly address different application needs.
🛍️ The upgraded customer support assistant can more accurately understand customer intentions and provide personalized services.

6. Astonishing! GPT-4 Accidentally Masters Facial Recognition Technology, Accuracy Exceeds Professional Algorithms

Recent research shows that GPT-4 possesses facial recognition, gender determination, and age estimation capabilities, with accuracy surpassing professional algorithms but with potential security risks. The study reveals methods to bypass GPT-4's safety mechanisms, sparking thoughts about the security of large language models. Although GPT-4 performs excellently in biometric tasks, the authors warn against fully relying on its recognition abilities.

AiBase Highlights:
🌟 GPT-4 achieved a perfect accuracy rate of 100% in gender recognition tests, surpassing the DeepFace model.
📊 GPT-4's age estimation accuracy is 74.25%, but its estimates for older individuals may be more broad.
🔒 The study found ways to bypass GPT-4's safety mechanisms, necessitating enhanced research on the security of large language models.

7. 2 Million Users! Hugging Face Releases Gradio 5: Easily Build AI Applications with Natural Language

Hugging Face has released Gradio 5, dedicated to simplifying AI development, providing enterprise-level security, and AI Playground features, further advancing the AI application development experience.

AiBase Highlights:
🌟 Gradio5 introduces enterprise-level security to ensure application safety.
🚀 The new AI Playground feature simplifies the development process, making application generation effortless.
🔮 Hugging Face plans for the future, introducing various new features to further enhance the AI application development experience.
Detailed link: https://www.gradio.app/

8. OpenAI Seeks Court Dismissal of Musk's Lawsuit, Labeling it as "Harassment"

In this article, OpenAI is seeking to dismiss Musk's lawsuit against the company, labeling it as "harassment." The article reveals the background of the legal dispute between Musk and OpenAI, emphasizing that Musk's accusations lack evidence and questioning his legal standing.

AiBase Highlights:
🌟 Musk's multiple lawsuits against OpenAI are labeled as "harassment" by OpenAI and are being sought to be dismissed.
📉 OpenAI emphasizes that Musk's accusations lack evidence, calling them unrealistic claims.
⚖️ Musk claims that OpenAI has not followed the commitments of the founding agreement, but his legal right to make such claims is questioned.

9. Zoom Introduces Digital Avatar Feature: Convenience or Concern?

Zoom's plan to introduce a digital avatar feature has raised concerns about deepfake technology. While this feature can improve video creation efficiency, it may also pose risks of spreading false information.

AiBase Highlights:
✨ Zoom plans to introduce a digital avatar feature, converting user videos into AI-driven realistic digital avatars to improve asynchronous communication efficiency.
💡 The popularity of deepfake technology makes it difficult to distinguish between truth and falsehood, potentially leading to the misuse of fake videos.
🔒 Zoom's description of security measures is vague, and further protection measures are needed to prevent the generation of malicious fake videos.

10. DressRecon: Input Video to Build a 3D Model with Detailed Clothing

Recently, a research team from Carnegie Mellon University released a new technology called "DressRecon," which achieves high-quality human reconstruction from monocular video, especially suitable for loose clothing and hand-held objects. This technology uses neural implicit models to separate body and clothing deformations, leveraging image-based prior knowledge to capture subtle geometric features. The reconstruction results generate high-fidelity 3D models that support rendering from any angle, enhancing the visualization experience.

AiBase Highlights:
👗 The research team introduces the DressRecon technology, achieving high-quality human reconstruction from monocular video, especially suitable for loose clothing and hand-held objects.
📷 The technology uses neural implicit models to separate body and clothing deformations, leveraging image-based prior knowledge to capture subtle geometric features.
🎥 The reconstruction results not only generate high-fidelity 3D models but also support rendering from any angle, enhancing the visualization experience.
Detailed link: https://jefftan969.github.io/dressrecon/

11. DreamWaltz-G: Generate Vivid 3D Animated Avatars from Text

In the digital era, personalized virtual avatars are gaining attention. The DreamWaltz-G framework combines bone-guided score distillation and mixed 3D Gaussian representations to enhance the consistency and animation expressiveness of avatar generation. This framework supports shape control, video reenactment, and multi-subject scene construction, expanding the possibilities of digital content creation.

AiBase Highlights:
📌 The innovative framework DreamWaltz-G can generate vivid 3D animated avatars based on text descriptions.
🎨 Combining bone-guided score distillation and mixed 3D Gaussian representations, it enhances the consistency and animation expressiveness of avatar generation.
🎥 Supports shape control, video reenactment, and multi-subject scene construction, expanding the possibilities of digital content creation.

Doubao Mobile Assistant Encounters WeChat Login Issues, Official Response Suggests Triggering Security Risk Control

Some users of the nubia M153 engineering device, a collaboration between Doubao and ZTE, reported being unable to log into WeChat on December 3, with prompts indicating abnormal login environments. WeChat attributed this to potential security controls, and Doubao's team is investigating. Issues included login failures, forced logouts during data transfers, and automatic exits.....

BeanBag App's one-sentence photo editing function has been fully upgraded based on SeedEdit 3.0

Today, BeanBag App has made a significant upgrade in the field of intelligent photo editing. Its one-sentence photo editing function has been fully optimized based on the latest image editing model, SeedEdit 3.0, bringing more diverse photo editing experiences to users. After this upgrade, BeanBag App supports various photo editing operations, greatly meeting different photo editing needs of users. Users can easily add or replace text with just one click, adding unique text elements to photos to make youthful memories图文并茂and more memorable.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

AI Daily: Doubao Launches Ola Friend Smart Earbuds; Vivo Releases New Bluemind Large Model Matrix; Open Source Version of NotebookLM is Here

站长之家

This article is from AIbase Daily

AI News Recommendations

Doubao Mobile Assistant Encounters WeChat Login Issues, Official Response Suggests Triggering Security Risk Control

AI Daily: Lovart AI Launches Element Separation Feature; Xcode 26.1.1 Released; Alibaba Cloud Tongyi Model Makes Its Largest-Scale Deployment for Double 11

AI Daily: HeyGen Launches AI Video Translation Engine; iFLYTEK Unveils Spark X1.5; QQ Browser Introduces AI + Small Window

AI Daily: Kunlun Tech SkyReels V3 Model Released; Moonshot AI Launches Kimi Linear Model; MiniMax Music 2.0 Released

AI Daily: Meituan's LongCat-Flash-Omni Released; Qwen3-Max Launches Deep Thinking Feature; Baidu Wenshi 5.0 Makes a Strong Return

AI Daily: Sora's Free Quota to Shrink; Moonshot Releases Kimi Linear Architecture; Canva Freely Releases Affinity Professional Design Suite

AI Daily: OpenAI Releases Browser Atlas; Tongyi Qwen3-VL Adds Two Model Sizes, 2B and 32B; Baidu Launches Recurrent Evidence Enhancement Large Model

AI Daily: Google Gemini 3.0 Pro is being rolled out on a limited scale; Aishike Technology completes B+ round financing of 100 million yuan; Baidu releases document parsing model PaddleOCR-VL

AI Daily: Qilin 2.1 Launches New Start and End Frame Feature; Kunlun Wanzhi Launches AI Music Model Mureka V7.5; Tencent Cloud Launches AI Development Tool CloudBase AI CLI

BeanBag App's one-sentence photo editing function has been fully upgraded based on SeedEdit 3.0

AI News Recommendations

Doubao Mobile Assistant Encounters WeChat Login Issues, Official Response Suggests Triggering Security Risk Control

AI Daily: Lovart AI Launches Element Separation Feature; Xcode 26.1.1 Released; Alibaba Cloud Tongyi Model Makes Its Largest-Scale Deployment for Double 11

AI Daily: HeyGen Launches AI Video Translation Engine; iFLYTEK Unveils Spark X1.5; QQ Browser Introduces AI + Small Window

AI Daily: Kunlun Tech SkyReels V3 Model Released; Moonshot AI Launches Kimi Linear Model; MiniMax Music 2.0 Released

AI Daily: Meituan's LongCat-Flash-Omni Released; Qwen3-Max Launches Deep Thinking Feature; Baidu Wenshi 5.0 Makes a Strong Return

AI Daily: Sora's Free Quota to Shrink; Moonshot Releases Kimi Linear Architecture; Canva Freely Releases Affinity Professional Design Suite

AI Daily: OpenAI Releases Browser Atlas; Tongyi Qwen3-VL Adds Two Model Sizes, 2B and 32B; Baidu Launches Recurrent Evidence Enhancement Large Model

AI Daily: Google Gemini 3.0 Pro is being rolled out on a limited scale; Aishike Technology completes B+ round financing of 100 million yuan; Baidu releases document parsing model PaddleOCR-VL

AI Daily: Qilin 2.1 Launches New Start and End Frame Feature; Kunlun Wanzhi Launches AI Music Model Mureka V7.5; Tencent Cloud Launches AI Development Tool CloudBase AI CLI

BeanBag App's one-sentence photo editing function has been fully upgraded based on SeedEdit 3.0