In China's "AI + social" arena, Soul App is injecting new vitality with AI!

Recently, Soul officially announced an upgrade to its voice large model, launching its self-developed end-to-end full-duplex voice call large model.

The most impressive result of this upgrade is that users can now have voice calls with virtual characters as naturally and smoothly as chatting with a real person!

How realistic is it? You can get a feel by watching the video below:

Official demo video: real-time AI call examples

So, what makes Soul's self-developed end-to-end voice call large model special? According to official descriptions, its key highlights include:

  • Ultra-low interaction latency

  • Quick automatic interruption

  • Super-realistic voice expression

  • Emotion perception and understanding capabilities

Ultra-low interaction latency means the AI responds the moment you finish speaking, with no perceptible delay. There is no waiting for the model to "think"; the conversation flows just as it would with a real person.

Soul's voice large model also supports quick automatic interruption. When you want to interject mid-sentence, the model recognizes your intent, stops speaking, and hands the turn back to you, just as a person would in natural conversation.
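Soul has not published how its interruption handling works, but in full-duplex systems this behavior (often called "barge-in") is typically built by monitoring the user's microphone stream while the AI is speaking and cancelling playback once sustained speech is detected. The sketch below is a hypothetical illustration of that control flow only: the energy-based detector, the `BargeInDetector` name, and all thresholds are assumptions, not Soul's implementation.

```python
# Hypothetical sketch of barge-in handling in a full-duplex voice loop.
# Not Soul's actual implementation; the energy-based detection and all
# thresholds here are illustrative assumptions.

from dataclasses import dataclass


@dataclass
class BargeInDetector:
    energy_threshold: float = 0.02  # assumed RMS level that counts as speech
    min_speech_frames: int = 3      # require sustained speech, not a cough
    _streak: int = 0

    def feed(self, frame: list[float]) -> bool:
        """Return True once the user has clearly started speaking."""
        rms = (sum(s * s for s in frame) / len(frame)) ** 0.5
        self._streak = self._streak + 1 if rms > self.energy_threshold else 0
        return self._streak >= self.min_speech_frames


def duplex_playback(ai_frames, mic_frames, detector):
    """Play AI audio frame by frame; stop as soon as the user barges in.

    Returns the number of AI frames actually played.
    """
    played = 0
    for ai_frame, mic_frame in zip(ai_frames, mic_frames):
        if detector.feed(mic_frame):
            break          # user interrupted: cancel the AI's speech
        played += 1        # otherwise keep playing this frame
    return played
```

In a production system the microphone and speaker streams run concurrently and the detector would be a trained voice-activity or intent model rather than an energy threshold, but the core control flow, listening while speaking and cancelling on detection, is the same.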

Finally, with super-realistic voice expression plus emotion perception and understanding, the AI not only understands what you say but also senses how you feel and responds accordingly.

Judging from the official demo video, once this feature fully launches, many Soul users may find it hard to tell real people from AI virtual characters.

Soul stated that the end-to-end voice call large model is already live in the "Echoes of Another World" real-time call scenario (currently in beta) and will be extended to more AI companionship and interaction scenarios, such as "AI Gou Dan".


It is understood that Soul began AIGC research as early as 2020, focusing on key technologies such as intelligent dialogue, voice, and virtual characters, and deeply integrating these AI capabilities into its social scenarios.

In the process of upgrading social interactions with AI, Soul particularly emphasizes achieving anthropomorphic and natural emotional companionship experiences.

To provide users with better emotional feedback and companionship, Soul's technical team has focused on emotional understanding and latency. It has released self-developed large models for voice generation, speech recognition, voice dialogue, and music generation, supporting realistic voice synthesis, voice DIY, multilingual switching, and multi-emotion real-time conversation with humans. These capabilities are already deployed across multiple Soul scenarios, including "AI Gou Dan", real-time AI voice interaction in "Werewolf Shadow", and "Echoes of Another World".

The launch of Soul's self-developed end-to-end voice call large model means that users can enjoy a more natural human-computer interaction experience. In the future, Soul also plans to further promote the construction of multi-modal end-to-end large model capabilities, making human-AI interactions more interesting and immersive.