Information

Latest AI News

Explore AI Frontiers, Master Industry Trends

AI Daily Brief

Your Daily AI Brief - Never Miss What's Next

Information

AI Product Finder

Smart Product Discovery - Comprehensive Market Intelligence

AI Product Rankings

AI Product Power Rankings - Performance, Buzz & Trends

AI Product Submit

Submit Your AI Product - Amplify Reach & Drive Growth

Tools

AI Tools Directory

Discover The Best AI Websites & Tools

Information

AI Models Finder

Comprehensive AI Models Collection for All Your Development & Research Needs

LLM Leaderboard

AI LLM Power Rankings - Performance, Buzz & Trends

Model Providers

Discover Trusted AI Model Partners - Guaranteed Reliable Support

Submit Your Model

Submit Your Model Info & Services - Precision Marketing & User Targeting

Tools

Compare LLMs

Multi-Dimensional Large Model Comparison - Find Your Perfect Match

LLM Cost Calculator

Calculate AI Model Costs Accurately - Optimize Your Budget

LLM Arena

Multi-Model Real-Time Evaluation & Quick Output Comparison

Information

MCP Servers

Discover Popular AI-MCP Services - Find Your Perfect Match Instantly

MCP Client

Easy MCP Client Integration - Access Powerful AI Capabilities

MCP Case Tutorials

Master MCP Usage - From Beginner to Expert

MCP Ranking

Top MCP Service Performance Rankings - Find Your Best Choice

MCP Service Submission

Publish & Promote Your MCP Services

Tools

MCP Playground

Test MCP Services Freely - Quick Online Experience

MCP Inspector

Quick MCP Service Testing - Fast Deployment

GEO Services

Achieve Dominant Visibility in AI Search for Your Business or Brand with GEO Services

AI Search Visibility Checker

Detect brand's visibility on AI platforms

Tools

AI Model Compatibility Checker

Free PC Hardware Test for DeepSeek & Llama

Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

Easily Identify Audio Forgery! Zhejiang University and Tsinghua University Join Forces to Create the AI Voice Privacy Protection Tool SafeEar

AIbase基地

Published inAI News · 5 min read · Sep 26, 2024

152

In the rapidly advancing era of artificial intelligence, voice synthesis and conversion technologies are evolving at an unprecedented pace, delivering incredibly realistic and natural audio experiences. However, these advancements also introduce potential security risks, particularly with the "voice cloning" technology that could be exploited by malicious actors to threaten personal privacy and social stability.

To address this challenge, Zhejiang University's Intelligent System Security Lab and Tsinghua University have jointly developed a revolutionary voice forgery detection framework—SafeEar. This framework not only efficiently detects forged audio but also safeguards users' voice privacy during the detection process, achieving dual protection of security and privacy.

The core technology of SafeEar lies in its use of a neural audio codec-based decoupling model. This innovative design separates the acoustic features from the semantic information of the voice, relying solely on acoustic features for forgery detection. This not only significantly enhances detection accuracy but also ensures that the voice content is not leaked during the process, effectively protecting user privacy.

The framework includes modules such as the front-end decoupling model, bottleneck layer, obfuscation layer, forgery detector, and real-environment enhancement. Through the collaborative operation of these modules, SafeEar demonstrates exceptional detection capabilities against various forgery techniques, with a false alarm rate as low as 2.02%, nearly reaching the level of the most advanced technologies currently available. Furthermore, experiments have shown that attackers cannot recover the original voice content from the acoustic information, fully demonstrating SafeEar's outstanding performance in privacy protection.

The front-end module of SafeEar employs an innovative decoupling model that effectively distinguishes between acoustic and semantic information during the separation and reconstruction of voice features. Subsequently, the bottleneck and obfuscation layers further protect voice information through dimensionality reduction and random obfuscation, effectively preventing the extraction of real information even when faced with the most advanced voice recognition models.

In terms of forgery detection, SafeEar utilizes an acoustic input-based Transformer classifier, enhancing the precision and efficiency of the detection. Additionally, by simulating audio conditions under various environments with multiple audio codecs, SafeEar also improves the model's environmental adaptability.

After a series of rigorous experimental tests, SafeEar not only surpasses many traditional detection methods but also sets a new standard in the field of audio forgery detection. More importantly, SafeEar can protect users' voice privacy in real-time during practical applications, providing strong support for the secure development of intelligent voice services.

Through this technology, Zhejiang University and Tsinghua University have not only pioneered a new field in voice forgery detection but also constructed a rich audio dataset containing various languages and vocoders. This lays a solid foundation for future research and applications, ensuring that users can enjoy convenient voice services while also receiving better privacy protection.

The advent of SafeEar undoubtedly provides us with a powerful tool to address privacy challenges in the AI era, allowing us to enjoy the convenience of technology while better protecting our privacy security.

Paper Link: https://safeearweb.github.io/Project/files/SafeEar_CCS2024.pdf

Voice Synthesis Voice Cloning SafeEar Neural Audio Codec

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Claude Client Updated, Supports Taking Screenshots and Sending to Claude, Quick Key Voice Communication

Claude desktop app upgraded to a productivity tool with real-time screen, voice, and file support, now available on mobile. Core 'Screenshot Share' feature enables AI analysis of selected areas via hotkey drag, boosting cross-platform efficiency.....

Oct 22, 2025

290

Sesame Completes $250 Million Series B Funding, Revolutionary AI Voice Attracts Millions of Users to Try, Test Version of the App Launches Concurrently

Sesame, led by ex-Oculus co-founder and AR startup CTO, raised $250M in Series B. Beta testing iOS app for voice-based personal AI, aiming for smart glasses integration.....

Oct 22, 2025

360

Claude Desktop Version Major Upgrade: Supports Sending Screenshots, Caps Lock Turns into an AI Voice Magic Key

Claude desktop now free on Windows and Mac. Mac update adds screenshot sharing and Caps Lock voice control for seamless AI collaboration without switching windows.....

Oct 22, 2025

850

Clone Your Voice in 10 Seconds! Fish Audio S1 Upgrade Makes a Stunning Debut, Priced at Just One-Sixth of ElevenLabs

Fish Audio S1 voice cloning model upgraded, enhancing emotional expressiveness and realism. Generates speech with nuanced emotions, rhythm, and tone variations, redefining industry standards cost-effectively for superior user experience.....

Oct 21, 2025

160

Fish Audio Launches Upgraded S1 Voice Cloning Model: Clone Real Human Speech in 10 Seconds

Fish Audio released an upgraded version of the S1 voice cloning model, achieving breakthroughs in emotional expressiveness and realism. The model can generate realistic human-like voices with emotions, rhythm, and tone variations. It can clone a human voice with just 10 seconds of audio sample, fully preserving the original voice's accent, intonation, rhythm, and speaking habits, producing highly realistic results.

Oct 21, 2025

220

Microsoft Deepens AI Strategy: Core Integration of Copilot in Windows 11 Supports Voice Control, Screen Analysis, and Local Automation

Microsoft integrates Copilot into Windows 11 with generative AI, enabling voice control, screen analysis, and local automation to revolutionize PC interaction.....

Oct 17, 2025

160

Volcano Engine Launches Four Powerful Large Models, Voice Synthesis and Replication Features Upgraded

Volcano Engine launched four Doubao AI models at Wuhan AI Expo: upgraded 1.6 with four thinking lengths, lightweight 1.6lite, and new voice synthesis 2.0 & cloning 2.0, enhancing intelligence for flexible enterprise solutions.....

Oct 16, 2025

220

Google Launches Veo 3.1 Video Generation Model: New Audio Features and Fine-Grained Editing Capabilities

Google upgrades the video generation model Veo 3.1, improving audio output, editing control accuracy, and image-to-video quality, enabling more realistic videos and precise response to instructions. New features allow adding objects to videos and automatically matching the visual style. The ability to remove objects will be introduced in the Flow tool, enhancing editing flexibility.

Oct 16, 2025

230

Google AI Video Generation Tool Flow Upgraded: More Flexible Editing and Powerful Audio Features

Google upgraded AI video tool Flow with new lighting editing for realism & flexibility. Audio enhancements also allow natural content creation.....

Oct 16, 2025

180

Turing Award Winner Hinton: AI May Already Have Subjective Experiences, but Human Understanding of Consciousness Is Limited

AI pioneer Hinton presents a controversial view in an interview: current AI systems may already possess some form of subjective experience, but have not yet developed self-awareness. He emphasizes that the key lies in human misunderstanding of the nature of consciousness, rather than whether AI has awareness. At the same time, he reviews the development of AI from simple keyword matching to the present.

Oct 15, 2025

220

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Easily Identify Audio Forgery! Zhejiang University and Tsinghua University Join Forces to Create the AI Voice Privacy Protection Tool SafeEar

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Claude Client Updated, Supports Taking Screenshots and Sending to Claude, Quick Key Voice Communication

Sesame Completes $250 Million Series B Funding, Revolutionary AI Voice Attracts Millions of Users to Try, Test Version of the App Launches Concurrently

Claude Desktop Version Major Upgrade: Supports Sending Screenshots, Caps Lock Turns into an AI Voice Magic Key

Clone Your Voice in 10 Seconds! Fish Audio S1 Upgrade Makes a Stunning Debut, Priced at Just One-Sixth of ElevenLabs

Fish Audio Launches Upgraded S1 Voice Cloning Model: Clone Real Human Speech in 10 Seconds

Microsoft Deepens AI Strategy: Core Integration of Copilot in Windows 11 Supports Voice Control, Screen Analysis, and Local Automation

Volcano Engine Launches Four Powerful Large Models, Voice Synthesis and Replication Features Upgraded

Google Launches Veo 3.1 Video Generation Model: New Audio Features and Fine-Grained Editing Capabilities

Google AI Video Generation Tool Flow Upgraded: More Flexible Editing and Powerful Audio Features

Turing Award Winner Hinton: AI May Already Have Subjective Experiences, but Human Understanding of Consciousness Is Limited

GEO Services