Adobe Research, in collaboration with Northwestern University, has developed a groundbreaking AI system called Sketch2Sound. This technology can transform simple vocal imitations and text descriptions into professional-grade sound effects, potentially revolutionizing the way sound design is done in the industry.
The system analyzes three key properties of the vocal input: loudness, brightness (an aspect of timbre), and pitch. It then combines these time-varying signals with a text description to generate the desired sound.
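The paper's exact analysis pipeline isn't spelled out here, but a rough sketch of how such per-frame control curves could be extracted with the librosa library might look like the following; the file name, hop size, and pitch range are illustrative assumptions rather than Sketch2Sound's actual settings.

```python
import librosa

# Illustrative sketch: extract per-frame loudness, brightness, and pitch
# curves from a vocal imitation. File name, hop size, and pitch range are
# assumptions for this example, not Sketch2Sound's settings.
y, sr = librosa.load("vocal_imitation.wav", sr=22050, mono=True)
hop = 512

# Loudness: frame-wise RMS energy
loudness = librosa.feature.rms(y=y, hop_length=hop)[0]

# Brightness: spectral centroid, a common proxy for perceived brightness
brightness = librosa.feature.spectral_centroid(y=y, sr=sr, hop_length=hop)[0]

# Pitch: probabilistic YIN fundamental-frequency estimate (NaN where unvoiced)
f0, voiced_flag, voiced_prob = librosa.pyin(
    y,
    fmin=librosa.note_to_hz("C2"),
    fmax=librosa.note_to_hz("C6"),
    sr=sr,
    hop_length=hop,
)

# Each curve now holds one value per analysis frame and can act as a
# time-varying conditioning signal alongside a text prompt.
print(loudness.shape, brightness.shape, f0.shape)
```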
Video: García et al., Adobe Research
What makes Sketch2Sound interesting is its ability to understand context. For example, if someone enters the text prompt "forest ambiance" and makes short vocal sounds, the system infers that those sounds should become bird calls, without needing explicit instructions.
The same intelligence applies to music. When creating a drum pattern, users can enter "bass drum, snare drum" and then hum the rhythm with alternating low and high tones. The system automatically assigns the bass drum hits to the low tones and the snare hits to the high tones.
Providing Fine Control for Professionals
The research team built in an adjustable filtering step applied to the control curves, which governs how strictly the generated sound follows the input. Sound designers can choose precise, detail-following control or a looser, approximate interpretation, depending on their needs.
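One common way to realize this kind of adjustable precision is to smooth each control curve with a median filter whose window size sets how much temporal detail survives. The sketch below illustrates the idea; the window sizes are chosen purely for illustration and are not taken from the paper.

```python
import numpy as np
from scipy.signal import medfilt

def smooth_control(curve: np.ndarray, window: int) -> np.ndarray:
    """Median-filter a per-frame control curve (e.g. a loudness envelope).

    A small window keeps fine temporal detail; a large window keeps only the
    broad gesture, leaving the generator more freedom. Window sizes here are
    illustrative and not taken from the Sketch2Sound paper.
    """
    if window <= 1:
        return curve
    return medfilt(curve, kernel_size=window | 1)  # medfilt needs an odd kernel

# Example: a noisy loudness curve sampled once per frame
rng = np.random.default_rng(0)
loudness = np.abs(np.sin(np.linspace(0, 6 * np.pi, 400))) + 0.1 * rng.standard_normal(400)

precise = smooth_control(loudness, window=5)    # follows the input closely
loose = smooth_control(loudness, window=101)    # keeps only the rough shape
```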
This flexibility makes Sketch2Sound particularly valuable for sound designers, the professionals who create sound effects for movies and television shows. They can create effects more quickly from voice and text descriptions instead of physically manipulating props to record Foley-style sounds.
The researchers noted that the acoustics of the space in which a vocal imitation is recorded can sometimes carry over into the generated sounds in undesirable ways, an issue they are working to address. Adobe has not yet announced when, or whether, Sketch2Sound will become a commercial product.