Researchers at the Musashino University School of Data Science have developed a new algorithm called AMT-APC that automatically generates piano covers with greater precision. The technology leverages Automatic Music Transcription (AMT) models, fine-tuning them to better capture musical nuance and expressiveness and thereby produce piano renditions that closely track the original pieces.

Historically, automatically generating piano music has been limited by poor sound fidelity and a lack of expressive depth: existing models often produce only simple melodies and rhythms, failing to capture the rich detail and emotion of the original pieces. The AMT-APC algorithm takes a different approach. It first uses a pre-trained AMT model to accurately "capture" the individual sounds in a recording, and then fine-tunes this model for the Automatic Piano Cover (APC) task.

The core of the AMT-APC algorithm lies in its two-step strategy:

Step one: Pre-training. Researchers selected a high-performance AMT model named hFT-Transformer as the foundation and further trained it using the MAESTRO dataset to handle longer musical segments.

Step two: Fine-tuning. Researchers created a paired dataset containing original audio and piano performance MIDI files and used this dataset to fine-tune the AMT model, enabling it to generate piano performances that closely align with the style of the original pieces.
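The two steps above can be sketched as a toy experiment. In this hedged illustration, a plain linear model stands in for the hFT-Transformer, and the data and variable names are entirely hypothetical; the point is only the pre-train-then-fine-tune structure:

```python
import numpy as np

rng = np.random.default_rng(0)

def train(w, X, Y, lr=0.1, steps=200):
    """Plain gradient descent on mean-squared error."""
    for _ in range(steps):
        grad = X.T @ (X @ w - Y) / len(X)
        w = w - lr * grad
    return w

# Stage 1: "pre-training" on a transcription-style task
# (stand-in audio features -> stand-in note targets).
X = rng.normal(size=(64, 8))
true_w = rng.normal(size=(8, 4))
Y_amt = X @ true_w
w_pre = train(np.zeros((8, 4)), X, Y_amt)

# Stage 2: "fine-tuning" on the related cover-generation task,
# starting from the pre-trained weights rather than from scratch.
Y_apc = X @ (true_w + 0.1 * rng.normal(size=true_w.shape))
w_ft = train(w_pre, X, Y_apc, lr=0.05, steps=100)

loss_ft = float(np.mean((X @ w_ft - Y_apc) ** 2))
```

The sketch captures why the strategy works: because transcription and cover generation are closely related tasks, the pre-trained weights start the model near the fine-tuning optimum, so the second stage converges faster and further than training from scratch.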

To enhance the expressiveness of the generated piano music, researchers introduced a concept called "style vectors." Style vectors are a set of features extracted from each piano performance, including note onset rate distribution, velocity distribution, and pitch distribution. By inputting style vectors along with the original audio into the model, the AMT-APC algorithm can learn different performance styles and reflect them in the generated piano music.
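A style vector of this kind could be extracted roughly as follows. This is a hedged sketch: the feature set (onset-rate, velocity, and pitch distributions) follows the description above, but the bin counts, 1-second windowing, and normalization are illustrative assumptions, not the paper's exact recipe:

```python
import numpy as np

def style_vector(notes, n_rate_bins=8, n_vel_bins=8, n_pitch_bins=8):
    """notes: list of (onset_sec, pitch 0-127, velocity 0-127) events."""
    onsets = np.array([n[0] for n in notes])
    pitches = np.array([n[1] for n in notes])
    vels = np.array([n[2] for n in notes])

    # Onset-rate distribution: count notes per 1 s window,
    # then histogram those per-window counts.
    duration = max(float(onsets.max()), 1.0)
    rates, _ = np.histogram(onsets, bins=int(np.ceil(duration)))
    rate_hist, _ = np.histogram(rates, bins=n_rate_bins, range=(0, 16))

    # Velocity and pitch distributions over the full MIDI range.
    vel_hist, _ = np.histogram(vels, bins=n_vel_bins, range=(0, 128))
    pitch_hist, _ = np.histogram(pitches, bins=n_pitch_bins, range=(0, 128))

    v = np.concatenate([rate_hist, vel_hist, pitch_hist]).astype(float)
    return v / v.sum()  # normalize to a probability-like vector

# Toy usage: four notes of an arpeggiated C major chord.
notes = [(0.0, 60, 80), (0.5, 64, 90), (1.0, 67, 70), (1.5, 72, 100)]
vec = style_vector(notes)
```

Conditioning the model on such a vector alongside the audio lets one trained model reproduce multiple performance styles at inference time by swapping the vector.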

Experimental results show that, compared to existing automatic piano cover models, the AMT-APC algorithm significantly improves both sound fidelity and expressiveness. On Qmax, a metric that evaluates the similarity between the original piece and the generated audio, the AMT-APC model achieved the lowest value, indicating its superior ability to replicate the characteristics of the original piece.
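For intuition, a much-simplified Qmax-style computation (after Serra et al.'s cover-song similarity measure) is sketched below: a binary cross-recurrence matrix between two per-frame feature sequences, followed by dynamic programming over gap-tolerant diagonal paths. The similarity threshold, gap penalties, and the final square-root-normalized distance convention (lower = more similar) are all assumptions for illustration, not the paper's exact evaluation code:

```python
import numpy as np

def qmax_distance(A, B, thresh=0.9, gap_onset=0.5, gap_ext=0.5):
    """A: (n, d) and B: (m, d) per-frame features (e.g. chroma vectors)."""
    # Cosine similarity between every frame pair, thresholded into a
    # binary cross-recurrence matrix.
    An = A / np.linalg.norm(A, axis=1, keepdims=True)
    Bn = B / np.linalg.norm(B, axis=1, keepdims=True)
    match = (An @ Bn.T) > thresh

    # DP: extend diagonal runs, allowing small penalized gaps.
    n, m = match.shape
    Q = np.zeros((n + 1, m + 1))
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if match[i - 1, j - 1]:
                Q[i, j] = max(Q[i - 1, j - 1],
                              Q[i - 2, j - 1] if i > 1 else 0.0,
                              Q[i - 1, j - 2] if j > 1 else 0.0) + 1.0
            else:
                Q[i, j] = max(0.0,
                              Q[i - 1, j - 1] - gap_onset,
                              Q[i - 2, j - 1] - gap_ext if i > 1 else 0.0,
                              Q[i - 1, j - 2] - gap_ext if j > 1 else 0.0)

    score = Q.max()  # length of the best aligned stretch
    # Assumed distance convention: lower values = more similar recordings.
    return float(np.sqrt(m) / max(score, 1e-9))
```

Under this convention, a recording compared against itself yields a long aligned path and hence a small distance, while unrelated recordings yield almost no matches and a large distance.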

This study demonstrates that the AMT and APC tasks are closely related, and that leveraging existing AMT research can help develop more advanced APC models. In the future, the researchers plan to explore AMT models better suited to APC applications, aiming for more realistic and expressive automatic piano covers.

Project link: https://misya11p.github.io/amt-apc/

Paper link: https://arxiv.org/pdf/2409.14086