ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper A Study of Audio Mixing Methods for Piano Transcription in Violin-Piano Ensembles
Cited 3 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Hyemi Kim, Jiyun Park, Taegyun Kwon, Dasaem Jeong, Juhan Nam
Issue Date
2023-06
Citation
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023, pp.1-5
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICASSP49357.2023.10095061
Abstract
While piano music transcription models have shown high performance for solo piano recordings, their performance de-grades when applied to ensemble recordings. This study aims to analyze the impact of different data augmentation methods on piano transcription performance, specifically focusing on mixing techniques applied to violin-piano ensembles. We apply mixing methods that consider both harmonic and temporal characteristics of the audio. To create datasets for this study, we generated the PFVN-synth dataset, which contains 7 hours of violin-piano ensemble audio by rendering MIDI files and corresponding labels, and also collected unaccompanied violin recordings and mixed them with the MAESTRO dataset. We evaluated the transcription results on both synthesized and real audio recordings datasets.
KSP Keywords
Data Augmentation, High performance, Temporal characteristics, music transcription