ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Blind rhythmic source separation: Nonnegativity and repeatability
Cited 13 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Min Je Kim, Ji Ho Yoo, Kyeong Ok Kang, Seung Jin Choi
Issue Date
2010-03
Citation
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2010, pp.2006-2009
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICASSP.2010.5495205
Abstract
An unsupervised method is proposed aiming at extracting rhythmic sources from commercial polyphonic music whose number of channels is limited to one. Commercial music signals are not usually provided with more than two channels while they often contain multiple instruments including singing voice. Therefore, instead of using conventional ways, such as modeling mixing environments or statistical characteristics, we should introduce other source-specific characteristics for separating or extracting the sources. In this paper, we concentrate on extracting rhythmic sources from the mixture with the other harmonic sources. An extension of nonnegative matrix factorization (NMF) is used to analyze multiple relationships between spectral and temporal properties in the given input matrices. Moreover, temporal repeatability of the rhythmic sound sources is implicated as common rhythmic property among segments of an input mixture signal. The proposed method shows acceptable, but not superior separation quality to the referred drum source separation systems. However, it has better applicability due to its blind manner in separation. ©2010 IEEE.
KSP Keywords
Harmonic sources, Nonnegative Matrix Factorization(NMF), Separation system, Sound source, Statistical characteristics, Temporal properties, polyphonic music, singing voice, source separation, unsupervised method