ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 Blind Rhythmic Source Separation : Nonnegativity and Repeatability
Cited 13 time in scopus Download 2 time Share share facebook twitter linkedin kakaostory
저자
김민제, 유지호, 강경옥, 최승진
발행일
201003
출처
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2010, pp.2006-2009
DOI
https://dx.doi.org/10.1109/ICASSP.2010.5495205
초록
An unsupervised method is proposed aiming at extracting rhythmic sources from commercial polyphonic music whose number of channels is limited to one. Commercial music signals are not usually provided with more than two channels while they often contain multiple instruments including singing voice. Therefore, instead of using conventional ways, such as modeling mixing environments or statistical characteristics, we should introduce other source-specific characteristics for separating or extracting the sources. In this paper, we concentrate on extracting rhythmic sources from the mixture with the other harmonic sources. An extension of nonnegative matrix factorization (NMF) is used to analyze multiple relationships between spectral and temporal properties in the given input matrices. Moreover, temporal repeatability of the rhythmic sound sources is implicated as common rhythmic property among segments of an input mixture signal. The proposed method shows acceptable, but not superior separation quality to the referred drum source separation systems. However, it has better applicability due to its blind manner in separation. ©2010 IEEE.
KSP 제안 키워드
Harmonic sources, Nonnegative Matrix Factorization(NMF), Separation system, Sound source, Statistical characteristics, Temporal properties, polyphonic music, singing voice, source separation, unsupervised method