ETRI Knowledge Sharing Platform : Multi-level Stereo Attention Model for Center Channel Extraction

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper Multi-level Stereo Attention Model for Center Channel Extraction

Cited 3 time in scopus

Citation: International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) 2019, pp.1-4

Abstract: In recent years, the spatial audio reproduction of digital media has become popular. Despite the demand for such spatial audio content, very little content is produced with multi-channel audio. Moreover, it is difficult to provide interactive services to users owing to the lack of object-based content. In this paper, we propose a center channel extraction method based on a multi-level convolutional neural network structure to generate object-based content. In addition, we present a novel stereo attention model which considers each channel's characteristics. By applying the proposed method to stereo audio content, we achieve better extraction performance than existing commercial application.

KSP Keywords: Attention model, Commercial application, Convolution neural network(CNN), Digital Media, Extraction method, Interactive services, Multi-level, Multichannel audio, Neural network structure, Object-based, S characteristics

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.