ETRI Knowledge Sharing Platform: Deep Neural Network-Based Acoustic Modeling for Speech Recognition of Native and Foreign Speakers

Journal Article: Deep Neural Network-Based Acoustic Modeling for Speech Recognition of Native and Foreign Speakers

Abstract: This paper proposes a new method for training deep neural network (DNN)-based acoustic models for speech recognition of native and foreign speakers. The method has three steps: determining multi-set state clusters with various acoustic properties, training a DNN-based acoustic model, and recognizing speech with that model. The hidden nodes of the DNN are shared, while the output nodes are kept separate to accommodate the different acoustic properties of native and foreign speech. In an English speech recognition task with Korean and English speakers, the proposed method slightly improves recognition accuracy over the conventional multi-condition training method.
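
The shared-hidden, separate-output structure described in the abstract can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the class name `SharedHiddenMultiHeadDNN`, the head names `native` and `foreign`, the layer sizes, the ReLU activation, and the per-head softmax are all assumptions made for illustration, and the state clustering and training steps are not shown.

```python
import math
import random


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, bias, x):
    """Affine map: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_layer(rng, n_in, n_out):
    """Small random weights and zero biases (hypothetical initialisation)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class SharedHiddenMultiHeadDNN:
    """Hidden layers shared by all heads; one output layer per state set."""

    def __init__(self, n_features, hidden_sizes, head_sizes, seed=0):
        rng = random.Random(seed)
        self.hidden = []
        n_in = n_features
        for n_out in hidden_sizes:
            self.hidden.append(init_layer(rng, n_in, n_out))
            n_in = n_out
        # Assumption: each head covers its own set of clustered states.
        self.heads = {name: init_layer(rng, n_in, n_states)
                      for name, n_states in head_sizes.items()}

    def hidden_activations(self, x):
        """Run the shared hidden stack (ReLU is an assumed activation)."""
        h = x
        for weights, bias in self.hidden:
            h = [max(0.0, v) for v in linear(weights, bias, h)]
        return h

    def posteriors(self, x, head):
        """State posteriors from the chosen head for one feature frame."""
        weights, bias = self.heads[head]
        return softmax(linear(weights, bias, self.hidden_activations(x)))


# Hypothetical sizes: 4 input features, 5 native states, 3 foreign states.
model = SharedHiddenMultiHeadDNN(
    n_features=4,
    hidden_sizes=[8, 8],
    head_sizes={"native": 5, "foreign": 3},
)
frame = [0.1, -0.2, 0.3, 0.05]
native_post = model.posteriors(frame, "native")
foreign_post = model.posteriors(frame, "foreign")
```

Both heads read the same hidden representation, so in a setup like this the hidden layers would see data from both speaker groups during training, while each output layer only has to model its own state set.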

KSP Keywords: Acoustic properties, DNN-based acoustic model, Deep neural network (DNN), Foreign speech, Hidden nodes, Multi-condition training, Multi-set, Neural network (NN), New method, Recognition accuracy, Speech recognition

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr
