ETRI Knowledge Sharing Platform : I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System


Conference Paper I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System

Cited 4 times in Scopus

Authors: Woo Yong Choi, Hwa Jeon Song, Hoon Chung, Jeomja Kang, Jeon Gue Park

Citation: International Conference on Computer Communication and the Internet (ICCCI) 2016, pp.316-319

Abstract: This paper proposes a new Utterance Verification (UV) algorithm based on i-vectors. Phone segments are extracted from the training data and concatenated, then used to train the Universal Background Model (UBM) and the Total Variability (TV) matrix. I-vectors are then extracted from the enrollment and evaluation data using the UBM and the TV matrix. We compare two Confidence Measures (CMs): cosine distance scoring and a Support Vector Machine (SVM). To compensate for channel effects, we use two channel compensation methods: Linear Discriminant Analysis (LDA) and Within-Class Covariance Normalization (WCCN). The decision is made with a word-level CM obtained by combining the phone-level CMs. Experiments are conducted in the Korean isolated word recognition domain. The results show that the SVM is superior to cosine distance scoring, and the best performance is achieved when the SVM is used without any channel compensation method.
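
As a rough illustration of the cosine scoring and phone-to-word combination mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. The function names, the arithmetic-mean combination rule, and the threshold value are all assumptions made for illustration; the abstract does not state how phone-level CMs are combined or how the decision threshold is chosen.

```python
import math
from typing import Sequence, Tuple

Vector = Sequence[float]


def cosine_score(enrolled: Vector, test: Vector) -> float:
    """Cosine similarity between two i-vectors (higher means more similar)."""
    if len(enrolled) != len(test):
        raise ValueError("i-vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(enrolled, test))
    norm = math.sqrt(sum(a * a for a in enrolled)) * math.sqrt(sum(b * b for b in test))
    if norm == 0.0:
        raise ValueError("i-vectors must be non-zero")
    return dot / norm


def word_confidence(phone_scores: Sequence[float]) -> float:
    """Combine phone-level CMs into one word-level CM.

    Assumption: a plain arithmetic mean. The abstract only says the
    phone-level CMs are combined, not which rule is used.
    """
    if not phone_scores:
        raise ValueError("at least one phone-level score is required")
    return sum(phone_scores) / len(phone_scores)


def verify_word(phone_pairs: Sequence[Tuple[Vector, Vector]], threshold: float = 0.5) -> bool:
    """Accept the word if its word-level CM reaches the (hypothetical) threshold.

    Each pair holds the enrolled i-vector for a phone and the i-vector
    extracted from the matching segment of the utterance under test.
    """
    scores = [cosine_score(enrolled, test) for enrolled, test in phone_pairs]
    return word_confidence(scores) >= threshold
```

In this sketch, a word whose phone segments all match their enrolled i-vectors closely receives a word-level CM near 1 and is accepted, while poorly matching segments pull the mean down toward rejection. Channel compensation with LDA or WCCN, when used, would be applied as a linear projection of the i-vectors before scoring.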

KSP Keywords: Best performance, Channel effect, Compensation method, Confidence measure, Cosine Distance, I-vector, Isolated Word Recognition, Speech recognition system, Support Vector Machine (SVM), Universal Background Model (UBM), Vector based
