ETRI Knowledge Sharing Platform : A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Journal Article A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique

Cited 2 time in scopus

Download 108 time Share share

Abstract: Embedding a large vocabulary speech recognition system in mobile devices requires a reduced acoustic model obtained by eliminating redundant model parameters. In conventional optimization methods based on the minimum description length (MDL) criterion, a binary Gaussian tree is built at each state of a hidden Markov model by iteratively finding and merging similar mixture components. An optimal subset of the tree nodes is then selected to generate a downsized acoustic model. To obtain a better binary Gaussian tree by improving the process of finding the most similar Gaussian components, this paper proposes a new distance measure that exploits the difference in likelihood values for cases before and after two components are combined. The mixture weight of Gaussian components is also introduced in the component merging step. Experimental results show that the proposed method outperforms MDL-based optimization using either a Kullback-Leibler (KL) divergence or weighted KL divergence measure. The proposed method could also reduce the acoustic model size by 50% with less than a 1.5% increase in error rate compared to a baseline system. © 2010 ETRI.

KSP Keywords: Baseline system, Divergence measure, Hidden markov model(HMM), Kullback-Leibler (KL) divergence, Minimum description length (MDL) criterion, Mobile devices, Model parameter, Optimization methods, Speech recognition system, acoustic model, distance measure

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.