ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique
Cited 2 time in scopus Download 7 time Share share facebook twitter linkedin kakaostory
Authors
Hoon-Young Cho, Sanghun Kim
Issue Date
2010-10
Citation
ETRI Journal, v.32, no.5, pp.795-800
ISSN
1225-6463
Publisher
한국전자통신연구원 (ETRI)
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.4218/etrij.10.1510.0062
Project Code
09MS4200, Development of Portable Korean-English Automatic Speech Translation Technology, Sanghun Kim
Abstract
Embedding a large vocabulary speech recognition system in mobile devices requires a reduced acoustic model obtained by eliminating redundant model parameters. In conventional optimization methods based on the minimum description length (MDL) criterion, a binary Gaussian tree is built at each state of a hidden Markov model by iteratively finding and merging similar mixture components. An optimal subset of the tree nodes is then selected to generate a downsized acoustic model. To obtain a better binary Gaussian tree by improving the process of finding the most similar Gaussian components, this paper proposes a new distance measure that exploits the difference in likelihood values for cases before and after two components are combined. The mixture weight of Gaussian components is also introduced in the component merging step. Experimental results show that the proposed method outperforms MDL-based optimization using either a Kullback-Leibler (KL) divergence or weighted KL divergence measure. The proposed method could also reduce the acoustic model size by 50% with less than a 1.5% increase in error rate compared to a baseline system. © 2010 ETRI.
KSP Keywords
Baseline system, Divergence measure, Kullback-Leibler (KL) divergence, Minimum description length (MDL) criterion, Mobile devices, Model parameter, Speech recognition system, acoustic model, distance measure, error rate, hidden Markov Model