ETRI Knowledge Sharing Platform : End-to-end Korean Digits Speech Recognition

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper End-to-end Korean Digits Speech Recognition

Cited 1 time in scopus

Citation: International Conference on Information and Communication Technology Convergence (ICTC) 2019, pp.1137-1139

Abstract: The traditional speech recognition model consisting of an acoustic model and a language model is mainly used. Recently, an end-to-end speech recognition model consisting of a single integrated neural network model is being studied. This model has the advantage that it does not require a lot of training and it is easy to understand the structure of the model. In this paper, we designed the end-to-end model for Korean digit speech recognition and showed the performance results. We tried the digit speech recognition model in two forms: Word model and character model.

KSP Keywords: End to End(E2E), End-to-End Speech Recognition, Integrated neural network, Neural network model, Recognition model, acoustic model, language models, neural network(NN)

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.