ETRI Knowledge Sharing Platform: Semi-Supervised Learning of a Speech Recognizer Using a Variational Autoencoder and Unsupervised Data Augmentation

Journal Article: Semi-Supervised Learning of a Speech Recognizer Using a Variational Autoencoder and Unsupervised Data Augmentation

Abstract: We propose a semi-supervised learning method that combines a Variational AutoEncoder (VAE) with Unsupervised Data Augmentation (UDA) to improve the performance of an end-to-end speech recognizer. The method has three stages. First, a VAE-based augmentation model and a baseline end-to-end speech recognizer are trained on the original speech data. Second, the baseline recognizer is retrained on data generated by the trained augmentation model. Finally, the augmentation model and the recognizer are both retrained with the UDA-based semi-supervised learning method. Computer simulations show that the augmentation model reduces the Word Error Rate (WER) of the baseline end-to-end speech recognizer, and that combining it with UDA-based learning improves performance further.
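
The abstract does not state the training objective, so the following is only a minimal sketch of what a UDA-style semi-supervised loss generally looks like: a supervised cross-entropy term on labeled data plus a consistency term (KL divergence) between the model's predictions on an unlabeled input and on its augmented version. Everything here is an assumption made for illustration: the function names, the weight `lam`, and the simplification of the recognizer's output to a single classification per example (a real end-to-end recognizer predicts sequences). In the proposed method the augmented inputs would come from the VAE-based augmentation model; here the logits are simply passed in.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-likelihood of the integer label under softmax(logits)."""
    return -math.log(softmax(logits)[label])


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * (math.log(pi + eps) - math.log(qi + eps))
               for pi, qi in zip(p, q))


def uda_loss(labeled, unlabeled, lam=1.0):
    """Hypothetical UDA-style objective (illustrative, not the paper's code).

    labeled:   list of (logits, label) pairs from labeled examples
    unlabeled: list of (logits_original, logits_augmented) pairs, i.e. the
               model's outputs for an unlabeled input and its augmentation
    lam:       assumed weight on the consistency term
    """
    supervised = sum(cross_entropy(lg, y) for lg, y in labeled) / len(labeled)
    consistency = sum(
        kl_divergence(softmax(orig), softmax(aug)) for orig, aug in unlabeled
    ) / len(unlabeled)
    return supervised + lam * consistency


# Toy usage: one labeled example and one unlabeled pair.
labeled = [([2.0, 0.0, 0.0], 0)]
unlabeled = [([1.0, 0.0, 0.0], [0.5, 0.5, 0.0])]
loss = uda_loss(labeled, unlabeled, lam=1.0)
```

The consistency term is zero when the predictions for the original and augmented inputs agree, so it penalizes the recognizer only when augmentation changes its output. That is how unlabeled speech can contribute to training without transcripts.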

KSP Keywords: Computer simulation(MC and MD), Data Augmentation, End to End(E2E), Semi-Supervised Learning(SSL), Semi-Supervised learning method, Word Error Rate

This work is distributed under the terms of a Creative Commons License (CC BY-NC).

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr
