ETRI-Knowledge Sharing Plaform



논문 검색
구분 SCI
연도 ~ 키워드


학술대회 Spoken English Fluency Scoring using Convolutional Neural Networks
Cited 5 time in scopus Download 2 time Share share facebook twitter linkedin kakaostory
정훈, 이윤경, 이성주, 박전규
Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA) 2017, pp.31-36
17HS5700, 언어학습을 위한 자유발화형 음성대화처리 원천기술 개발, 이윤근
In this paper, we propose a spoken English fluency scoring using Convolutional Neural Network (CNN) to learn feature extraction and scoring model jointly from raw time-domain signal input. In general, automatic spoken English fluency scoring is composed feature extraction and a scoring model. Feature extraction is used to compute the feature vectors that are assumed to represent spoken English fluency, and the scoring model predicts the fluency score of an input feature vector. Although the conventional approach works well, there are issues regarding feature extraction and model parameter optimization. First, because the fluency features are computed based on human knowledge, some crucial representations that are included in a raw data corpus can be missed. Second, each parameter of the model is optimized separately, which can lead to suboptimal performance. To address these issues, we propose a CNN-based approach to extract fluency features directly from a raw data corpus without hand-crafted engineering and optimizes all model parameters jointly. The effectiveness of the proposed approach is evaluated using Korean-Spoken English Corpus.
KSP 제안 키워드
Based Approach, Convolution neural network(CNN), Feature Vector, Feature extractioN, Human knowledge, Model parameter, Parameter optimization, Scoring model, raw data, time-domain