ETRI Knowledge Sharing Platform : A Study on Speech Emotion Recognition Using a Deep Neural Network

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper A Study on Speech Emotion Recognition Using a Deep Neural Network

Cited 26 time in scopus

Citation: International Conference on Information and Communication Technology Convergence (ICTC) 2019, pp.1162-1165

Abstract: When using voice signals as input to a deep learning network, there may be myriad features depending on the method and purpose of extracting the voice signal features. Therefore, extraction of appropriate features should be conducted. In this study, verbal features necessary for speech emotion recognition (SER) and preprocessing features for a deep neural network are described in detail. We implemented various preprocessing methods using voice features. Also, a Keras-based deep neural network using Python libraries was implemented. With these features, we could obtain a test accuracy of 68.5 % using the deep neural network (DNN). As a result, we confirmed that the proposed DNN improved an accuracy by 30.1 % compared to a support vector machine (SVM).

KSP Keywords: Deep learning network, Deep neural network(DNN), Signal features, Speech Emotion recognition, Support VectorMachine(SVM), Voice features, Voice signal, deep learning(DL), neural network(NN), preprocessing methods, vector machine(LSSVM)

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.