ETRI Knowledge Sharing Platform: Development of Speech Emotion Recognition Algorithm using MFCC and Prosody

Conference Paper: Development of Speech Emotion Recognition Algorithm using MFCC and Prosody

Cited 14 times in Scopus

Citation: International Conference on Electronics, Information and Communication (ICEIC) 2020, pp.810-813

Abstract: In the field of human-computer interaction (HCI), speech emotion recognition (SER) remains a highly challenging task, and various models have been proposed to improve performance. In this paper, we use a GRU model, which achieves comparably high performance with fewer parameters. As input we use not only MFCC, delta, and acceleration features, but also the delta of acceleration. Additionally, we propose a novel input feature that captures these features in pairs simultaneously. Furthermore, we apply prosody, a low-level feature of speech, together with the MFCC features at every step of the GRU cell. Our model obtains a weighted accuracy of 64.47% using only audio input from both the improvised and scripted data in IEMOCAP.
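
The feature stack described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the regression-based delta formula, the window width of 2, the 13 MFCC coefficients, the 3 prosody values per frame, and the plain concatenation of prosody with the MFCC-derived features are all assumptions made for illustration, and the helper names `delta` and `stack_features` are hypothetical. The paired input feature mentioned in the abstract is not specified there and is not reproduced here.

```python
import numpy as np


def delta(feat, width=2):
    """Regression-based delta along the time axis.

    feat has shape (frames, coeffs). The formula and width are assumptions;
    the abstract does not say how the deltas were computed.
    """
    num_frames = feat.shape[0]
    # Repeat the first and last frames so every frame has a full window.
    padded = np.pad(feat, ((width, width), (0, 0)), mode="edge")
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = np.zeros_like(feat, dtype=float)
    for n in range(1, width + 1):
        ahead = padded[width + n : width + n + num_frames]
        behind = padded[width - n : width - n + num_frames]
        out += n * (ahead - behind)
    return out / denom


def stack_features(mfcc, prosody):
    """Build one input vector per frame (hypothetical layout)."""
    d1 = delta(mfcc)  # delta
    d2 = delta(d1)    # acceleration
    d3 = delta(d2)    # delta of acceleration
    # Assumption: prosody is concatenated frame by frame, so it is
    # available at every recurrent step alongside the MFCC features.
    return np.concatenate([mfcc, d1, d2, d3, prosody], axis=1)


# Random stand-ins for real features: 100 frames, 13 MFCCs, 3 prosody values.
rng = np.random.default_rng(0)
mfcc = rng.standard_normal((100, 13))
prosody = rng.standard_normal((100, 3))

frames = stack_features(mfcc, prosody)
print(frames.shape)  # (100, 55)
```

Each row of `frames` would then be fed to a recurrent cell such as a GRU, one row per time step.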

KSP Keywords: High performance, Less parameters, Speech Emotion recognition, human-computer interaction(HCI), low-level feature, recognition algorithm, various models
