ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Development of Speech Emotion Recognition Algorithm using MFCC and Prosody
Cited 11 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Hyejin Koo, Soyeong Jeong, Sungjae Yoon, Wonjong Kim
Issue Date
2020-01
Citation
International Conference on Electronics, Information and Communication (ICEIC) 2020, pp.810-813
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICEIC49074.2020.9051281
Abstract
Recently, in the field of Human Computer Interaction (HCI), speech emotion recognition (SER) is a highly challenging work. Various models have been proposed for better performance. In this paper, we use GRU model, which achieves comparably high performance with less parameters. We used not only MFCC, delta, and acceleration, but also delta of acceleration. Additionally, we propose the novel input feature that captures their pair simultaneously. Furthermore, we applied the prosody, the low-level feature of speech, for every step in GRU cell with MFCC feature. Our model obtained 64.47% of weighted accuracy, using only audio input from both of improvised and scripted data in IEMOCAP.
KSP Keywords
High performance, Human computer interaction, Less parameters, Low-level feature, Speech Emotion recognition, recognition algorithm, various models