ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 Development of Speech Emotion Recognition Algorithm using MFCC and Prosody
Cited 6 time in scopus Download 2 time Share share facebook twitter linkedin kakaostory
저자
구혜진, 정소영, 윤성재, 김원종
발행일
202001
출처
International Conference on Electronics, Information and Communication (ICEIC) 2020, pp.810-813
DOI
https://dx.doi.org/10.1109/ICEIC49074.2020.9051281
협약과제
19ZT1100, 수도권 지역산업기반 ICT융합기술 지원사업, 나중찬
초록
Recently, in the field of Human Computer Interaction (HCI), speech emotion recognition (SER) is a highly challenging work. Various models have been proposed for better performance. In this paper, we use GRU model, which achieves comparably high performance with less parameters. We used not only MFCC, delta, and acceleration, but also delta of acceleration. Additionally, we propose the novel input feature that captures their pair simultaneously. Furthermore, we applied the prosody, the low-level feature of speech, for every step in GRU cell with MFCC feature. Our model obtained 64.47% of weighted accuracy, using only audio input from both of improvised and scripted data in IEMOCAP.
KSP 제안 키워드
High performance, Less parameters, Low-level feature, Recognition algorithm, Speech Emotion recognition, human computer interaction(HCI), various models