ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술지 Cross-Corpus Speech Emotion Recognition Based on Few-Shot Learning and Domain Adaptation
Cited 23 time in scopus Download 1 time Share share facebook twitter linkedin kakaostory
저자
안영도, 이성주, 신종원
발행일
202106
출처
IEEE Signal Processing Letters, v.28, pp.1190-1194
ISSN
1070-9908
출판사
IEEE
DOI
https://dx.doi.org/10.1109/LSP.2021.3086395
협약과제
21HS3400, 다중 화자간 대화 음성인식 기술개발, 박전규
초록
Within a single speech emotion corpus, deep neural networks have shown decent performance in speech emotion recognition. However, the performance of the emotion recognition based on data-driven learning methods degrades significantly for the cross-corpus scenario. To relieve this issue without any labeled samples from the target domain, we propose a cross-corpus speech emotion recognition based on few-shot learning and unsupervised domain adaptation, which is trained to learn the class (emotion) similarity from the source domain samples adapted to the target domain. In addition, we utilize multiple corpora in training to enhance the robustness of the emotion recognition to the unseen samples. Experiments on emotional speech corpora with three different languages showed that the proposed method outperformed other approaches.
KSP 제안 키워드
Deep neural network(DNN), Labeled samples, Learning methods, Source Domain, Speech Emotion recognition, Speech corpora, Target domain, Unsupervised domain adaptation, cross-corpus, data-driven learning, emotional speech