ETRI-Knowledge Sharing Platform

Cross-Corpus Speech Emotion Recognition Based on Few-Shot Learning and Domain Adaptation
Cited 39 times in Scopus
Authors
Youngdo Ahn, Sung Joo Lee, Jong Won Shin
Issue Date
2021-06
Citation
IEEE Signal Processing Letters, v.28, pp.1190-1194
ISSN
1070-9908
Publisher
IEEE
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1109/LSP.2021.3086395
Abstract
Within a single speech emotion corpus, deep neural networks have shown decent performance in speech emotion recognition. However, the performance of emotion recognition based on data-driven learning methods degrades significantly in the cross-corpus scenario. To relieve this issue without any labeled samples from the target domain, we propose a cross-corpus speech emotion recognition method based on few-shot learning and unsupervised domain adaptation, which is trained to learn the class (emotion) similarity from source domain samples adapted to the target domain. In addition, we utilize multiple corpora in training to enhance the robustness of the emotion recognition to unseen samples. Experiments on emotional speech corpora in three different languages showed that the proposed method outperformed other approaches.
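
The abstract names two ingredients, similarity-based (few-shot) classification and unsupervised domain adaptation, without giving architectural details here. The following is a minimal sketch of how the two ideas can fit together, not the authors' method. It assumes class prototypes with cosine similarity as the few-shot component and per-domain feature standardization as a simple label-free stand-in for domain adaptation. All function names and the toy two-dimensional "features" are hypothetical and chosen only for illustration.

```python
import math


def standardize(features):
    """Zero-mean, unit-variance scaling per dimension within one domain.

    Assumed stand-in for unsupervised domain adaptation: it uses only the
    feature statistics of the domain, never its labels.
    """
    n, dim = len(features), len(features[0])
    means = [sum(row[d] for row in features) / n for d in range(dim)]
    stds = [
        math.sqrt(sum((row[d] - means[d]) ** 2 for row in features) / n) or 1.0
        for d in range(dim)
    ]
    return [[(row[d] - means[d]) / stds[d] for d in range(dim)] for row in features]


def class_prototypes(features, labels):
    """Average the labeled source features of each emotion class."""
    sums, counts = {}, {}
    for row, label in zip(features, labels):
        acc = sums.setdefault(label, [0.0] * len(row))
        for d, value in enumerate(row):
            acc[d] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def cosine(a, b):
    """Cosine similarity between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(query, prototypes):
    """Pick the emotion whose prototype is most similar to the query."""
    return max(prototypes, key=lambda label: cosine(query, prototypes[label]))


# Toy data (hypothetical): the target corpus is a scaled and shifted copy of
# the source corpus, mimicking a mismatch in recording conditions.
source = [[1.0, 2.0], [1.2, 2.2], [3.0, 4.0], [3.2, 4.2]]
source_labels = ["angry", "angry", "neutral", "neutral"]
target = [[2 * x + 10 for x in row] for row in source]  # labels never used

# Without adaptation: prototypes and queries live in mismatched feature spaces.
raw_prototypes = class_prototypes(source, source_labels)
raw_predictions = [classify(q, raw_prototypes) for q in target]

# With adaptation: each domain is standardized with its own statistics first.
adapted_prototypes = class_prototypes(standardize(source), source_labels)
adapted_predictions = [classify(q, adapted_prototypes) for q in standardize(target)]

print("raw:    ", raw_predictions)
print("adapted:", adapted_predictions)
```

In this toy setting the unadapted classifier assigns every target sample to one class, while the standardized features recover the intended grouping. A learned feature extractor and a trained adaptation objective would replace both stand-ins in any realistic system.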
KSP Keywords
Deep neural network (DNN), Labeled samples, Learning methods, Source domain, Speech emotion recognition, Speech corpora, Target domain, Cross-corpus, Data-driven learning, Emotional speech, Neural network (NN)