ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper A Comparative Study of AI Models for Depression Assessment using Voice Features
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Eunkyoung Jeon, Sehwan Moon, Seihyoung Lee, Aram Lee, Sanghyun Kim, Kiwon Park, Jeong Eun Kim
Issue Date
2023-12
Citation
International Conference on Bioinformatics and Biomedicine (BIBM) 2023, pp.4898-4899
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/BIBM58861.2023.10385900
Abstract
Recently, there has been a significant amount of research conducted on classifying depression by extracting features from voice data. In this study, we enhanced the dataset by segmenting the voice data into 4-second intervals, followed by the extraction of voice features using MFCC (Mel Frequency Cepstral Coefficients) and Mel-spectrogram. After extracting these voice features, we compared the performance depending on the models and found that using MFCC features and classifying depression with the XGBoost model yielded the best performance. Looking forward, we aim to enhance the model’s performance by utilizing multimodal data, including voice, video, and text data.
KSP Keywords
Best performance, MFCC features, Mel-frequency Cepstral Coefficient(MFCC), Voice Data, Voice features, comparative study, multimodal data, text data