ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Pretrained Network-based Sound Event Recognition for Audio Surveillance Applications
Cited 2 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Suwan Park, Geonwoo Kim
Issue Date
2021-10
Citation
International Conference on Information and Communication Technology Convergence (ICTC) 2021, pp.1306-1309
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICTC52510.2021.9621184
Abstract
Despite the recent surge in the demand on the audio recognition in surveillance systems, there are still many obstacles to readily use it in real environments such as legal restrictions on public data collection and difficulties in obtaining large-scale learning data. To overcome these problems, we propose an adaptive sound event recognition scheme based on a pre-trained network with the large-scale AudioSet, where PANNs-based CNN and SincNet[3] are employed to extract the audio features from log-mel spectrogram and waveform, respectively. Our experimental results show that the proposed method achieves mean average precision (mAP) of 0.415, which is slightly better than the best previous methods. Furthermore, we evaluate the performance of transfer leaning using a smaller amount of data collected by ourselves, considering dangerous situation scenarios in surveillance applications.
KSP Keywords
Audio Features, Audio recognition, Audio surveillance, Data Collection, Data collected, Large-scale learning, Learning data, Public Data, Sound event recognition, Surveillance applications, Surveillance system