ETRI Knowledge Sharing Platform : Automatic Detection of Malicious Sound Using Segmental Two-Dimensional Mel-Frequency Cepstral Coefficients and Histograms of Oriented Gradients

BROWSE

Titles

논문 검색
Type		SCI
Year	~	Keyword

Detail

List

Conference Paper Automatic Detection of Malicious Sound Using Segmental Two-Dimensional Mel-Frequency Cepstral Coefficients and Histograms of Oriented Gradients

Cited 5 time in scopus

Authors: Myung Jong Kim, Young Gwan Kim, Jae Deok Lim, Hoi Rin Kim

Issue Date: 2010-10

Citation: International Conference on Multimedia (MM) 2010, pp.887-890

Language: English

Type: Conference Paper

DOI: https://dx.doi.org/10.1145/1873951.1874104

Abstract: This paper addresses the problem of recognizing malicious sounds, such as sexual scream or moan, to detect and block the objectionable multimedia contents. The malicious sounds show the distinct characteristics that have large temporal variations and fast spectral transitions. Therefore, extracting appropriate features to properly represent these characteristics is important in achieving a better performance. In this paper, we employ segment-based two-dimensional Mel-frequency cepstral coefficients and histograms of gradient directions as a feature set to characterize both the temporal variations and spectral transitions within a long-range segment of the target signal. Gaussian mixture model (GMM) is adopted to statistically represent the malicious and non-malicious sounds, and the test sounds are classified by a maximum a posterior probability (MAP) method. Evaluation of the proposed feature extraction method on a database of several hundred malicious and non-malicious sound clips yielded precision of 91.31% and recall of 94.27%. This result suggests that this approach could be used as an alternative to the image-based methods. © 2010 ACM.

KSP Keywords: Automatic Detection, Gaussian Mixture Models(GMM), Gaussian mixture(GM), Histograms of Oriented Gradients, Long range, Maximum a Posterior(MAP), Mel-frequency Cepstral Coefficient(MFCC), Multimedia contents, Segment-based, Temporal variation, feature extraction method

ETRI

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.

제1유형

ETRI-Knowledge Sharing Plaform

BROWSE

Titles

Detail

ETRI