ETRI Knowledge Sharing Platform : Sound Source Localization Based on GCC-PHAT With Diffuseness Mask in Noisy and Reverberant Environments

BROWSE

Titles

논문 검색
Type		SCI
Year	~	Keyword

Detail

List

Journal Article Sound Source Localization Based on GCC-PHAT With Diffuseness Mask in Noisy and Reverberant Environments

Cited 51 time in scopus

Download 266 time Share share

Authors: Ran Lee, Min-Seok Kang, Bo-Hyun Kim, Kang-Ho Park, Sung Q Lee, Hyung-Min Park

Issue Date: 2020-01

Citation: IEEE Access, v.8, pp.7373-7382

ISSN: 2169-3536

Publisher: IEEE

Language: English

Type: Journal Article

DOI: https://dx.doi.org/10.1109/ACCESS.2019.2963768

Abstract: Although sound source localization is a desirable technique in many communication systems and intelligence applications, the distortion caused by diffuse noise or reverberation makes the time delay estimation (TDE) between signals acquired by a pair of microphones a complicated and challenging problem. In this paper, we describe a method that can efficiently achieve sound source localization in noisy and reverberant environments. This method is based on the generalized cross-correlation (GCC) function with phase transform (PHAT) weights (GCC-PHAT) to achieve robustness against reverberation. In addition, to estimate the time delay robust to diffuse components and to further improve the robustness of the GCC-PHAT against reverberation, time-frequency(t-f) components of observations directly emitted by a point source are chosen by 'inversed' diffuseness. The diffuseness that can be estimated from the coherent-to-diffuse power ratio (CDR) based on spatial coherence between two microphones represents the contribution of diffuse components on a scale of zero to one with direct sounds from a source modeled to be fully coherent. In particular, the 'inversed' diffuseness is binarized with a very rigorous threshold to select highly reliable components for accurate TDE even in noisy and reverberant environments. Experimental results for both simulated and real-recorded data consistently demonstrated the robustness of the presented method against diffuse noise and reverberation.

KSP Keywords: Communication system, GCC-PHAT, Generalized Cross-correlation, Phase transform, Spatial coherence, diffuse noise, highly reliable, point source, power ratio, reverberant environments, sound source localization

This work is distributed under the term of Creative Commons License (CCL)
(CC BY)

ETRI

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.

제1유형

ETRI-Knowledge Sharing Plaform

BROWSE

Titles

Detail

ETRI