ETRI Knowledge Sharing Platform

Details

Journal article: Speech Enhancement Using Phase-Dependent A Priori SNR Estimator in Log-Mel Spectral Domain
Cited 5 times in Scopus. Downloaded 18 times.
Authors
이윤경, 박전규, 이윤근, 권오욱
Publication date
October 2014
Source
ETRI Journal, v.36 no.5, pp.721-729
ISSN
1225-6463
Publisher
Electronics and Telecommunications Research Institute (ETRI)
DOI
https://dx.doi.org/10.4218/etrij.14.2214.0039
Funded project
14MS1500, Development of Natural-Language Voice Interface Technology Applying a Dialogue Model on Mobile Platforms, 이윤근
Abstract
We propose a novel phase-based method for single-channel speech enhancement to extract and enhance the desired signals in noisy environments by utilizing the phase information. In the method, a phase-dependent a priori signal-to-noise ratio (SNR) is estimated in the log-mel spectral domain to utilize both the magnitude and phase information of input speech signals. The phase-dependent estimator is incorporated into the conventional magnitude-based decision-directed approach that recursively computes the a priori SNR from noisy speech. Additionally, we reduce the performance degradation owing to the one-frame delay of the estimated phase-dependent a priori SNR by using a minimum mean square error (MMSE)-based and maximum a posteriori (MAP)-based estimator. In our speech enhancement experiments, the proposed phase-dependent a priori SNR estimator is shown to improve the output SNR by 2.6 dB for both the MMSE-based and MAP-based estimator cases as compared to a conventional magnitude-based estimator.
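For orientation, the conventional magnitude-based decision-directed approach mentioned in the abstract can be sketched roughly as follows. This is a minimal illustration of the standard textbook recursion, not the paper's phase-dependent estimator. The function name `decision_directed_snr`, the smoothing factor `alpha`, the floor `xi_min`, and the Wiener gain are assumptions chosen for the example. The input is assumed to be per-band power, which could be mel filter-bank energies.

```python
import numpy as np

def decision_directed_snr(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Conventional decision-directed a priori SNR estimate (illustrative sketch).

    noisy_power: (frames, bands) array of noisy-speech power.
    noise_power: (bands,) array with a noise power estimate per band.
    Returns (xi, gain), both shaped (frames, bands).
    """
    noisy_power = np.asarray(noisy_power, dtype=float)
    noise_power = np.asarray(noise_power, dtype=float)
    n_frames, n_bands = noisy_power.shape
    xi = np.empty_like(noisy_power)
    gain = np.empty_like(noisy_power)
    prev_clean_power = np.zeros(n_bands)

    for t in range(n_frames):
        # A posteriori SNR of the current frame.
        gamma = noisy_power[t] / noise_power
        # Instantaneous (maximum-likelihood) a priori SNR estimate.
        ml_term = np.maximum(gamma - 1.0, 0.0)
        if t == 0:
            # No previous frame yet, so fall back to the instantaneous estimate.
            xi_t = ml_term
        else:
            # Recursive mix of the previous clean-speech estimate and the
            # current instantaneous estimate; this is the one-frame delay.
            xi_t = alpha * prev_clean_power / noise_power + (1.0 - alpha) * ml_term
        xi_t = np.maximum(xi_t, xi_min)
        # Wiener gain, used here as a simple stand-in for an MMSE or MAP gain.
        g = xi_t / (1.0 + xi_t)
        prev_clean_power = (g ** 2) * noisy_power[t]
        xi[t] = xi_t
        gain[t] = g
    return xi, gain

# Tiny example: three frames, two bands, constant 4:1 noisy-to-noise power.
xi, gain = decision_directed_snr(np.full((3, 2), 4.0), np.ones(2))
```

Because the recursion reuses the clean-speech estimate from the previous frame, the a priori SNR lags the signal by one frame, which is the delay the abstract refers to.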
KSP suggested keywords
Frame delay, Magnitude and phase, Minimum mean square error (MMSE), Output SNR, Phase information, Phase-based, Signal-to-noise ratio (SNR), Speech signals, Decision-directed approach, MAP-based, Maximum a posteriori