ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술지 Speech Enhancement Using Phase-Dependent A Priori SNR Estimator in Log-Mel Spectral Domain
Cited 4 time in scopus Download 18 time Share share facebook twitter linkedin kakaostory
저자
이윤경, 박전규, 이윤근, 권오욱
발행일
201410
출처
ETRI Journal, v.36 no.5, pp.721-729
ISSN
1225-6463
출판사
한국전자통신연구원 (ETRI)
DOI
https://dx.doi.org/10.4218/etrij.14.2214.0039
협약과제
14MS1500, 모바일 플랫폼 기반 대화모델 적용 자연어 음성인터페이스 기술 개발, 이윤근
초록
We propose a novel phase-based method for single-channel speech enhancement to extract and enhance the desired signals in noisy environments by utilizing the phase information. In the method, a phase-dependent a priori signal-to-noise ratio (SNR) is estimated in the log-mel spectral domain to utilize both the magnitude and phase information of input speech signals. The phase-dependent estimator is incorporated into the conventional magnitude-based decision-directed approach that recursively computes the a priori SNR from noisy speech. Additionally, we reduce the performance degradation owing to the one-frame delay of the estimated phase-dependent a priori SNR by using a minimum mean square error (MMSE)-based and maximum a posteriori (MAP)-based estimator. In our speech enhancement experiments, the proposed phase-dependent a priori SNR estimator is shown to improve the output SNR by 2.6 dB for both the MMSE-based and MAP-based estimator cases as compared to a conventional magnitude-based estimator.
키워드
Decision-directed approach, Minimum mean square error estimator, Phase modeling, Speech enhancement, Speech separation
KSP 제안 키워드
Error estimator, Frame delay, Magnitude and phase, Minimum Mean Square Error(MMSE), Output SNR, Phase information, Phase modeling, Phase-based, Signal noise ratio(SNR), Speech Separation, Speech Signal