ETRI-Knowledge Sharing Platform

Details

Journal Article: Development of an Optimized Feature Extraction Algorithm for Throat Signal Analysis
Cited 9 times in Scopus. Downloaded 2 times.
Authors
정영규, 한문성, Sang Jo Lee
Publication Date
June 2007
Source
ETRI Journal, v.29 no.3, pp.292-299
ISSN
1225-6463
Publisher
Electronics and Telecommunications Research Institute (ETRI)
DOI
https://dx.doi.org/10.4218/etrij.07.0506.0040
Funded Project
06MH1900, Development of Five-Senses Information Processing Technology for Network-Based Immersive Services, 박준석
Abstract
In this paper, we present a speech recognition system using a throat microphone. The use of this kind of microphone minimizes the impact of environmental noise. Due to the absence of high frequencies and the partial loss of formant frequencies, previous systems using throat microphones have shown a lower recognition rate than systems that use standard microphones. To develop a high-performance automatic speech recognition (ASR) system using only a throat microphone, we propose two methods. First, based on Korean phonological feature theory and a detailed throat signal analysis, we show that it is possible to develop an ASR system using only a throat microphone, and we propose conditions that the feature extraction algorithm must satisfy. Second, we optimize the zero-crossing with peak amplitude (ZCPA) algorithm to guarantee the high performance of the ASR system using only a throat microphone. For ZCPA optimization, we propose an intensification of the formant frequencies and a selection of cochlear filters. Experimental results show that this system yields a performance improvement of about 4% and a reduction in time complexity of 25% when compared to the performance of a standard ZCPA algorithm on throat microphone signals.
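For readers unfamiliar with ZCPA, the following is a minimal Python sketch of the general idea for a single band-limited channel: the interval between successive upward zero crossings gives a frequency estimate, and the log-compressed peak amplitude within that interval is added to the matching frequency bin. It is an illustration under stated assumptions, not the implementation from the paper. The function name `zcpa_histogram`, the `bin_edges` parameter, and the `log1p` compression are assumptions chosen for the example, and the paper's formant intensification and cochlear filter selection are not reproduced.

```python
import numpy as np

def zcpa_histogram(x, fs, bin_edges):
    """Accumulate a ZCPA-style frequency histogram for one band-limited channel.

    x         : 1-D signal, assumed already filtered by one cochlear-style band-pass filter
    fs        : sampling rate in Hz
    bin_edges : ascending frequency bin edges in Hz (hypothetical layout)
    """
    x = np.asarray(x, dtype=float)
    # Sample indices where the signal passes from negative to non-negative.
    up = np.flatnonzero((x[:-1] < 0) & (x[1:] >= 0)) + 1
    hist = np.zeros(len(bin_edges) - 1)
    for start, end in zip(up[:-1], up[1:]):
        freq = fs / (end - start)        # inverse of the interval between upward crossings
        peak = x[start:end].max()        # peak amplitude inside the interval
        k = np.searchsorted(bin_edges, freq, side="right") - 1
        if 0 <= k < len(hist):
            hist[k] += np.log1p(peak)    # log-compressed peak weights the bin
    return hist
```

A full front end in this style would run the same procedure over a bank of band-pass filters and sum the per-channel histograms frame by frame. Choosing which filters to keep is the kind of selection the abstract refers to.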
KSP Suggested Keywords
Formant frequencies, High Frequency(HF), High performance, Peak amplitude, Recognition rate, Signal analysis, Speech recognition system, Time Complexity, automatic speech recognition(ASR), cochlear filters, environmental noise