ETRI-Knowledge Sharing Platform

Details

Journal article: Intra- and Inter-Frame Features for Automatic Speech Recognition
Cited 12 times in Scopus
Authors
이성주, 강병옥, 정훈, 이윤근
Publication date
2014-06
Source
ETRI Journal, v.36 no.3, pp.514-517
ISSN
1225-6463
Publisher
Electronics and Telecommunications Research Institute (ETRI)
DOI
https://dx.doi.org/10.4218/etrij.14.0213.0181
Funded project
14MS1500, Development of natural-language speech interface technology applying a dialogue model on mobile platforms, 이윤근
Abstract
In this paper, alternative dynamic features for speech recognition are proposed. The goal of this work is to improve speech recognition accuracy by deriving a representation of distinctive dynamic characteristics from a speech spectrum. This work was inspired by two temporal dynamics of a speech signal. One is the highly non-stationary nature of speech, and the other is the inter-frame change of a speech spectrum. We adopt a sub-frame spectrum analyzer to capture very rapid spectral changes within a speech analysis frame. In addition, we attempt to measure spectral fluctuations in a more complex manner than traditional dynamic features such as delta or double-delta. To evaluate the proposed features, speech recognition tests over smartphone environments were conducted. The experimental results show that feature streams simply combined with the proposed features improve the recognition accuracy of a hidden Markov model-based speech recognizer. © 2014 ETRI.
Keywords
Feature extraction, Speech recognition
KSP suggested keywords
Dynamic features, Feature extraction, Inter-frame, Non-Stationary, Spectral changes, Spectrum analyzer, Speech Signal, Speech analysis, Speech recognition accuracy, Sub-Frame, Temporal Dynamics