ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Speech Activity Detection with Lip Movement Image Signals
Cited 1 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Soo-Jong Lee, Jun Park, Eung-Kyeu Kim
Issue Date
2007-08
Citation
Pacific Rim Conference on Communications, Computers and signal Processing (PACRIM) 2007, pp.403-406
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/PACRIM.2007.4313259
Abstract
This paper describes an attempt to correlate lip movement visual information acquired via a camera with speech audio information acquired via a microphone from a human speaker in order to prevent audio created by external noise from being misrecognized as speech emitted by said speaker. Images of the face of a human speaker are acquired via a PC camera and are then separated into images that indicate lip movement and images that do not indicate lip movement. The data of lip movement image signals is saved in shared memory and shared with the speech recognition process. This data is analyzed by the speech activity detection process, which is a pre-processing step of sound recognition. We combined a speech recognition processor and an image recognizer, and the interworking function successfully operated at the rate of 99.3%. ©2007 IEEE.
KSP Keywords
Audio information, External noise, Lip movement, Pre-processing, Shared Memory, Visual information, detection process, sound recognition, speech activity detection, speech recognition