ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Nonparametric Feature Matching Based Conditional Random Fields for Gesture Recognition from Multi-Modal Video
Cited 25 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Ju Yong Chang
Issue Date
2016-08
Citation
IEEE Transactions on Pattern Analysis and Machine Intelligence, v.38, no.8, pp.1612-1625
ISSN
0162-8828
Publisher
IEEE
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1109/TPAMI.2016.2519021
Abstract
We present a new gesture recognition method that is based on the conditional random field (CRF) model using multiple feature matching. Our approach solves the labeling problem, determining gesture categories and their temporal ranges at the same time. A generative probabilistic model is formalized and probability densities are nonparametrically estimated by matching input features with a training dataset. In addition to the conventional skeletal joint-based features, the appearance information near the active hand in an RGB image is exploited to capture the detailed motion of fingers. The estimated likelihood function is then used as the unary term for our CRF model. The smoothness term is also incorporated to enforce the temporal coherence of our solution. Frame-wise recognition results can then be obtained by applying an efficient dynamic programming technique. To estimate the parameters of the proposed CRF model, we incorporate the structured support vector machine (SSVM) framework that can perform efficient structured learning by using large-scale datasets. Experimental results demonstrate that our method provides effective gesture recognition results for challenging real gesture datasets. By scoring 0.8563 in the mean Jaccard index, our method has obtained the state-of-the-art results for the gesture recognition track of the 2014 ChaLearn Looking at People (LAP) Challenge.
KSP Keywords
Appearance information, CRF model, Conditional Random Field(CRF), Dynamic programming technique, Generative probabilistic model, Gesture datasets, Input features, Jaccard Index, Large-scale datasets, Multi-modal, Nonparametric feature