ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Position Puzzle Network and Augmentation: Localizing Human Keypoints beyond the Bounding Box
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Soonchan Park, Jinah Park
Issue Date
2023-11
Citation
Machine Vision and Applications, v.34, no.6, pp.1-14
ISSN
0932-8092
Publisher
Springer Verlag
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1007/s00138-023-01471-6
Abstract
When estimating human pose with a partial image of a person, we, humans, do not confine the spatial range of our estimation to the given image and can readily localize keypoints outside of the image by referring to visual clues such as the body size. However, computational methods for human pose estimation do not consider those keypoints outside and focus only on the bounded area of a given image. In this paper, we propose a neural network and a data augmentation method to extend the range of human pose estimation beyond the bounding box. While our Position Puzzle Network expands the spatial range of keypoint localization by refining the position and the size of the target’s bounding box, Position Puzzle Augmentation enables the keypoint detector to estimate keypoints not only within, but also beyond the input image. We show that the proposed method enhances the baseline keypoint detectors by 39.5% and 30.5% on average in mAP and mAR, respectively, by enabling the localization of keypoints out of the bounding box using a cropped image dataset prepared for proper evaluation. Additionally, we verify that the proposed method does not degrade the performance under the original benchmarks and instead, improves the performance by alleviating false-positive errors.
KSP Keywords
Augmentation method, Body size, Bounding Box, Computational method, Data Augmentation, Human pose estimation, Image datasets, Keypoint detector, Neural networks, the body