ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Performance Improvement Method of the Video Visual Relation Detection with Multi-modal Feature Fusion
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Kwangju Kim, Pyong-Kun Kim, Kil-Taek Lim, Jong Taek Lee
Issue Date
2022-02
Citation
International Conference on Artificial Intelligence in Information and Communication (ICAIIC) 2022, pp.87-91
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICAIIC54071.2022.9722699
Abstract
Video visual relation detection is a novel research problem that aims to detect instances of visual relations of interest in a video. In this paper, we propose a performance improvement method of the video visual relation detection with multi-modal feature fusion. First, we introduce a spatial feature extraction method that is designed to include the relative positions of objects itself and between objects in the image. Next, we suggest a relationship classifier that is designed to accommodate the complexity of the input features. Our proposed method achieves 6.65 mAP, and ranked the 2nd place in the visual relation detection task of Video Relation Understanding Challenge (VRU), the ACM Multimedia 2020.
KSP Keywords
Detection task, Feature Fusion, Improvement method, Input features, Multi-modal, Relation Understanding, Relative position, Spatial feature extraction, feature extraction method, performance improvement