ETRI-Knowledge Sharing Platform

MOVES: Motion-Oriented VidEo Sampling for Natural Language-Based Vehicle Retrieval
Authors
Dongyoung Kim, Kyoungoh Lee, In-su Jang, Kwang-Ju Kim, Pyong-Kun Kim, Jaejun Yoo
Issue Date
2024-07
Citation
International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2024, pp.1-7
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/AVSS61716.2024.10672583
Abstract
Retrieving a target vehicle from natural language descriptions plays a crucial role in intelligent transportation systems. Existing methods tackle this task with models that leverage the correlation between textual and visual representations, such as CLIP. However, these models struggle to capture the temporal characteristics of video data, so researchers have improved temporal understanding through various data augmentation techniques and video encoders. Yet these approaches often overlook the detailed temporal characteristics of vehicles. To overcome this limitation, we introduce MOVES, a Motion-Oriented VidEo Sampling method that effectively utilizes the motion information of the target vehicle. Furthermore, we construct a robust model by implementing a re-ranking algorithm to address a variety of vehicle attributes. As a result, our proposed model achieves state-of-the-art performance on the public vehicle retrieval dataset.
KSP Keywords
State-of-the-art performance, Data Augmentation, Intelligent transportation systems, Motion information, Natural language, Proposed model, Re-Ranking Algorithm, Robust model, Temporal characteristics, Video data, Visual Representation