ETRI Knowledge Sharing Platform : LAttE: A label-free and multimodal framework for context-aware person re-identification

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Journal Article LAttE: A label-free and multimodal framework for context-aware person re-identification

Cited 0 time in scopus

Abstract: Person re-identification (Re-ID) refers to the task of identifying the same individual across multiple non-overlapping camera views, and it is considered a critical task in surveillance, security, and smart city applications. We introduce label-free attributes and pose embedding (LAttE), a novel Re-ID framework designed for intelligent surveillance systems that eliminates the need for manual attribute annotation. LAttE constructs a rich attribute bank by using GPT-4o to synthesize diverse human-centric descriptors, which are then embedded using a contrastive language–image pretraining encoder. These textual attributes are fused with visual and pose features through a cross-modal attention mechanism, resulting in a comprehensive representation of pedestrian appearance and structure. To further enhance robustness, we incorporate a feature alignment strategy based on the maximum mean discrepancy, improving consistency across varying viewpoints and sensor conditions. Experiments on benchmark datasets show that LAttE achieves state-of-the-art performance in both mean average precision and Rank-1 accuracy. These findings highlight its potential as a scalable and label-free solution for Re-ID tasks in intelligent vehicle applications, including pedestrian detection, in-cabin monitoring, and cooperative driving systems.

KSP Keywords: Art performance, Attention mechanism, Benchmark datasets, Context aware, Cooperative Driving Systems, Critical task, Feature alignment, Intelligent Surveillance, Intelligent Vehicle, Label-free, Maximum mean discrepancy

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.