ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation
Cited 3 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Kim Ji-Yeon, Oh Hyun-Bin, Kwon Byung-Ki, Dahun Kim, Yongjin Kwon, Tae-Hyun Oh
Issue Date
2024-07
Citation
IEEE ROBOTICS AND AUTOMATION LETTERS, v.9, no.7, pp.6186-6193
ISSN
2377-3766
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1109/LRA.2024.3396408
Abstract
We present Uni-DVPS, a unified model for Depth-aware Video Panoptic Segmentation (DVPS) that jointly tackles distinct vision tasks, i.e., video panoptic segmentation, monocular depth estimation, and object tracking. In contrast to the prior works that adopt diverged decoder networks tailored for each task, we propose an architecture with a unified Transformer decoder network. We design a single Transformer decoder network for multi-task learning to increase shared operations to facilitate the synergies between tasks and exhibit high efficiency. We also observe that our unified query learns instance-aware representation guided by multi-task supervision, which encourages query-based tracking and obviates the need for training extra tracking module. We validate our architectural design choices with experiments on Cityscapes-DVPS and SemKITTI-DVPS datasets. The performances of all tasks are jointly improved, and we achieve state-of-the-art results on DVPQ metric for both datasets.
KSP Keywords
Architectural Design, Monocular depth estimation, Object Tracking, Query-based, Unified model, high efficiency, multi-task learning, need for, state-of-The-Art