ETRI-Knowledge Sharing Platform

Horizontal Attention Based Generation Module for Unsupervised Domain Adaptive Stereo Matching
Cited 4 times in Scopus
Authors
Sungjun Wang, Junghyun Seo, Hyunjae Jeon, Sungjin Lim, Sanghyun Park, Yongseob Lim
Issue Date
2023-10
Citation
IEEE ROBOTICS AND AUTOMATION LETTERS, v.8, no.10, pp.6779-6786
ISSN
2377-3766
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1109/LRA.2023.3313009
Abstract
The emergence of convolutional neural networks (CNNs) has led to significant advancements in various computer vision tasks. Among them, stereo matching is one of the most popular research areas because it enables the reconstruction of 3D information, which is difficult to obtain with only a monocular camera. However, CNNs have their limitations, particularly their susceptibility to domain shift: CNN-based stereo matching networks suffer from performance degradation under domain changes. Moreover, obtaining a significant amount of real-world ground truth data is laborious and costly compared to acquiring synthetic data. In this letter, we propose an end-to-end framework that utilizes image-to-image translation to overcome the domain gap in stereo matching. Specifically, we suggest a horizontal attentive generation (HAG) module that incorporates the epipolar constraints when generating target-stylized left-right views. By employing a horizontal attention mechanism during generation, our method can address the issues related to a small receptive field by aggregating more information from each view without using the entire feature map. Therefore, our network can maintain consistency between the views during image generation, making it more robust across different datasets.
KSP Keywords
3D information, Attention mechanism, Computer Vision(CV), Convolution neural network(CNN), End to End(E2E), Feature Map, Ground truth data, Image generation, Monocular Camera, Real-world, Receptive field
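
Illustrative note: the abstract describes attention that aggregates information along the horizontal direction rather than over the entire feature map. The sketch below shows one generic way such row-wise attention could look, assuming rectified stereo pairs in which corresponding points lie on the same image row. It is not the paper's HAG module: the function names `softmax` and `horizontal_attention` are hypothetical, there are no learned projections, and feature maps are plain nested lists of shape [H][W][C].

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def horizontal_attention(query, key, value):
    """Row-wise attention over feature maps stored as nested lists [H][W][C].

    Each query pixel (y, x) attends only to key pixels (y, x') in the same
    row, so the cost per row is O(W^2) instead of O((H*W)^2) for the full map.
    Hypothetical helper for illustration; not the authors' implementation.
    """
    channels = len(query[0][0])
    scale = math.sqrt(channels)
    output = []
    for q_row, k_row, v_row in zip(query, key, value):
        out_row = []
        for q in q_row:
            # Scaled dot-product similarity against every pixel in this row only
            scores = [sum(a * b for a, b in zip(q, k)) / scale for k in k_row]
            weights = softmax(scores)
            # Weighted sum of the value vectors along the row
            out_row.append([
                sum(w * v[c] for w, v in zip(weights, v_row))
                for c in range(len(v_row[0]))
            ])
        output.append(out_row)
    return output

# Toy 2x2 feature maps with 2 channels (made-up values).
left = [[[1.0, 0.0], [0.0, 1.0]],
        [[1.0, 1.0], [0.5, 0.5]]]
right = [[[1.0, 0.0], [0.0, 1.0]],
         [[2.0, 2.0], [2.0, 2.0]]]

# Left-view pixels query the right view along matching rows.
out = horizontal_attention(left, right, right)
```

Because each row is processed independently, changing the features in one row of `right` leaves the output for every other row unchanged, which is the property that ties this kind of attention to the epipolar constraint.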