Horizontal Attention Based Generation Module for Unsupervised Domain Adaptive Stereo Matching

Cited by: 3
Authors
Wang, Sungjun [1 ]
Seo, Junghyun [1 ]
Jeon, Hyunjae [1 ]
Lim, Sungjin [1 ]
Park, Sanghyun [1 ]
Lim, Yongseob [1 ]
Affiliations
[1] Daegu Gyeongbuk Inst Sci & Technol, Daegu 42988, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Image synthesis; Generators; Training; Three-dimensional displays; Synthetic data; Task analysis; Image reconstruction; Deep learning for visual perception; computer vision for automation;
DOI
10.1109/LRA.2023.3313009
Chinese Library Classification
TP24 [Robotics];
Discipline Classification Codes
080202; 1405;
Abstract
The emergence of convolutional neural networks (CNNs) has led to significant advances in many computer vision tasks. Among them, stereo matching is one of the most popular research areas because it enables the reconstruction of 3D information that is difficult to obtain with a monocular camera alone. However, CNNs are susceptible to domain shift: CNN-based stereo matching networks suffer performance degradation under domain changes. Moreover, obtaining a large amount of real-world ground-truth data is laborious and costly compared to acquiring synthetic data. In this letter, we propose an end-to-end framework that uses image-to-image translation to overcome the domain gap in stereo matching. Specifically, we propose a horizontal attentive generation (HAG) module that incorporates the epipolar constraint when generating target-stylized left-right views. By employing a horizontal attention mechanism during generation, our method mitigates the small-receptive-field problem by aggregating more information from each view without using the entire feature map. As a result, our network maintains consistency between the two views during image generation, making it more robust across datasets.
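The record does not specify the internals of the HAG module, but the core idea the abstract describes — attention restricted to a single image row, which in rectified stereo is the epipolar line — can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation; the function name, shapes, and scaled-dot-product formulation are assumptions.

```python
import numpy as np

def horizontal_attention(left_feat, right_feat):
    """Row-wise (horizontal) cross-attention sketch: each position in the
    left feature map attends only to positions on the SAME row of the
    right feature map, mirroring the epipolar constraint of rectified
    stereo (corresponding points share a scanline).

    left_feat, right_feat: arrays of shape (H, W, C).
    Returns an (H, W, C) array of right-view features aggregated per row.
    """
    H, W, C = left_feat.shape
    out = np.empty_like(left_feat)
    for y in range(H):
        q = left_feat[y]                  # (W, C) queries from the left row
        k = right_feat[y]                 # (W, C) keys/values, right row
        scores = q @ k.T / np.sqrt(C)     # (W, W) affinities within the row
        scores -= scores.max(axis=1, keepdims=True)   # numerical stability
        attn = np.exp(scores)
        attn /= attn.sum(axis=1, keepdims=True)       # softmax over the row
        out[y] = attn @ k                 # aggregate along the epipolar line
    return out
```

Restricting attention to one row keeps the cost per pixel at O(W) instead of O(H*W) for full-image attention, which is what lets the generator aggregate view-consistent context "without using the entire feature map."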
Pages: 6779 - 6786
Page count: 8