End-to-end learning of self-rectification and self-supervised disparity prediction for stereo vision

被引：6

作者：

Zhang, Xuchong ^{[1
]}

Zhao, Yongli ^{[1
]}

Wang, Hang ^{[2
]}

Zhai, Han ^{[1
]}

Sun, Hongbin ^{[1
]}

Zheng, Nanning ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Peoples R China

来源：

NEUROCOMPUTING | 2022年 / 494卷

关键词：

Stereo vision; Self-rectification; Disparity prediction; Self-supervised matching; End -to -end learning; NETWORK;

D O I：

10.1016/j.neucom.2022.04.095

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Stereo rectification and stereo matching are two critical components for the practical application of stereo vision systems. Previous studies treat them as two individual issues. For stereo rectification, var-ious traditional algorithms are proposed to estimate homography transformations, but the performance and the efficiency are unsatisfactory for real-time deployment. For stereo matching, disparity accuracy has been largely improved by learning based methods. However, the input data of all previous stereo net-works are assumed to be a pair of offline pre-rectified images, making them invalidate for accurate matching when the stereo vision system suffers from mechanical misalignment due to external collisions or temperature variations. In this paper, we optimize these two components jointly and propose an end -to-end learning framework to achieve online self-rectification and self-supervised disparity prediction simultaneously. The overall network contains two cascaded subnetworks which enable stereo rectifica-tion and stereo matching sequentially for a pair of unrectified images. The experimental results are eval-uated on both publicly available datasets and realistic scenarios. Evaluation results demonstrate that, the proposed network produces state-of-the-art results for self-rectification in terms of computation accu-racy and speed, and also produces competitive disparity results with previous self-supervised methods. Therefore, the proposed design provides a more practical and efficient solution for stereo vision systems deployed on mobile platforms.(c) 2022 Elsevier B.V. All rights reserved.

引用

页码：308 / 319

页数：12

共 50 条

[1] ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems
Zhang, Yinda
Khamis, Sameh
Rhemann, Christoph
Valentin, Julien
Kowdle, Adarsh
Tankovich, Vladimir
Schoenberg, Michael
Izadi, Shahram
Funkhouser, Thomas
Fanello, Sean
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 802 - 819
[2] PVStereo: Pyramid Voting Module for End-to-End Self-Supervised Stereo Matching
Wang, Hengli
Fan, Rui
Cai, Peide
Liu, Ming
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 4353 - 4360
[3] An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding
Fang, Hongchao
Xie, Pengtao
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1324 - 1340
[4] Self-supervised end-to-end graph local clustering
Zhe Yuan
World Wide Web, 2023, 26 : 1157 - 1179
[5] Self-supervised end-to-end graph local clustering
Yuan, Zhe
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (03): : 1157 - 1179
[6] Depth Edge and Structure Optimization-Based End-to-End Self-Supervised Stereo Matching
Yang, Wenbang
Cheng, Xianjing
Yong, Zhao
Qian, Ren
Li, Jianhua
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (13)
[7] Self-Supervised Representations Improve End-to-End Speech Translation
Wu, Anne
Wang, Changhan
Pino, Juan
Gu, Jiatao
INTERSPEECH 2020, 2020, : 1491 - 1495
[8] Geometric Consistency for Self-Supervised End-to-End Visual Odometry
Iyer, Ganesh
Murthy, J. Krishna
Gupta, Gunshi
Krishna, K. Madhava
Paull, Liam
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 380 - 388
[9] End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Chang, Xuankai
Maekaku, Takashi
Fujita, Yuya
Watanabe, Shinji
INTERSPEECH 2022, 2022, : 3819 - 3823
[10] End-to-end Jordanian dialect speech-to-text self-supervised learning framework
Safieh, Ali A.
Abu Alhaol, Ibrahim
Ghnemat, Rawan
FRONTIERS IN ROBOTICS AND AI, 2022, 9

← 1 2 3 4 5 →