End-to-end learning of self-rectification and self-supervised disparity prediction for stereo vision

被引:6
|
作者
Zhang, Xuchong [1 ]
Zhao, Yongli [1 ]
Wang, Hang [2 ]
Zhai, Han [1 ]
Sun, Hongbin [1 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Peoples R China
关键词
Stereo vision; Self-rectification; Disparity prediction; Self-supervised matching; End -to -end learning; NETWORK;
D O I
10.1016/j.neucom.2022.04.095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stereo rectification and stereo matching are two critical components for the practical application of stereo vision systems. Previous studies treat them as two individual issues. For stereo rectification, var-ious traditional algorithms are proposed to estimate homography transformations, but the performance and the efficiency are unsatisfactory for real-time deployment. For stereo matching, disparity accuracy has been largely improved by learning based methods. However, the input data of all previous stereo net-works are assumed to be a pair of offline pre-rectified images, making them invalidate for accurate matching when the stereo vision system suffers from mechanical misalignment due to external collisions or temperature variations. In this paper, we optimize these two components jointly and propose an end -to-end learning framework to achieve online self-rectification and self-supervised disparity prediction simultaneously. The overall network contains two cascaded subnetworks which enable stereo rectifica-tion and stereo matching sequentially for a pair of unrectified images. The experimental results are eval-uated on both publicly available datasets and realistic scenarios. Evaluation results demonstrate that, the proposed network produces state-of-the-art results for self-rectification in terms of computation accu-racy and speed, and also produces competitive disparity results with previous self-supervised methods. Therefore, the proposed design provides a more practical and efficient solution for stereo vision systems deployed on mobile platforms.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:308 / 319
页数:12
相关论文
共 50 条
  • [1] ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems
    Zhang, Yinda
    Khamis, Sameh
    Rhemann, Christoph
    Valentin, Julien
    Kowdle, Adarsh
    Tankovich, Vladimir
    Schoenberg, Michael
    Izadi, Shahram
    Funkhouser, Thomas
    Fanello, Sean
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 802 - 819
  • [2] PVStereo: Pyramid Voting Module for End-to-End Self-Supervised Stereo Matching
    Wang, Hengli
    Fan, Rui
    Cai, Peide
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 4353 - 4360
  • [3] An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding
    Fang, Hongchao
    Xie, Pengtao
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1324 - 1340
  • [4] Self-supervised end-to-end graph local clustering
    Zhe Yuan
    World Wide Web, 2023, 26 : 1157 - 1179
  • [5] Self-supervised end-to-end graph local clustering
    Yuan, Zhe
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (03): : 1157 - 1179
  • [6] Depth Edge and Structure Optimization-Based End-to-End Self-Supervised Stereo Matching
    Yang, Wenbang
    Cheng, Xianjing
    Yong, Zhao
    Qian, Ren
    Li, Jianhua
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (13)
  • [7] Self-Supervised Representations Improve End-to-End Speech Translation
    Wu, Anne
    Wang, Changhan
    Pino, Juan
    Gu, Jiatao
    INTERSPEECH 2020, 2020, : 1491 - 1495
  • [8] Geometric Consistency for Self-Supervised End-to-End Visual Odometry
    Iyer, Ganesh
    Murthy, J. Krishna
    Gupta, Gunshi
    Krishna, K. Madhava
    Paull, Liam
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 380 - 388
  • [9] End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
    Chang, Xuankai
    Maekaku, Takashi
    Fujita, Yuya
    Watanabe, Shinji
    INTERSPEECH 2022, 2022, : 3819 - 3823
  • [10] End-to-end Jordanian dialect speech-to-text self-supervised learning framework
    Safieh, Ali A.
    Abu Alhaol, Ibrahim
    Ghnemat, Rawan
    FRONTIERS IN ROBOTICS AND AI, 2022, 9