A CNN-Based Online Self-Calibration of Binocular Stereo Cameras for Pose Change

被引:2
作者
Song, Jin Gyu [1 ]
Lee, Joon Woong [1 ]
机构
[1] Chonnam Natl Univ, Ind Engn Dept, Gwangju 500757, South Korea
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期
基金
新加坡国家研究基金会;
关键词
Cameras; Convolutional neural networks; Calibration; Training; Feature extraction; Estimation; Three-dimensional displays; Convolution neural network (CNN); online self-calibration; patch-wise cross-attention mechanism; rotation-angle regression; stereovision;
D O I
10.1109/TIV.2023.3281034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article proposes a novel method that can automatically and accurately recognize the pose change of binocular stereo cameras in real time and correct these changes. Focused on predicting a five degree-of-freedom extrinsic pose, we design a convolutional neural network (CNN) that implements the regression of rotation angles of two cameras. The proposed method increases regression accuracy using the information inherent in the entire image. To this end, the CNN divides the image into patches of a certain size, extracts detailed features and context features of the patches, and extracts attention information for patches belonging to the left and right images. Training and evaluating the CNN requires many stereo images with variations from the initial setup of the cameras. We solve this problem using miscalibration. In miscalibration, angles expected to be rotated for the three axes of the left and right cameras are randomly sampled within a range of +/- 2.5 degrees, and a pair of rectified images are transformed using the sampled angles. The CNN uses these transformed images to infer the angle at which the camera axis is expected to have been rotated. Then, the pair of transformed images are corrected with these inferred angles. The superiority of the proposed method is demonstrated using the KITTI odometry dataset and the GY dataset we built.
引用
收藏
页码:2542 / 2552
页数:11
相关论文
共 35 条
  • [1] Ba L. J., 2016, arXiv
  • [2] Belk J., 2011, Quaternion distance
  • [3] Pyramid Stereo Matching Network
    Chang, Jia-Ren
    Chen, Yong-Sheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5410 - 5418
  • [4] Continuous Stereo Self-Calibration by Camera Parameter Tracking
    Dang, Thao
    Hoffmann, Christian
    Stiller, Christoph
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) : 1536 - 1550
  • [5] Fan YF, 2020, IEEE INT CONF ROBOT, P1084, DOI [10.1109/ICRA40945.2020.9197581, 10.1109/icra40945.2020.9197581]
  • [6] A compact algorithm for rectification of stereo pairs
    Fusiello, A
    Trucco, E
    Verri, A
    [J]. MACHINE VISION AND APPLICATIONS, 2000, 12 (01) : 16 - 22
  • [7] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [8] Geiger A, 2011, IEEE INT VEH SYM, P963, DOI 10.1109/IVS.2011.5940405
  • [9] Online Training of Stereo Self-Calibration Using Monocular Depth Estimation
    Gil, Yotam
    Elmalem, Shay
    Haim, Harel
    Marom, Emanuel
    Giryes, Raja
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2021, 7 : 812 - 823
  • [10] Hansen P, 2012, PROC CVPR IEEE, P1059, DOI 10.1109/CVPR.2012.6247784