SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses

被引:11
作者
Fan, Jinlong [1 ]
Zhang, Jing [1 ]
Tao, Dacheng [1 ]
机构
[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2006, Australia
关键词
Distortion; Training; Predictive models; Lenses; Annotations; Self-supervised learning; Task analysis; image rectification; RADIAL DISTORTION; LINEAR-ESTIMATION; GEOMETRY;
D O I
10.1109/TIP.2022.3231087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has demonstrated its power in image rectification by leveraging the representation capacity of deep neural networks via supervised training based on a large-scale synthetic dataset. However, the model may overfit the synthetic images and generalize not well on real-world fisheye images due to the limited universality of a specific distortion model and the lack of explicitly modeling the distortion and rectification process. In this paper, we propose a novel self-supervised image rectification (SIR) method based on an important insight that the rectified results of distorted images of a same scene from different lenses should be the same. Specifically, we devise a new network architecture with a shared encoder and several prediction heads, each of which predicts the distortion parameter of a specific distortion model. We further leverage a differentiable warping module to generate the rectified images and re-distorted images from the distortion parameters and exploit the intra- and inter-model consistency between them during training, thereby leading to a self-supervised learning scheme without the need for ground-truth distortion parameters or normal images. Experiments on synthetic dataset and real-world fisheye images demonstrate that our method achieves comparable or even better performance than the supervised baseline method and representative state-of-the-art (SOTA) methods. The proposed self-supervised method also provides a possible way to improve the universality of distortion models while keeping their self-consistency. Code and datasets will be available at https://github.com/loong8888/SIR.
引用
收藏
页码:865 / 877
页数:13
相关论文
共 65 条
[1]   Automatic Lens Distortion Correction Using One-Parameter Division Models [J].
Aleman-Flores, Miguel ;
Alvarez, Luis ;
Gomez, Luis ;
Santana-Cedres, Daniel .
IMAGE PROCESSING ON LINE, 2014, 4 :327-343
[2]   Unsupervised Vanishing Point Detection and Camera Calibration from a Single Manhattan Image with Radial Distortion [J].
Antunes, Michel ;
Barreto, Joao P. ;
Aouada, Djamila ;
Ottersten, Bjorn .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6691-6699
[3]  
Barreto JP, 2005, IEEE I CONF COMP VIS, P625
[4]   DeepCalib: A Deep Learning Approach for Automatic Intrinsic Calibration of Wide Field-of-View Cameras [J].
Bogdan, Oleksandr ;
Eckstein, Viktor ;
Rameau, Francois ;
Bazin, Jean-Charles .
PROCEEDINGS CVMP 2018: THE 15TH ACM SIGGRAPH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2018,
[5]   Automatic Radial Distortion Estimation from a Single Image [J].
Bukhari, Faisal ;
Dailey, Matthew N. .
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2013, 45 (01) :31-45
[6]  
Bukhari F, 2010, LECT NOTES COMPUT SC, V6454, P11, DOI 10.1007/978-3-642-17274-8_2
[7]   Optimizing Content-Preserving Projections for Wide-Angle Images [J].
Carroll, Robert ;
Agrawala, Maneesh ;
Agarwala, Aseem .
ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)
[8]  
Caruso D, 2015, IEEE INT C INT ROBOT, P141, DOI 10.1109/IROS.2015.7353366
[9]  
Chao CH, 2020, INT CONF ACOUST SPEE, P2248, DOI [10.1109/ICASSP40776.2020.9054191, 10.1109/icassp40776.2020.9054191]
[10]  
Chen T, 2020, Arxiv, DOI [arXiv:2002.05709, DOI 10.48550/ARXIV.2002.05709]