SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses

被引：11

作者：

Fan, Jinlong ^{[1
]}

Zhang, Jing ^{[1
]}

Tao, Dacheng ^{[1
]}

机构：

[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2006, Australia

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

关键词：

Distortion; Training; Predictive models; Lenses; Annotations; Self-supervised learning; Task analysis; image rectification; RADIAL DISTORTION; LINEAR-ESTIMATION; GEOMETRY;

D O I：

10.1109/TIP.2022.3231087

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning has demonstrated its power in image rectification by leveraging the representation capacity of deep neural networks via supervised training based on a large-scale synthetic dataset. However, the model may overfit the synthetic images and generalize not well on real-world fisheye images due to the limited universality of a specific distortion model and the lack of explicitly modeling the distortion and rectification process. In this paper, we propose a novel self-supervised image rectification (SIR) method based on an important insight that the rectified results of distorted images of a same scene from different lenses should be the same. Specifically, we devise a new network architecture with a shared encoder and several prediction heads, each of which predicts the distortion parameter of a specific distortion model. We further leverage a differentiable warping module to generate the rectified images and re-distorted images from the distortion parameters and exploit the intra- and inter-model consistency between them during training, thereby leading to a self-supervised learning scheme without the need for ground-truth distortion parameters or normal images. Experiments on synthetic dataset and real-world fisheye images demonstrate that our method achieves comparable or even better performance than the supervised baseline method and representative state-of-the-art (SOTA) methods. The proposed self-supervised method also provides a possible way to improve the universality of distortion models while keeping their self-consistency. Code and datasets will be available at https://github.com/loong8888/SIR.

引用

页码：865 / 877

页数：13

共 65 条

[1] Automatic Lens Distortion Correction Using One-Parameter Division Models [J].

Aleman-Flores, Miguel ;

Alvarez, Luis ;

Gomez, Luis ;

Santana-Cedres, Daniel .

IMAGE PROCESSING ON LINE, 2014, 4 :327-343

[2] Unsupervised Vanishing Point Detection and Camera Calibration from a Single Manhattan Image with Radial Distortion [J].

Antunes, Michel ;

Barreto, Joao P. ;

Aouada, Djamila ;

Ottersten, Bjorn .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6691-6699

[3]

Barreto JP, 2005, IEEE I CONF COMP VIS, P625

[4] DeepCalib: A Deep Learning Approach for Automatic Intrinsic Calibration of Wide Field-of-View Cameras [J].

Bogdan, Oleksandr ;

Eckstein, Viktor ;

Rameau, Francois ;

Bazin, Jean-Charles .

PROCEEDINGS CVMP 2018: THE 15TH ACM SIGGRAPH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2018,

[5] Automatic Radial Distortion Estimation from a Single Image [J].

Bukhari, Faisal ;

Dailey, Matthew N. .

JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2013, 45 (01) :31-45

[6]

Bukhari F, 2010, LECT NOTES COMPUT SC, V6454, P11, DOI 10.1007/978-3-642-17274-8_2

[7] Optimizing Content-Preserving Projections for Wide-Angle Images [J].

Carroll, Robert ;

Agrawala, Maneesh ;

Agarwala, Aseem .

ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)

[8]

Caruso D, 2015, IEEE INT C INT ROBOT, P141, DOI 10.1109/IROS.2015.7353366

[9]

Chao CH, 2020, INT CONF ACOUST SPEE, P2248, DOI [10.1109/ICASSP40776.2020.9054191, 10.1109/icassp40776.2020.9054191]

[10]

Chen T, 2020, Arxiv, DOI [arXiv:2002.05709, DOI 10.48550/ARXIV.2002.05709]

← 1 2 3 4 5 6 7 →