GeoCalib: Learning Single-Image Calibration with Geometric Optimization

被引:0
作者
Veicht, Alexander [1 ]
Sarlin, Paul-Edouard [1 ]
Lindenberger, Philipp [1 ]
Pollefeys, Marc [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Microsoft Mixed Real & AI Lab, Cambridge, England
来源
COMPUTER VISION - ECCV 2024, PT XL | 2025年 / 15098卷
关键词
Camera calibration; Deep learning; Optimization; SELF-CALIBRATION; WIDE-ANGLE; CAMERA;
D O I
10.1007/978-3-031-73661-2_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From a single image, visual cues can help deduce intrinsic and extrinsic camera parameters like the focal length and the gravity direction. This single-image calibration can benefit various downstream applications like image editing and 3D mapping. Current approaches to this problem are based on either classical geometry with lines and vanishing points or on deep neural networks trained end-to-end. The learned approaches are more robust but struggle to generalize to new environments and are less accurate than their classical counterparts. We hypothesize that they lack the constraints that 3D geometry provides. In this work, we introduce GeoCalib, a deep neural network that leverages universal rules of 3D geometry through an optimization process. GeoCalib is trained end-to-end to estimate camera parameters and learns to find useful visual cues from the data. Experiments on various benchmarks show that GeoCalib is more robust and more accurate than existing classical and learned approaches. Its internal optimization estimates uncertainties, which help flag failure cases and benefit downstream applications like visual localization. The code and trained models are publicly available at https://github.com/cvg/GeoCalib.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 91 条
[1]  
Agarwal S., About us
[2]   Building Rome in a Day [J].
Agarwal, Sameer ;
Furukawa, Yasutaka ;
Snavely, Noah ;
Simon, Ian ;
Curless, Brian ;
Seitz, Steven M. ;
Szeliski, Richard .
COMMUNICATIONS OF THE ACM, 2011, 54 (10) :105-112
[3]  
Agarwal S, 2010, LECT NOTES COMPUT SC, V6312, P29, DOI 10.1007/978-3-642-15552-9_3
[4]  
Aguilera D., 2005, P ISPRS COM, V2
[5]  
[Anonymous], 2003, Multiple View Geometry in Computer Vision
[6]   Unsupervised Vanishing Point Detection and Camera Calibration from a Single Manhattan Image with Radial Distortion [J].
Antunes, Michel ;
Barreto, Joao P. ;
Aouada, Djamila ;
Ottersten, Bjorn .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6691-6699
[7]  
Armeni I., 2017, arXiv
[8]   INTERPRETING PERSPECTIVE IMAGES [J].
BARNARD, ST .
ARTIFICIAL INTELLIGENCE, 1983, 21 (04) :435-462
[9]  
Bazin JC, 2012, IEEE INT C INT ROBOT, P4282, DOI 10.1109/IROS.2012.6385802
[10]  
Bazin JC, 2012, PROC CVPR IEEE, P638, DOI 10.1109/CVPR.2012.6247731