Deep Homography Estimation with Pairwise Invertibility Constraint

被引:3
作者
Wang, Xiang [1 ]
Wang, Chen [1 ]
Bai, Xiao [1 ]
Liu, Yun [2 ]
Zhou, Jun [3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[3] Griffith Univ, Sch Informat & Commun Technol, Nathan, Qld, Australia
来源
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018 | 2018年 / 11004卷
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Homography estimation; Supervised deep learning; Invertibility constraint; Spatial pyramid pooling;
D O I
10.1007/978-3-319-97785-0_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works have shown that deep learning methods can improve the performance of the homography estimation due to the better features extracted by convolutional networks. Nevertheless, these works are supervised and rely too much on the labeled training dataset as they aim to make the homography be estimated as close to the ground truth as possible, which may cause overfitting. In this paper, we propose a Siamese network with pairwise invertibility constraint for supervised homography estimation. We utilize spatial pyramid pooling modules to improve the quality of extracted features in each image by exploiting context information. Discovering the fact that there is a pair of homographies from a given image pair which are inverse matrices, we propose the invertibility constraint to avoid overfitting. To employ the constraint, we adopt the matrix representation of the homography rather than the commonly used 4-point parameterization in other methods. Experiments on the synthetic dataset generated from MSCOCO dataset show that our proposed method outperforms several state-of-the-art approaches.
引用
收藏
页码:204 / 214
页数:11
相关论文
共 24 条
[1]  
[Anonymous], 2017, CVPR
[2]  
[Anonymous], 2017, CVPR
[3]  
[Anonymous], 1981, P 7 INT JOINT C ARTI
[4]  
[Anonymous], 2016, J MACH LEARN RES
[5]  
[Anonymous], 2003, Multiple view geometry in computer vision
[6]   An incremental structured part model for object recognition [J].
Bai, Xiao ;
Ren, Peng ;
Zhang, Huigang ;
Zhou, Jun .
NEUROCOMPUTING, 2015, 154 :189-199
[7]  
Baker H, 2006, TLS-TIMES LIT SUPPL, P11
[8]   Lucas-Kanade 20 years on: A unifying framework [J].
Baker, S ;
Matthews, I .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 56 (03) :221-255
[9]  
Chang Jia-Ren, 2018, ARXIV180308669
[10]  
DeTone D., 2016, ARXIV160603798