Bridging the Domain Gap for Ground-to-Aerial Image Matching

被引:123
作者
Regmi, Krishna [1 ]
Shah, Mubarak [1 ]
机构
[1] Univ Cent Florida, Ctr Res Comp Vis, Orlando, FL 32816 USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The visual entities in cross-view (e.g. ground and aerial) images exhibit drastic domain changes due to the differences in viewpoints each set of images is captured from. Existing state-of-the-art methods address the problem by learning view-invariant images descriptors. We propose a novel method for solving this task by exploiting the generative powers of conditional GANs to synthesize an aerial representation of a ground-level panorama query and use it to minimize the domain gap between the two views. The synthesized image being from the same view as the reference (target) image, helps the network to preserve important cues in aerial images following our Joint Feature Learning approach. We fuse the complementary features from a synthesized aerial image with the original ground-level panorama features to obtain a robust query representation. In addition, we employ multi-scale feature aggregation in order to preserve image representations at different scales useful for solving this complex task. Experimental results show that our proposed approach performs significantly better than the state-of-the-art methods on the challenging CVUSA dataset in terms of top-1 and top-1% retrieval accuracies. Furthermore, we evaluate the generalization of the proposed method for urban landscapes on our newly collected cross-view localization dataset with geo-reference information.
引用
收藏
页码:470 / 479
页数:10
相关论文
共 48 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[3]  
[Anonymous], 2018, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2017.2711011
[4]  
[Anonymous], 2018, INT C MACH LEARN
[5]  
[Anonymous], 1986, READINGS COMPUTER VI, DOI DOI 10.1016/B978-0-08-051581-6.50024-6
[6]  
[Anonymous], 2014, ARXIV PREPRINT ARXIV
[7]  
[Anonymous], LOCALIZING ORIENTING
[8]  
Bansal M, 2012, LECT NOTES COMPUT SC, V7583, P175, DOI 10.1007/978-3-642-33863-2_18
[9]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[10]  
Choi Yunjey, 2018, IEEE C COMP VIS PATT