Ground-to-Aerial Image Geo-Localization ith a Hard Exemplar Reweighting Triplet Loss

被引:99
作者
Cai, Sudong [1 ]
Guo, Yulan [1 ,2 ]
Khan, Salman [3 ]
Hu, Jiwei [4 ]
Wen, Gongjian [1 ,2 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
[2] Sun Yat Sen Univ, Guangzhou, Peoples R China
[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Wuhan Univ Technol, Wuhan, Peoples R China
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00848
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of ground-to-aerial image geo-localization can be achieved by matching a ground view query image to a reference database of aerial/satellite images. It is highly challenging due to the dramatic viewpoint changes and unknown orientations. In this paper, we propose a novel in-batch reweighting triplet loss to emphasize the positive effect of hard exemplars during end-to-end training. We also integrate an attention mechanism into our model using feature-level contextual information. To analyze the difficulty level of each triplet, we first enforce a modified logistic regression to triplets with a distance rectifying factor. Then, the reference negative distances for corresponding anchors are set, and the relative weights of triplets are computed by comparing their difficulty to the corresponding references. To reduce the influence of extreme hard data and less useful simple exemplars, the final weights are pruned using upper and lower bound constraints. Experiments on two benchmark datasets show that the proposed approach significantly outperforms the state-of-the-art methods.
引用
收藏
页码:8390 / 8399
页数:10
相关论文
共 41 条
[1]   Learning to Match Aerial Images with Deep Attentive Architectures [J].
Altwaijry, Hani ;
Trulls, Eduard ;
Hays, James ;
Fua, Pascal ;
Belongie, Serge .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3539-3547
[2]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/CVPR.2016.572, 10.1109/TPAMI.2017.2711011]
[3]  
Bansal M., 2011, 19 ACM INT C MULT NO, P1125
[4]  
Bansal M, 2012, LECT NOTES COMPUT SC, V7583, P175, DOI 10.1007/978-3-642-33863-2_18
[5]   Deep Visual-Semantic Quantization for Efficient Image Retrieval [J].
Cao, Yue ;
Long, Mingsheng ;
Wang, Jianmin ;
Liu, Shichen .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :916-925
[6]   A Parallel Space Saving Algorithm and Performance Test [J].
Chang, Siyuan ;
Liu, Jun ;
Liu, Fang ;
Yang, Jie .
2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, :198-202
[7]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[8]   Smart Mining for Deep Metric Learning [J].
Harwood, Ben ;
Kumar, Vijay B. G. ;
Carneiro, Gustavo ;
Reid, Ian ;
Drummond, Tom .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2840-2848
[9]  
He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
[10]  
Hermans Alexander, 2017, ARXIV