Predictive Information Preservation via Variational Information Bottleneck for Cross-View Geo-Localization

被引：1

作者：

Li, Wansi ^{[1
]}

Hu, Qian ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu 611731, Sichuan, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT I | 2022年 / 1700卷

关键词：

Geo-localization; Variational information bottleneck; Deep neural network; DOMAIN ADAPTATION;

D O I：

10.1007/978-981-19-7946-0_34

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-view geo-localization task, which is to handle the problem of matching two images captured same target building, but from different viewpoints, e.g., satellite-view and drone-view, has received significant attention in recent years. However, this research is impeded by the large visual appearance changes across different views and irrelevant content contained in the background. Previous work mitigates the geo-view gap by some similarity-based constraints or utilizing rich contextual information near the target as auxiliary information. Despite some promising breakthroughs made by such methods, they fail to consider the involvement of irrelevant features retained in the high-dimensional features, which reduces the accuracy of the retrieval result. This paper proposes a simple and efficient model termed Predictive Information Preservation Bottleneck (PIPB), using the variational information bottleneck to discard the irrelevant information and retain the predictive information, enhancing the result performance. In particular, our proposed PIPB consists of two stages. Firstly, we learn the part-based features of each image to make full use of neighbor clues, which is realized by the square-ring partition strategy. Then, at the second stage, these learned representations are fed through the variational information bottleneck module to filter out superfluous information. This step can promote the robustness and generalization of our model and improve experiment performance. Extensive experiments are conducted on the recently-released dataset University-1652 and the fundamental benchmark CVACT, showing remarkable performance results compared to other competitive methods.

引用

页码：403 / 419

页数：17

共 53 条

[1]

Alemi AA, 2019, Arxiv, DOI arXiv:1612.00410

[2]

Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]

[3] Image Segmentation Using Information Bottleneck Method [J].

Bardera, Anton ;

Rigau, Jaume ;

Boada, Imma ;

Feixas, Miquel ;

Sbert, Mateu .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) :1601-1612

[4]

Bin Peng X, 2020, Arxiv, DOI arXiv:1810.00821

[5] Semantic Cross-View Matching [J].

Castaldo, Francesco ;

Zamir, Amir ;

Angst, Roland ;

Palmieri, Francesco ;

Savarese, Silvio .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :1044-1052

[6]

Chechik G, 2009, LECT NOTES COMPUT SC, V5524, P11, DOI 10.1007/978-3-642-02172-5_2

[7] Learning a similarity metric discriminatively, with application to face verification [J].

Chopra, S ;

Hadsell, R ;

LeCun, Y .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546

[8]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[9] Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification [J].

Deng, Weijian ;

Zheng, Liang ;

Ye, Qixiang ;

Kang, Guoliang ;

Yang, Yi ;

Jiao, Jianbin .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :994-1003

[10]

Federici M, 2020, Arxiv, DOI arXiv:2002.07017

← 1 2 3 4 5 6 →