Regional Relation Modeling for Visual Place Recognition

被引：4

作者：

Zhu, Yingying ^{[1
]}

Li, Biao ^{[1
]}

Wang, Jiong ^{[2
]}

Zhao, Zhou ^{[2
]}

机构：

[1] Shenzhen Univ, Shenzhen, Peoples R China

[2] Zhejiang Univ, Hangzhou, Peoples R China

来源：

PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) | 2020年

基金：

中国国家自然科学基金;

关键词：

Visual place recognition; Content-based image retrieval; Convolutional neural network; Relation modeling; IMAGE; FEATURES;

D O I：

10.1145/3397271.3401176

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the process of visual perception, humans perceive not only the appearance of objects existing in a place but also their relationships (e.g. spatial layout). However, the dominant works on visual place recognition are always based on the assumption that two images depict the same place if they contain enough similar objects, while the relation information is neglected. In this paper, we propose a regional relation module which models the regional relationships and converts the convolutional feature maps to the relational feature maps. We further design a cascaded pooling method to get discriminative relation descriptors by preventing the influence of confusing relations and preserving as much useful information as possible. Extensive experiments on two place recognition benchmarks demonstrate that training with the proposed regional relation module improves the appearance descriptors and the relation descriptors are complementary to appearance descriptors. When these two kinds of descriptors are concatenated together, the resulting combined descriptors outperform the state-of-the-art methods.

引用

页码：821 / 830

页数：10

共 63 条

[1] [Anonymous], 2016, ICLR
[2] [Anonymous], 2017, IEEE INT C INT ROBOT
[3] VQA: Visual Question Answering
Antol, Stanislaw
Agrawal, Aishwarya
Lu, Jiasen
Mitchell, Margaret
Batra, Dhruv
Zitnick, C. Lawrence
Parikh, Devi
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
[4] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[5] DisLocation: Scalable Descriptor Distinctiveness for Location Recognition
Arandjelovic, Relja
Zisserman, Andrew
[J]. COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 188 - 204
[6] All about VLAD
Arandjelovic, Relja
Zisserman, Andrew
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1578 - 1585
[7] Aggregating Deep Convolutional Features for Image Retrieval
Babenko, Artem
Lempitsky, Victor
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1269 - 1277
[8] Battaglia PW, 2016, ADV NEUR IN, V29
[9] SURF: Speeded up robust features
Bay, Herbert
Tuytelaars, Tinne
Van Gool, Luc
[J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
[10] Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610

← 1 2 3 4 5 6 7 →