Regional Relation Modeling for Visual Place Recognition

被引:4
作者
Zhu, Yingying [1 ]
Li, Biao [1 ]
Wang, Jiong [2 ]
Zhao, Zhou [2 ]
机构
[1] Shenzhen Univ, Shenzhen, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) | 2020年
基金
中国国家自然科学基金;
关键词
Visual place recognition; Content-based image retrieval; Convolutional neural network; Relation modeling; IMAGE; FEATURES;
D O I
10.1145/3397271.3401176
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the process of visual perception, humans perceive not only the appearance of objects existing in a place but also their relationships (e.g. spatial layout). However, the dominant works on visual place recognition are always based on the assumption that two images depict the same place if they contain enough similar objects, while the relation information is neglected. In this paper, we propose a regional relation module which models the regional relationships and converts the convolutional feature maps to the relational feature maps. We further design a cascaded pooling method to get discriminative relation descriptors by preventing the influence of confusing relations and preserving as much useful information as possible. Extensive experiments on two place recognition benchmarks demonstrate that training with the proposed regional relation module improves the appearance descriptors and the relation descriptors are complementary to appearance descriptors. When these two kinds of descriptors are concatenated together, the resulting combined descriptors outperform the state-of-the-art methods.
引用
收藏
页码:821 / 830
页数:10
相关论文
共 63 条
  • [1] [Anonymous], 2016, ICLR
  • [2] [Anonymous], 2017, IEEE INT C INT ROBOT
  • [3] VQA: Visual Question Answering
    Antol, Stanislaw
    Agrawal, Aishwarya
    Lu, Jiasen
    Mitchell, Margaret
    Batra, Dhruv
    Zitnick, C. Lawrence
    Parikh, Devi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
  • [4] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
  • [5] DisLocation: Scalable Descriptor Distinctiveness for Location Recognition
    Arandjelovic, Relja
    Zisserman, Andrew
    [J]. COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 188 - 204
  • [6] All about VLAD
    Arandjelovic, Relja
    Zisserman, Andrew
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1578 - 1585
  • [7] Aggregating Deep Convolutional Features for Image Retrieval
    Babenko, Artem
    Lempitsky, Victor
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1269 - 1277
  • [8] Battaglia PW, 2016, ADV NEUR IN, V29
  • [9] SURF: Speeded up robust features
    Bay, Herbert
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
  • [10] Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610