Explicit feature disentanglement for visual place recognition across appearance changes

Cited by: 2
Authors
Tang, Li [1]
Wang, Yue [1]
Tan, Qimeng [2]
Xiong, Rong [1]
Affiliations
[1] Zhejiang Univ, Dept Control Sci & Engn, Hangzhou 30012, Peoples R China
[2] Beijing Inst Spacecraft Syst Engn, Beijing Key Lab Intelligent Space Robot Syst Tech, Beijing, Peoples R China
Keywords
Place recognition; feature disentanglement; adversarial; self-supervised; changing appearance; simultaneous localization; navigation; SLAM
DOI
10.1177/17298814211037497
Chinese Library Classification number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
In the long-term deployment of mobile robots, changes in appearance make localization challenging. When a robot revisits a place or restarts from an existing map, it needs global localization, for which place recognition provides a coarse position estimate. For visual sensors, appearance changes such as the transition from day to night and seasonal variation can degrade the performance of a visual place recognition system. To address this problem, we propose to learn domain-unrelated features across extreme appearance changes, where a domain denotes a specific appearance condition, such as a season or a kind of weather. We use an adversarial network with two discriminators to disentangle domain-related and domain-unrelated features from images, and we use the domain-unrelated features as descriptors for place recognition. Given images from different domains, our network is trained in a self-supervised manner that does not require correspondences between these domains. In addition, our feature extractors are shared among all domains, which makes it possible to cover more appearance conditions without increasing model complexity. Qualitative and quantitative results on two toy cases show that our network can disentangle domain-related and domain-unrelated features from the given data. Experiments on three public datasets and one proposed dataset for visual place recognition compare the performance of our method with that of several typical algorithms. An ablation study validates the effectiveness of the discriminators introduced in our network. Finally, we use a four-domain dataset to verify that the network can extend to multiple domains with one model while achieving similar performance.
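The abstract states that the domain-unrelated features serve as descriptors for place recognition. A minimal sketch of what descriptor-based retrieval can look like is given below. It is not the authors' implementation: the cosine-similarity metric, the function names `l2_normalize` and `retrieve_places`, the descriptor dimension, and the random stand-in descriptors are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so a dot product equals cosine similarity."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)


def retrieve_places(query_desc, map_desc, top_k=1):
    """Return indices and similarities of the top_k map places for each query.

    query_desc: (num_queries, dim) array of descriptors (hypothetical stand-ins
                for domain-unrelated features).
    map_desc:   (num_places, dim) array of descriptors from the existing map.
    """
    q = l2_normalize(query_desc)
    m = l2_normalize(map_desc)
    sim = q @ m.T                                # (num_queries, num_places)
    order = np.argsort(-sim, axis=1)[:, :top_k]  # most similar first
    scores = np.take_along_axis(sim, order, axis=1)
    return order, scores


# Toy data: random vectors stand in for descriptors (an assumption, not real features).
rng = np.random.default_rng(0)
map_desc = rng.normal(size=(5, 8))
# Queries are slightly perturbed copies of map places 3 and 1.
query_desc = map_desc[[3, 1]] + 0.01 * rng.normal(size=(2, 8))

idx, score = retrieve_places(query_desc, map_desc)
print(idx.ravel(), score.ravel())
```

In this toy setup each query should match the map place it was perturbed from. With real data, the match would only be as reliable as the descriptors are invariant to the appearance change between query and map.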
Pages: 19