BEVPlace: Learning LiDAR-based Place Recognition using Bird's Eye View Images

被引：21

作者：

Luo, Lun ^{[1
,2
,4
]}

Zheng, Shuhang ^{[2
]}

Li, Yixuan ^{[2
]}

Fan, Yongzhi ^{[2
]}

Yu, Beinan ^{[2
]}

Cao, Si-Yuan ^{[1
,2
]}

Li, Junwei ^{[2
]}

Shen, Hui-Liang ^{[1
,2
,3
]}

机构：

[1] Zhejiang Univ, Ningbo Innovat Ctr, Ningbo, Peoples R China

[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China

[3] Key Lab Collaborat Sensing & Autonomous Unmanned, Hangzhou, Peoples R China

[4] HAOMO AI Technol Co Ltd, Beijing, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

中国国家自然科学基金;

关键词：

LOCALIZATION;

D O I：

10.1109/ICCV51070.2023.00799

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Place recognition is a key module for long-term SLAM systems. Current LiDAR-based place recognition methods usually use representations of point clouds such as unordered points or range images. These methods achieve high recall rates of retrieval, but their performance may degrade in the case of view variation or scene changes. In this work, we explore the potential of a different representation in place recognition, i.e. bird's eye view (BEV) images. We validate that, in scenes of slight viewpoint changes, a simple NetVLAD network trained on BEV images achieves comparable performance to the state-of-the-art place recognition methods. For robustness to view variations, we propose a rotation-invariant network called BEVPlace. We use group convolution to extract rotation-equivariant local features from the images and NetVLAD for global feature aggregation. In addition, we observe that the distance between BEV features is correlated with the geometry distance of point clouds. Based on the observation, we develop a method to estimate the position of the query cloud, extending the usage of place recognition. The experiments conducted on large-scale public datasets show that our method 1) achieves state-of-the-art performance in terms of recall rates, 2) is robust to view changes, 3) shows strong generalization ability, and 4) can estimate the positions of query point clouds. Source codes are publicly available at https://github.com/zjuluolun/BEVPlace.

引用

页码：8666 / 8675

页数：10

共 38 条

[1]

Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/CVPR.2016.572, 10.1109/TPAMI.2017.2711011]

[2] Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age [J].

Cadena, Cesar ;

Carlone, Luca ;

Carrillo, Henry ;

Latif, Yasir ;

Scaramuzza, Davide ;

Neira, Jose ;

Reid, Ian ;

Leonard, John J. .

IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (06) :1309-1332

[3]

Chen Xieyuanli, 2021, ROB SCI SYST 16, P1, DOI DOI 10.15607/RSS.2020.XVI.009

[4]

Cohen TS, 2016, PR MACH LEARN RES, V48

[5] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[6] Vector Neurons: A General Framework for SO(3)-Equivariant Networks [J].

Deng, Congyue ;

Litany, Or ;

Duan, Yueqi ;

Poulenard, Adrien ;

Tagliasacchi, Andrea ;

Guibas, Leonidas .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12180-12189

[7]

Du J., 2020, EUR C COMP VIS

[8]

Fan Z., 2022, AAAI C ART INT

[9]

Fan Z., 2022, AAAI C ART INT

[10] Bags of Binary Words for Fast Place Recognition in Image Sequences [J].

Galvez-Lopez, Dorian ;

Tardos, Juan D. .

IEEE TRANSACTIONS ON ROBOTICS, 2012, 28 (05) :1188-1197

← 1 2 3 4 →