MFF-PR: Point Cloud and Image Multi-modal Feature Fusion for Place Recognition

被引:5
作者
Liu, Wenlei [1 ]
Fei, Jiajun [1 ]
Zhu, Ziyu [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China
来源
2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2022) | 2022年
关键词
Place recognition; Multi-modal fusion; SLAM; Point cloud and image;
D O I
10.1109/ISMAR55827.2022.00082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Place recognition technology can eliminate cumulative errors and thus plays a vital role in autonomous driving. In this paper, the composite feature of point cloud and image data is obtained by multi-modal feature fusion, thereby improving positioning accuracy. Semantic features, instance features, topological features, and image texture features are integrated to obtain comprehensive features, presenting strong robustness and complex scene expression abilities. Topological features consist of intra-class features and inter-instance features, which allow users to obtain more comprehensive scene structure information. The place recognition methods of data-level fusion and feature-level fusion based on point cloud and image are compared. This paper verifies the proposed method on SemanticKitti and nuScenes datasets. The results show that it outperforms state-of-the-art place recognition methods.
引用
收藏
页码:647 / 655
页数:9
相关论文
共 50 条
  • [31] DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network
    Liu, Hui
    Li, Shanshan
    Zhu, Jicheng
    Deng, Kai
    Liu, Meng
    Nie, Liqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [32] Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition
    Hui, Le
    Cheng, Mingmei
    Xie, Jin
    Yang, Jian
    Cheng, Ming-Ming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1258 - 1270
  • [33] Ore Rock Fragmentation Calculation Based on Multi-Modal Fusion of Point Clouds and Images
    Peng, Jianjun
    Cui, Yunhao
    Zhong, Zhidan
    An, Yi
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [34] Multi-Modal Fusion Emotion Recognition Method of Speech Expression Based on Deep Learning
    Liu, Dong
    Wang, Zhiyong
    Wang, Lifeng
    Chen, Longxi
    FRONTIERS IN NEUROROBOTICS, 2021, 15
  • [35] Graph-Based Multi-Modal Multi-View Fusion for Facial Action Unit Recognition
    Chen, Jianrong
    Dey, Sujit
    IEEE ACCESS, 2024, 12 : 69310 - 69324
  • [36] Multi-modal Few-shot Image Recognition with enhanced semantic and visual integration
    Dong, Chunru
    Wang, Lizhen
    Zhang, Feng
    Hua, Qiang
    IMAGE AND VISION COMPUTING, 2025, 157
  • [37] Visual Scene-Aware Hybrid and Multi-Modal Feature Aggregation for Facial Expression Recognition
    Lee, Min Kyu
    Kim, Dae Ha
    Song, Byung Cheol
    SENSORS, 2020, 20 (18) : 1 - 24
  • [38] SelFLoc: Selective feature fusion for large-scale point cloud-based place
    Qiu, Qibo
    Wang, Wenxiao
    Ying, Haochao
    Liang, Dingkun
    Gao, Haiming
    He, Xiaofei
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [39] Enhanced Aiot Multi-Modal Fusion for Human Activity Recognition in Ambient Assisted Living Environment
    Patel, Ankit D.
    Jhaveri, Rutvij H.
    Patel, Ashish D.
    Shah, Kaushal A.
    Shah, Jigarkumar
    SOFTWARE-PRACTICE & EXPERIENCE, 2025, 55 (04) : 731 - 747
  • [40] An Efficient 3-D Point Cloud Place Recognition Approach Based on Feature Point Extraction and Transformer
    Ye, Tao
    Yan, Xiangming
    Wang, Shouan
    Li, Yunwang
    Zhou, Fuqiang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71