Image-to-Point Registration via Cross-Modality Correspondence Retrieval

被引:0
|
作者
Bie, Lin [1 ]
Li, Siqi [1 ]
Cheng, Kai [2 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Army Engn Univ, Command Control Coll, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024 | 2024年
关键词
Image-to-Point Cloud registration; cross-modality correspondence retrieval; frustum point retrieval; combined correspondence retrieval; virtual point cloud;
D O I
10.1145/3652583.3658074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image-to-Point Cloud registration between 2D images and 3D LiDAR point clouds is a significant task in computer vision. The traditional registration pipeline first establishes correspondences between images and point clouds and then performs pose estimation based on the generated matches. However, 2D-3D correspondences are inherently difficult to be established due to the large modality gap between images and LiDAR point clouds. To this end, we build a bridge to alleviate the 2D-3D modality gap, which aligns LiDAR point clouds to the virtual points generated by images. In this way, the modality gap can be alleviated to the domain gap of different types of point clouds, i.e. original point clouds and virtual point clouds. Concretely, our framework conducts feature fusion from the LiDAR and virtual point cloud by utilizing the Transformer layer. To relieve the domain gap, a frustum points retrieval module and a combined correspondences retrieval module are proposed based on the consistency of the feature and position descriptor to select the correct correspondences among the candidates, which are generated from the simultaneous retrieval of features and position descriptors. In the implementation procedure, we design a frustum retrieval loss and a combined correspondence retrieval loss for cross-modality correspondence retrieval. Experimental results and comparison with state-of-the-art Image-to-Point Cloud methods on KITTI and nuScenes datasets demonstrate our proposed method has achieved superior performance.
引用
收藏
页码:266 / 274
页数:9
相关论文
共 50 条
  • [21] MRI Cross-Modality Image-to-Image Translation
    Yang, Qianye
    Li, Nannan
    Zhao, Zixu
    Fan, Xingyu
    Chang, Eric I-Chao
    Xu, Yan
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [22] MRI Cross-Modality Image-to-Image Translation
    Qianye Yang
    Nannan Li
    Zixu Zhao
    Xingyu Fan
    Eric I-Chao Chang
    Yan Xu
    Scientific Reports, 10
  • [23] Hybrid Fusion with Intra- and Cross-Modality Attention for Image-Recipe Retrieval
    Li, Jiao
    Xu, Xing
    Yu, Wei
    Shen, Fumin
    Cao, Zuo
    Zuo, Kai
    Shen, Heng Tao
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 244 - 254
  • [24] Cross-Modality Multi-Atlas Segmentation via Deep Registration and Label Fusion
    Ding, Wangbin
    Li, Lei
    Zhuang, Xiahai
    Huang, Liqin
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (07) : 3104 - 3115
  • [25] Review of Cross-Modality Medical Image Prediction
    Zhou P.
    Chen H.-J.
    Yu Z.-K.
    Peng Y.-H.
    Li Y.-F.
    Yang F.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (01): : 220 - 226
  • [26] EEG-Loreta and Spect: Troubles on Cross-Modality Correspondence
    Barbanoj, M. J.
    Romero, S.
    Riba, J.
    NEUROPSYCHOBIOLOGY, 2008, 58 (3-4) : 236 - 237
  • [27] Image manipulation localization via dynamic cross-modality fusion and progressive integration
    Jin, Xiao
    Yu, Wen
    Shi, Wei
    NEUROCOMPUTING, 2024, 610
  • [28] A fast, accurate, cross-modality image geo-registration and target/object detection algorithm
    McKay, Troy
    Hirsch, Herb
    GEOSPATIAL INFOFUSION III, 2013, 8747
  • [29] Accurate Positioning via Cross-Modality Training
    Papaioannou, Savvas
    Wen, Hongkai
    Xiao, Zhuoling
    Markham, Andrew
    Trigoni, Niki
    SENSYS'15: PROCEEDINGS OF THE 13TH ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, 2015, : 239 - 251
  • [30] Implicit relative attribute enabled cross-modality hashing for face image-video retrieval
    Dai, Peng
    Wang, Xue
    Zhang, Weihang
    Zhang, Pengbo
    You, Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (18) : 23547 - 23577