Image-to-Point Registration via Cross-Modality Correspondence Retrieval

Cited by: 0

Authors:

Bie, Lin [1]

Li, Siqi [1]

Cheng, Kai [2]

Affiliations:

[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China

[2] Army Engn Univ, Command Control Coll, Nanjing, Peoples R China

Source:

PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024 | 2024

Keywords:

Image-to-Point Cloud registration; cross-modality correspondence retrieval; frustum point retrieval; combined correspondence retrieval; virtual point cloud;

DOI:

10.1145/3652583.3658074

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Image-to-Point Cloud registration between 2D images and 3D LiDAR point clouds is a significant task in computer vision. The traditional registration pipeline first establishes correspondences between images and point clouds and then performs pose estimation based on the generated matches. However, 2D-3D correspondences are inherently difficult to establish because of the large modality gap between images and LiDAR point clouds. To this end, we build a bridge across the 2D-3D modality gap by aligning LiDAR point clouds to virtual points generated from images. In this way, the modality gap is reduced to a domain gap between two types of point clouds, i.e., original point clouds and virtual point clouds. Concretely, our framework fuses features from the LiDAR and virtual point clouds using a Transformer layer. To relieve the domain gap, we propose a frustum point retrieval module and a combined correspondence retrieval module. Based on the consistency of the feature and position descriptors, these modules select the correct correspondences among the candidates, which are generated by simultaneous retrieval of features and position descriptors. In the implementation, we design a frustum retrieval loss and a combined correspondence retrieval loss for cross-modality correspondence retrieval. Experimental results and comparisons with state-of-the-art Image-to-Point Cloud methods on the KITTI and nuScenes datasets demonstrate that our proposed method achieves superior performance.


Pages: 266-274

Page count: 9
