RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient

被引:0
|
作者
Cui, Wenting [1 ,2 ]
Du, Shaoyi [1 ,2 ]
Yao, Runzhao [1 ,2 ]
Tang, Canhui [1 ,2 ]
Ye, Aixue [3 ]
Wen, Feng [3 ]
Tian, Zhiqiang [4 ]
机构
[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
[3] Huawei Technol Co Ltd, Huawei Noahs Ark Lab, Beijing 100085, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Probabilistic logic; Point cloud compression; Training; Computer architecture; Point cloud registration; 3D description and detection; policy gradient; REGISTRATION;
D O I
10.1109/TMM.2023.3338054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keypoint detection and descriptor matching are two vital steps in the 3D feature extraction framework, but they are difficult to learn in an end-to-end fashion due to their inherent discreteness. To tackle the non-differentiable operations, we formulate feature extraction as a decision-making problem: the network is treated as a policy pool that can make probabilistic estimations for keypoint selection and feature matching, supervised by maximizing a reward expectation of actions. In this way, we propose a novel end-to-end training paradigm of 3D feature extraction based on the stochastic policy gradient method, named Reinforced Detectors and Descriptors (RDD). Firstly, we propose a local-to-global probabilistic keypoint selection module that formulates the sampling probabilities of keypoints in a local-and-global mechanism to yield sparse and accurate keypoints. Secondly, we regard feature matching as an optimal transport problem and an efficient Sinkhorn method is leveraged to solve the optimal matching probabilities. In particular, we carefully design a reward function and derive gradients of probabilistic actions, thus overcoming the discreteness and providing reinforced supervision signals. Since our reward function is calculated from sampled keypoints rather than from randomly sampled points as in existing methods, the gap between training and inference is bridged. Experimental results demonstrate that our approach exceeds the quality of state-of-the-art methods and shows strong generalization ability. Remarkably, our approach can achieve significantly higher Registration Recall than other advanced methods when aligning scenes with a small number of keypoints, due to our highly accurate and repeatable detector.
引用
收藏
页码:900 / 913
页数:14
相关论文
共 50 条
  • [31] PLReg3D: Learning 3D Local and Global Descriptors Jointly for Global Localization
    Qiao, Zhijian
    Wang, Hanwen
    Zhu, Yu
    Wang, Hesheng
    2021 27TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2021,
  • [32] 3D silicon detectors
    Bates, R. L.
    ASTROPARTICLE, PARTICLE, SPACE PHYSICS AND DETECTORS FOR PHYSICS APPLICATIONS, 2012, 7 : 829 - 840
  • [33] Radiochromic 3D Detectors
    Oldham, Mark
    8TH INTERNATIONAL CONFERENCE ON 3D RADIATION DOSIMETRY (IC3DDOSE), 2015, 573
  • [34] A physics based machine learning model to characterize room temperature semiconductor detectors in 3D
    Banerjee S.
    Rodrigues M.
    Ballester M.
    Vija A.H.
    Katsaggelos A.K.
    Scientific Reports, 14 (1)
  • [35] A physics based machine learning model to characterize room temperature semiconductor detectors in 3D
    Banerjee, Srutarshi
    Rodrigues, Miesher
    Ballester, Manuel
    Vija, Alexander H.
    Katsaggelos, Aggelos K.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [36] Learning 3D Keypoint Descriptors for Non-rigid Shape Matching
    Wang, Hanyu
    Guo, Jianwei
    Yan, Dong-Ming
    Quan, Weize
    Zhang, Xiaopeng
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 3 - 20
  • [37] Learning context-aware local feature descriptors for 3D reconstruction
    Yang, Jian
    Zhou, Jian
    Fan, Hao
    Dong, Junyu
    Yu, Hui
    NEUROCOMPUTING, 2024, 594
  • [38] DeepPoint3D: Learning discriminative local descriptors using deep metric learning on 3D point clouds
    Srivastava, Siddharth
    Lall, Brejesh
    PATTERN RECOGNITION LETTERS, 2019, 127 : 27 - 36
  • [39] Learning Dynamic Scene-Conditioned 3D Object Detectors
    Zheng, Yu
    Duan, Yueqi
    Li, Zongtai
    Zhou, Jie
    Lu, Jiwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2981 - 2996
  • [40] 3D Object Recognition Based on Improved Point Cloud Descriptors
    Wen, Weiwei
    Wen, Gongjian
    Hui, Bingwei
    Qiu, Shaohua
    TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806