RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient

被引:0
|
作者
Cui, Wenting [1 ,2 ]
Du, Shaoyi [1 ,2 ]
Yao, Runzhao [1 ,2 ]
Tang, Canhui [1 ,2 ]
Ye, Aixue [3 ]
Wen, Feng [3 ]
Tian, Zhiqiang [4 ]
机构
[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
[3] Huawei Technol Co Ltd, Huawei Noahs Ark Lab, Beijing 100085, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Probabilistic logic; Point cloud compression; Training; Computer architecture; Point cloud registration; 3D description and detection; policy gradient; REGISTRATION;
D O I
10.1109/TMM.2023.3338054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keypoint detection and descriptor matching are two vital steps in the 3D feature extraction framework, but they are difficult to learn in an end-to-end fashion due to their inherent discreteness. To tackle the non-differentiable operations, we formulate feature extraction as a decision-making problem: the network is treated as a policy pool that can make probabilistic estimations for keypoint selection and feature matching, supervised by maximizing a reward expectation of actions. In this way, we propose a novel end-to-end training paradigm of 3D feature extraction based on the stochastic policy gradient method, named Reinforced Detectors and Descriptors (RDD). Firstly, we propose a local-to-global probabilistic keypoint selection module that formulates the sampling probabilities of keypoints in a local-and-global mechanism to yield sparse and accurate keypoints. Secondly, we regard feature matching as an optimal transport problem and an efficient Sinkhorn method is leveraged to solve the optimal matching probabilities. In particular, we carefully design a reward function and derive gradients of probabilistic actions, thus overcoming the discreteness and providing reinforced supervision signals. Since our reward function is calculated from sampled keypoints rather than from randomly sampled points as in existing methods, the gap between training and inference is bridged. Experimental results demonstrate that our approach exceeds the quality of state-of-the-art methods and shows strong generalization ability. Remarkably, our approach can achieve significantly higher Registration Recall than other advanced methods when aligning scenes with a small number of keypoints, due to our highly accurate and repeatable detector.
引用
收藏
页码:900 / 913
页数:14
相关论文
共 50 条
  • [1] Evaluation of features detectors and descriptors based on 3D objects
    Moreels, P
    Perona, P
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 800 - 807
  • [2] Evaluation of Features Detectors and Descriptors based on 3D Objects
    Pierre Moreels
    Pietro Perona
    International Journal of Computer Vision, 2007, 73 : 263 - 284
  • [3] Evaluation of features detectors and descriptors based on 3D objects
    Moreels, Pierre
    Perona, Pietro
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 73 (03) : 263 - 284
  • [4] On the Affinity between 3D Detectors and Descriptors
    Salti, Samuele
    Petrelli, Alioscia
    Tombari, Federico
    Di Stefano, Luigi
    SECOND JOINT 3DIM/3DPVT CONFERENCE: 3D IMAGING, MODELING, PROCESSING, VISUALIZATION & TRANSMISSION (3DIMPVT 2012), 2012, : 424 - 431
  • [5] 3D keypoint detectors and descriptors for 3D objects recognition with TOF camera
    Shaiek, Ayet
    Moutarde, Fabien
    THREE-DIMENSIONAL IMAGING, INTERACTION, AND MEASUREMENT, 2011, 7864
  • [6] Performance Evaluation of 3D Descriptors Paired with Learned Keypoint Detectors
    Spezialetti, Riccardo
    Salti, Samuele
    Di Stefano, Luigi
    AI, 2021, 2 (02) : 229 - 243
  • [7] Performance Evalution of 3D Keypoint Detectors and Descriptors for Plants Health Classification
    Azimi, Shiva
    Lall, Brejesh
    Gandhi, Tapan K.
    PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,
  • [8] Learning Descriptors for Object Recognition and 3D Pose Estimation
    Wohlhart, Paul
    Lepetit, Vincent
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3109 - 3118
  • [9] Fractal scale descriptors based on 3d wavelet moments for 3d objects
    Cui, Li
    Li, Ying
    Xu, Dong
    Li, Him
    2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 1197 - +
  • [10] Density-based 3D shape descriptors
    Akgul, Ceyhun Burak
    Sankur, Bulent
    Yemez, Yucel
    Schmitt, Francis
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)