RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient

被引:0
|
作者
Cui, Wenting [1 ,2 ]
Du, Shaoyi [1 ,2 ]
Yao, Runzhao [1 ,2 ]
Tang, Canhui [1 ,2 ]
Ye, Aixue [3 ]
Wen, Feng [3 ]
Tian, Zhiqiang [4 ]
机构
[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
[3] Huawei Technol Co Ltd, Huawei Noahs Ark Lab, Beijing 100085, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Probabilistic logic; Point cloud compression; Training; Computer architecture; Point cloud registration; 3D description and detection; policy gradient; REGISTRATION;
D O I
10.1109/TMM.2023.3338054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keypoint detection and descriptor matching are two vital steps in the 3D feature extraction framework, but they are difficult to learn in an end-to-end fashion due to their inherent discreteness. To tackle the non-differentiable operations, we formulate feature extraction as a decision-making problem: the network is treated as a policy pool that can make probabilistic estimations for keypoint selection and feature matching, supervised by maximizing a reward expectation of actions. In this way, we propose a novel end-to-end training paradigm of 3D feature extraction based on the stochastic policy gradient method, named Reinforced Detectors and Descriptors (RDD). Firstly, we propose a local-to-global probabilistic keypoint selection module that formulates the sampling probabilities of keypoints in a local-and-global mechanism to yield sparse and accurate keypoints. Secondly, we regard feature matching as an optimal transport problem and an efficient Sinkhorn method is leveraged to solve the optimal matching probabilities. In particular, we carefully design a reward function and derive gradients of probabilistic actions, thus overcoming the discreteness and providing reinforced supervision signals. Since our reward function is calculated from sampled keypoints rather than from randomly sampled points as in existing methods, the gap between training and inference is bridged. Experimental results demonstrate that our approach exceeds the quality of state-of-the-art methods and shows strong generalization ability. Remarkably, our approach can achieve significantly higher Registration Recall than other advanced methods when aligning scenes with a small number of keypoints, due to our highly accurate and repeatable detector.
引用
收藏
页码:900 / 913
页数:14
相关论文
共 50 条
  • [41] Improving 3D similarity search by enhancing and combining 3D descriptors
    Bustos, Benjamin
    Schreck, Tobias
    Walter, Michael
    Manuel Barrios, Juan
    Schaefer, Matthias
    Keim, Daniel
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (01) : 81 - 108
  • [42] A 3D model retrieval based on combinations of partial shape descriptors
    Suzuki, Motofumi T.
    Yaginuma, Yoshitomo
    Yamada, Tsuneo
    Shimizu, Yasutaka
    NAFIPS 2006 - 2006 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2, 2006, : 273 - +
  • [43] 3D Visual-based Human Motion Descriptors: A Review
    Khokhlova, Margarita
    Migniot, Cyrille
    Dipanda, Albert
    2016 12TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2016, : 564 - 572
  • [44] Density-based shape descriptors for 3D object retrieval
    Akgul, Ceyhun Burak
    Sankur, Bulent
    Schmitt, Francis
    Yemez, Yucel
    MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 322 - 329
  • [45] Improving 3D similarity search by enhancing and combining 3D descriptors
    Benjamin Bustos
    Tobias Schreck
    Michael Walter
    Juan Manuel Barrios
    Matthias Schaefer
    Daniel Keim
    Multimedia Tools and Applications, 2012, 58 : 81 - 108
  • [46] Novel binding pocket descriptors based on DrugScore potential fields encoded by 3D Zernike descriptors
    Britta Nisius
    Fan Sha
    Holger Gohlke
    Journal of Cheminformatics, 4 (Suppl 1)
  • [47] Algorithm for the computation of 3D Fourier descriptors
    Sijbers, J
    Ceulemans, T
    Van Dyck, D
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 790 - 793
  • [48] Distinctive 3D local deep descriptors
    Poiesi, Fabio
    Boscaini, Davide
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5720 - 5727
  • [49] Topological Descriptors for 3D Surface Analysis
    Zeppelzauer, Matthias
    Zielinski, Bartosz
    Juda, Mateusz
    Seidl, Markus
    COMPUTATIONAL TOPOLOGY IN IMAGE CONTEXT, CTIC 2016, 2016, 9667 : 77 - 87
  • [50] On shortened 3D local binary descriptors
    Quan, Siwen
    Ma, Jie
    Information Sciences, 2020, 510 : 33 - 49