RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient

被引：0

作者：

Cui, Wenting ^{[1
,2
]}

Du, Shaoyi ^{[1
,2
]}

Yao, Runzhao ^{[1
,2
]}

Tang, Canhui ^{[1
,2
]}

Ye, Aixue ^{[3
]}

Wen, Feng ^{[3
]}

Tian, Zhiqiang ^{[4
]}

机构：

[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China

[3] Huawei Technol Co Ltd, Huawei Noahs Ark Lab, Beijing 100085, Peoples R China

[4] Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2025年 / 27卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Three-dimensional displays; Detectors; Probabilistic logic; Point cloud compression; Training; Computer architecture; Point cloud registration; 3D description and detection; policy gradient; REGISTRATION;

D O I：

10.1109/TMM.2023.3338054

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Keypoint detection and descriptor matching are two vital steps in the 3D feature extraction framework, but they are difficult to learn in an end-to-end fashion due to their inherent discreteness. To tackle the non-differentiable operations, we formulate feature extraction as a decision-making problem: the network is treated as a policy pool that can make probabilistic estimations for keypoint selection and feature matching, supervised by maximizing a reward expectation of actions. In this way, we propose a novel end-to-end training paradigm of 3D feature extraction based on the stochastic policy gradient method, named Reinforced Detectors and Descriptors (RDD). Firstly, we propose a local-to-global probabilistic keypoint selection module that formulates the sampling probabilities of keypoints in a local-and-global mechanism to yield sparse and accurate keypoints. Secondly, we regard feature matching as an optimal transport problem and an efficient Sinkhorn method is leveraged to solve the optimal matching probabilities. In particular, we carefully design a reward function and derive gradients of probabilistic actions, thus overcoming the discreteness and providing reinforced supervision signals. Since our reward function is calculated from sampled keypoints rather than from randomly sampled points as in existing methods, the gap between training and inference is bridged. Experimental results demonstrate that our approach exceeds the quality of state-of-the-art methods and shows strong generalization ability. Remarkably, our approach can achieve significantly higher Registration Recall than other advanced methods when aligning scenes with a small number of keypoints, due to our highly accurate and repeatable detector.

引用

页码：900 / 913

页数：14

共 50 条

[1] Evaluation of features detectors and descriptors based on 3D objects
Moreels, P
Perona, P
TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 800 - 807
[2] Evaluation of Features Detectors and Descriptors based on 3D Objects
Pierre Moreels
Pietro Perona
International Journal of Computer Vision, 2007, 73 : 263 - 284
[3] Evaluation of features detectors and descriptors based on 3D objects
Moreels, Pierre
Perona, Pietro
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 73 (03) : 263 - 284
[4] On the Affinity between 3D Detectors and Descriptors
Salti, Samuele
Petrelli, Alioscia
Tombari, Federico
Di Stefano, Luigi
SECOND JOINT 3DIM/3DPVT CONFERENCE: 3D IMAGING, MODELING, PROCESSING, VISUALIZATION & TRANSMISSION (3DIMPVT 2012), 2012, : 424 - 431
[5] 3D keypoint detectors and descriptors for 3D objects recognition with TOF camera
Shaiek, Ayet
Moutarde, Fabien
THREE-DIMENSIONAL IMAGING, INTERACTION, AND MEASUREMENT, 2011, 7864
[6] Performance Evaluation of 3D Descriptors Paired with Learned Keypoint Detectors
Spezialetti, Riccardo
Salti, Samuele
Di Stefano, Luigi
AI, 2021, 2 (02) : 229 - 243
[7] Performance Evalution of 3D Keypoint Detectors and Descriptors for Plants Health Classification
Azimi, Shiva
Lall, Brejesh
Gandhi, Tapan K.
PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,
[8] Learning Descriptors for Object Recognition and 3D Pose Estimation
Wohlhart, Paul
Lepetit, Vincent
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3109 - 3118
[9] Fractal scale descriptors based on 3d wavelet moments for 3d objects
Cui, Li
Li, Ying
Xu, Dong
Li, Him
2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 1197 - +
[10] Density-based 3D shape descriptors
Akgul, Ceyhun Burak
Sankur, Bulent
Yemez, Yucel
Schmitt, Francis
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)

← 1 2 3 4 5 →