RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient

被引：0

作者：

Cui, Wenting ^{[1
,2
]}

Du, Shaoyi ^{[1
,2
]}

Yao, Runzhao ^{[1
,2
]}

Tang, Canhui ^{[1
,2
]}

Ye, Aixue ^{[3
]}

Wen, Feng ^{[3
]}

Tian, Zhiqiang ^{[4
]}

机构：

[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China

[3] Huawei Technol Co Ltd, Huawei Noahs Ark Lab, Beijing 100085, Peoples R China

[4] Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2025年 / 27卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Three-dimensional displays; Detectors; Probabilistic logic; Point cloud compression; Training; Computer architecture; Point cloud registration; 3D description and detection; policy gradient; REGISTRATION;

D O I：

10.1109/TMM.2023.3338054

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Keypoint detection and descriptor matching are two vital steps in the 3D feature extraction framework, but they are difficult to learn in an end-to-end fashion due to their inherent discreteness. To tackle the non-differentiable operations, we formulate feature extraction as a decision-making problem: the network is treated as a policy pool that can make probabilistic estimations for keypoint selection and feature matching, supervised by maximizing a reward expectation of actions. In this way, we propose a novel end-to-end training paradigm of 3D feature extraction based on the stochastic policy gradient method, named Reinforced Detectors and Descriptors (RDD). Firstly, we propose a local-to-global probabilistic keypoint selection module that formulates the sampling probabilities of keypoints in a local-and-global mechanism to yield sparse and accurate keypoints. Secondly, we regard feature matching as an optimal transport problem and an efficient Sinkhorn method is leveraged to solve the optimal matching probabilities. In particular, we carefully design a reward function and derive gradients of probabilistic actions, thus overcoming the discreteness and providing reinforced supervision signals. Since our reward function is calculated from sampled keypoints rather than from randomly sampled points as in existing methods, the gap between training and inference is bridged. Experimental results demonstrate that our approach exceeds the quality of state-of-the-art methods and shows strong generalization ability. Remarkably, our approach can achieve significantly higher Registration Recall than other advanced methods when aligning scenes with a small number of keypoints, due to our highly accurate and repeatable detector.

引用

页码：900 / 913

页数：14

共 50 条

[31] PLReg3D: Learning 3D Local and Global Descriptors Jointly for Global Localization
Qiao, Zhijian
Wang, Hanwen
Zhu, Yu
Wang, Hesheng
2021 27TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2021,
[32] 3D silicon detectors
Bates, R. L.
ASTROPARTICLE, PARTICLE, SPACE PHYSICS AND DETECTORS FOR PHYSICS APPLICATIONS, 2012, 7 : 829 - 840
[33] Radiochromic 3D Detectors
Oldham, Mark
8TH INTERNATIONAL CONFERENCE ON 3D RADIATION DOSIMETRY (IC3DDOSE), 2015, 573
[34] A physics based machine learning model to characterize room temperature semiconductor detectors in 3D
Banerjee S.
Rodrigues M.
Ballester M.
Vija A.H.
Katsaggelos A.K.
Scientific Reports, 14 (1)
[35] A physics based machine learning model to characterize room temperature semiconductor detectors in 3D
Banerjee, Srutarshi
Rodrigues, Miesher
Ballester, Manuel
Vija, Alexander H.
Katsaggelos, Aggelos K.
SCIENTIFIC REPORTS, 2024, 14 (01):
[36] Learning 3D Keypoint Descriptors for Non-rigid Shape Matching
Wang, Hanyu
Guo, Jianwei
Yan, Dong-Ming
Quan, Weize
Zhang, Xiaopeng
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 3 - 20
[37] Learning context-aware local feature descriptors for 3D reconstruction
Yang, Jian
Zhou, Jian
Fan, Hao
Dong, Junyu
Yu, Hui
NEUROCOMPUTING, 2024, 594
[38] DeepPoint3D: Learning discriminative local descriptors using deep metric learning on 3D point clouds
Srivastava, Siddharth
Lall, Brejesh
PATTERN RECOGNITION LETTERS, 2019, 127 : 27 - 36
[39] Learning Dynamic Scene-Conditioned 3D Object Detectors
Zheng, Yu
Duan, Yueqi
Li, Zongtai
Zhou, Jie
Lu, Jiwen
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2981 - 2996
[40] 3D Object Recognition Based on Improved Point Cloud Descriptors
Wen, Weiwei
Wen, Gongjian
Hui, Bingwei
Qiu, Shaohua
TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806

← 1 2 3 4 5 →