PSNet: Fast Data Structuring for Hierarchical Deep Learning on Point Cloud

Cited by: 14

Authors:

Li, Luyang [1,2]

He, Ligang [3]

Gao, Jinjin [4]

Han, Xie [1]

Affiliations:

[1] North Univ China, Sch Data Sci & Technol, Taiyuan 030051, Peoples R China

[2] Shanxi Informat Ind Technol Res Inst Co Ltd, Taiyuan 030012, Peoples R China

[3] Univ Warwick, Dept Comp, Coventry CV4 7AL, W Midlands, England

[4] Shanxi Univ Finance & Econ, Expt Ctr, Taiyuan 030006, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022, Vol. 32, No. 10

Funding:

National Natural Science Foundation of China

Keywords:

Point cloud compression; Data models; Deep learning; Training; Task analysis; Convolution; Computational modeling; point cloud; data structuring; computer vision; grouping; sampling; NETWORK;

DOI:

10.1109/TCSVT.2022.3171968

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

To retain more feature information from the local areas of a point cloud, most hierarchical deep learning models rely on local grouping and subsampling as necessary data structuring steps. Because the points in a point cloud are unordered, grouping and subsampling them can incur a significant time cost, which results in poor scalability. This paper proposes a fast data structuring method called PSNet (Point Structuring Net). PSNet transforms the spatial features of the points and matches them to the features of local areas in the point cloud. PSNet performs grouping and sampling at the same time, whereas existing methods perform sampling and grouping in two separate steps (for example, FPS followed by kNN). PSNet performs the feature transformation pointwise, whereas existing methods use the spatial relationships among the points as the reference for grouping. Thanks to these features, PSNet has two important advantages: 1) the grouping and sampling results obtained by PSNet are stable and permutation invariant; and 2) PSNet can be easily parallelized. PSNet can replace the data structuring methods in mainstream point cloud deep learning models in a plug-and-play manner. Extensive experiments show that PSNet significantly improves training and inference speed while maintaining model accuracy.
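
For orientation, the two-step baseline that the abstract contrasts PSNet with (FPS for sampling, then kNN for grouping) can be sketched roughly as below. This is a minimal NumPy illustration, not the authors' code and not PSNet itself. It assumes FPS means farthest point sampling and kNN means k-nearest-neighbor grouping, and the function names `farthest_point_sampling`, `knn_group`, and `fps_knn` are hypothetical.

```python
import numpy as np


def farthest_point_sampling(points, m, start=0):
    """Pick m point indices, each one farthest from those already picked."""
    n = points.shape[0]
    chosen = np.empty(m, dtype=np.int64)
    # Squared distance from every point to its nearest chosen centre so far.
    min_d2 = np.full(n, np.inf)
    current = start  # assumption: the first centre is an arbitrary fixed index
    for i in range(m):
        chosen[i] = current
        d2 = np.sum((points - points[current]) ** 2, axis=1)
        min_d2 = np.minimum(min_d2, d2)
        # The next centre depends on all earlier picks, so this loop is sequential.
        current = int(np.argmax(min_d2))
    return chosen


def knn_group(points, centre_idx, k):
    """For each centre, return the indices of its k nearest points."""
    centres = points[centre_idx]
    # Pairwise squared distances, shape (number of centres, number of points).
    d2 = np.sum((centres[:, None, :] - points[None, :, :]) ** 2, axis=2)
    return np.argsort(d2, axis=1)[:, :k]


def fps_knn(points, m, k):
    """Two separate steps: sample m centres, then group k neighbors per centre."""
    centre_idx = farthest_point_sampling(points, m)
    return centre_idx, knn_group(points, centre_idx, k)
```

Calling `fps_knn(points, 8, 16)` on an `(N, 3)` array returns 8 centre indices and an `(8, 16)` array of neighbor indices. In this sketch the sampling loop picks each centre only after the previous one is known, and the grouping step builds a full centre-to-point distance matrix, which gives a rough sense of why the two-step approach can become costly on large point clouds.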

Citation

Pages: 6835-6849

Page count: 15

