PSNet: Fast Data Structuring for Hierarchical Deep Learning on Point Cloud

被引:14
作者
Li, Luyang [1 ,2 ]
He, Ligang [3 ]
Gao, Jinjin [4 ]
Han, Xie [1 ]
机构
[1] North Univ China, Sch Data Sci & Technol, Taiyuan 030051, Peoples R China
[2] Shanxi Informat Ind Technol Res Inst Co Ltd, Taiyuan 030012, Peoples R China
[3] Univ Warwick, Dept Comp, Coventry CV4 7AL, W Midlands, England
[4] Shanxi Univ Finance & Econ, Expt Ctr, Taiyuan 030006, Peoples R China
基金
中国国家自然科学基金;
关键词
Point cloud compression; Data models; Deep learning; Training; Task analysis; Convolution; Computational modeling; point cloud; data structuring; computer vision; grouping; sampling; NETWORK;
D O I
10.1109/TCSVT.2022.3171968
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In order to retain more feature information of local areas on a point cloud, local grouping and subsampling are the necessary data structuring steps in most hierarchical deep learning models. Due to the disorder nature of the points in a point cloud, the significant time cost may be consumed when grouping and subsampling the points, which consequently results in poor scalability. This paper proposes a fast data structuring method called PSNet (Point Structuring Net). PSNet transforms the spatial features of the points and matches them to the features of local areas in a point cloud. PSNet achieves grouping and sampling at the same time while the existing methods process sampling and grouping in two separate steps (such as using FPS plus kNN). PSNet performs feature transformation pointwise while the existing methods uses the spatial relationship among the points as the reference for grouping. Thanks to these features, PSNet has two important advantages: 1) the grouping and sampling results obtained by PSNet is stable and permutation invariant; and 2) PSNet can be easily parallelized. PSNet can replace the data structuring methods in the mainstream point cloud deep learning models in a plug-and-play manner. We have conducted extensive experiments. The results show that PSNet can improve the training and inference speed significantly while maintaining the model accuracy.
引用
收藏
页码:6835 / 6849
页数:15
相关论文
共 56 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Balin M.F., 2019, PR MACH LEARN RES, P444
  • [3] 3DmFV: Three-Dimensional Point Cloud Classification in Real-Time Using Convolutional Neural Networks
    Ben-Shabat, Yizhak
    Lindenbaum, Michael
    Fischer, Anath
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 3145 - 3152
  • [4] Brock A, 2016, Arxiv, DOI [arXiv:1608.04236, 10.48550/arXiv.1608.04236]
  • [5] Geometric Deep Learning Going beyond Euclidean data
    Bronstein, Michael M.
    Bruna, Joan
    LeCun, Yann
    Szlam, Arthur
    Vandergheynst, Pierre
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (04) : 18 - 42
  • [6] Monocular and Binocular Interactions Oriented Deformable Convolutional Networks for Blind Quality Assessment of Stereoscopic Omnidirectional Images
    Chai, Xiongli
    Shao, Feng
    Jiang, Qiuping
    Meng, Xiangchao
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3407 - 3421
  • [7] Chen C., 2019, PROC IEEECVF C COMPU, P4994
  • [8] Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
  • [9] Deformable Convolutional Networks
    Dai, Jifeng
    Qi, Haozhi
    Xiong, Yuwen
    Li, Yi
    Zhang, Guodong
    Hu, Han
    Wei, Yichen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 764 - 773
  • [10] From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection
    Deng, Jiajun
    Zhou, Wengang
    Zhang, Yanyong
    Li, Houqiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4722 - 4734