Point cloud instance segmentation based on attention mechanism KNN and ASIS module

被引:0
作者
Xiang X.-Y. [1 ,2 ]
Wang L. [1 ,2 ]
Zong W.-P. [1 ,2 ]
Li G.-Y. [1 ]
机构
[1] Institute of Geospatial Information, Information Engeering University, Zhengzhou
[2] State Key Laboratory of Geo-Information Engineering, Xi’an
来源
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science) | 2023年 / 57卷 / 05期
关键词
attention mechanism; instance segmentation; point cloud; submanifold; voxel;
D O I
10.3785/j.issn.1008-973X.2023.05.003
中图分类号
学科分类号
摘要
A point cloud instance segmentation model with a k-nearest neighbors (KNN) module featuring attention mechanism and an improved associatively segmenting instances and semantics (ASIS) module was proposed to address the problems of discrete segmentation and insufficient feature utilization in traditional 3D convolution-based algorithms. The model took voxels as input and extracted point features through sparse convolution of 3D submanifolds. The KNN algorithm with attention mechanism was used for reorganizing the features in the semantic and instance feature space to alleviate the problem caused by the quantization error of extracted features. The reorganized semantic and instance features were correlated through the improved ASIS module to enhance the discrimination between point features. For semantic features and instance embedding, the softmax module and the meanshift algorithm were applied to obtain semantic and instance segmentation results respectively. The public S3DIS dataset was employed to validate the proposed model. The experimental results showed that the instance segmentation results of the proposed model achieved 53.1%, 57.1%, 65.2% and 52.8% in terms of mean coverage (mCoV), mean weighted coverage (mWCov), mean precision (mPrec) and mean recall (mRec) for the instance segmentation. The semantic segmentation achieved 61.7% and 88.1% respectively in terms of mean intersection over union (mIoU) and Over-all accuracy (Oacc) for the semantic segmentation. The ablation experiment verified the effectiveness of the proposed modules. © 2023 Zhejiang University. All rights reserved.
引用
收藏
页码:875 / 882
页数:7
相关论文
共 33 条
  • [1] ZHAO N, CHUA T S, LEE G H., Few-shot 3d point cloud semantic segmentation [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 8873-8882, (2021)
  • [2] WU K L, XU G D, LIU Z L, Et al., PointCSE: context-sensitive encoders for efficient 3d object detection from point cloud [J], International Journal of Machine Learning and Cybernetics, 28, 7, pp. 1-9, (2021)
  • [3] HE K M, GKIOXARI G, DOLLAR P, Et al., Mask r-cnn [C], IEEE International Conference on Computer Vision Workshops, pp. 2961-2969, (2017)
  • [4] YAO Pei-jun, YIN Yan-yun, Facade measurement method based on three-dimensional laser scanner and total station technology [J], Geotechnical Engineering Technique, 36, 2, pp. 156-159, (2022)
  • [5] WANG Chao-ying, XING Shuai, DAI Mo-fan, Et al., A method of ground object classification based on multi-scale deep feature fusion of remote sensing image and LiDAR point cloud [J], Journal of Geomatics Science and Technology, 38, 6, pp. 604-610, (2021)
  • [6] HOU J, DAI A, NIEssNER M., 3D-SIS: 3d semantic instance segmentation of RGB-d scans [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 4421-4430, (2019)
  • [7] YI L, ZHAO W, WANG H, Et al., GSPN: generative shape proposal network for 3d instance segmentation in point cloud [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 3947-3956, (2019)
  • [8] WANG W Y, YU R, HUANG Q, Et al., SGPN: similarity group proposal network for 3d point cloud instance segmentation [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 2569-2578, (2018)
  • [9] WANG X L, LIU S, SHEN X Y, Et al., Associatively segmenting instances and semantics in point clouds [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 4096-4105, (2019)
  • [10] PHAM Q H, NGUYEN T, HUA B S, Et al., Jsis3d: joint semantic-instance segmentation of 3d point clouds with multi-task pointwise networks and multi-value conditional random fields [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 8827-8836, (2018)