HAPGN: Hierarchical Attentive Pooling Graph Network for Point Cloud Segmentation

被引:45
作者
Chen, Chaofan [1 ]
Qian, Shengsheng [2 ,3 ]
Fang, Quan [2 ,3 ]
Xu, Changsheng [2 ,3 ,4 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Feature extraction; Task analysis; Layout; Logic gates; Machine learning; Two dimensional displays; Point cloud segmentation; hierarchical graph pooling; gated graph attention network; REPRESENTATION;
D O I
10.1109/TMM.2020.3009499
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Among different 3D data representations, point cloud stands out for its efficiency and flexibility. Hence, many researchers have been involved in the point cloud analysis recently. Existing approaches for point cloud segmentation task typically suffer from two limitations: 1) They usually treat different neighbor points as equals which cannot characterize the correlation between the center point and its neighborhoods well. Moreover, different parts may have different local structures for a point cloud, but they just learn a single representation space which is not sufficient and stable. 2) They often capture hierarchical information by heuristic sampling approaches which cannot reveal the spatial relationships of points well to learn global features. To overcome these limitations, we propose a novel hierarchical attentive pooling graph network (HAPGN) which utilizes the gated graph attention network (GGAN) and hierarchical graph pooling module (HiGPool) as building blocks for point cloud segmentation. Specifically, GGAN can highlight not only the importance of different neighbor points but also the importance of different representation spaces to enhance the local feature extraction. HiGPool is a novel pooling module that can capture the spatial layouts of points to learn the hierarchical features adequately. Experimental results on the ShapeNet part dataset and S3DIS dataset show that HAPGN can achieve superior performance over the state-of-the-art segmentation approaches. Furthermore, we also combine our proposed HiGPool with some recent approaches for point cloud classification and achieve better results on the ModelNet40 dataset.
引用
收藏
页码:2335 / 2346
页数:12
相关论文
共 62 条
[1]  
Nguyen A, 2013, PROCEEDINGS OF THE 2013 6TH IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), P225, DOI 10.1109/RAM.2013.6758588
[2]  
[Anonymous], 2018, INT C MACH LEARN
[3]  
[Anonymous], 2006, International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS)
[4]  
[Anonymous], 2014, CORR
[5]  
[Anonymous], 2016, ARXIV160804236
[6]   3D Semantic Parsing of Large-Scale Indoor Spaces [J].
Armeni, Iro ;
Sener, Ozan ;
Zamir, Amir R. ;
Jiang, Helen ;
Brilakis, Ioannis ;
Fischer, Martin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543
[7]   GIFT: A Real-time and Scalable 3D Shape Search Engine [J].
Bai, Song ;
Bai, Xiang ;
Zhou, Zhichao ;
Zhang, Zhaoxiang ;
Latecki, Longin Jan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5023-5032
[8]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[9]  
Bruna J., 2014, P ICLR
[10]   Point Cloud Encoding for 3D Building Model Retrieval [J].
Chen, Jyun-Yuan ;
Lin, Chao-Hung ;
Hsu, Po-Chi ;
Chen, Chung-Hao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (02) :337-345