3DSAC: Size Adaptive Clustering for 3D object detection in point clouds

被引:7
作者
Yu, Hang [1 ]
Su, Jinhe [1 ]
Cai, Guorong [1 ]
Piao, Yingchao [2 ]
Liu, Niansheng [1 ]
Huang, Min [1 ]
机构
[1] Jimei Univ, Sch Comp Engn, Xiamen 361021, Peoples R China
[2] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Point cloud; Hough voting; Size adaptive; Relation information;
D O I
10.1016/j.jag.2023.103231
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
3D object detection is important for various indoor applications to understand the environment. Previous voting-based methods voted on the center of each seed point, which may suffer from errors from background points or adjacent objects. And the size-fixed feature grouping module is unsuitable for indoor objects with variable sizes. In this paper, we propose a Size Adaptive Clustering method for 3D object detection in point clouds . First, we present a super-voting module to divide seed points into foreground and background points and perform enhanced voting on the foreground seeds. To create a good match for the feature clustering area and the size of an object, we design a size-adaptive clustering module to infer a clustering radius based on the seed-to-vote displacement offset. Finally, because indoor objects are highly related to spatial room layouts, a position-aware module is used to calculate aware weights among objects and enhance the features of occluded objects. Experiments show that our method outperforms VoteNet by a large margin on ScanNet V2 (mAP@0.250 +8.3%, mAP@0.50 +14.2%) and SUN RGB-D (mAP@0.250 +3.5%, mAP@0.50 +13.6%). The proposed method can detect indoor objects with variable sizes in high accuracy, and perform robustly in case of occluded objects. The code of 3DSAC will be available at github-3DSAC.
引用
收藏
页数:13
相关论文
共 35 条
[1]  
[Anonymous], 2017, NIPS, DOI [10.48550/arXiv.1706.02413, DOI 10.48550/ARXIV.1706.02413]
[2]   A Hierarchical Graph Network for 3D Object Detection on Point Clouds [J].
Chen, Jintai ;
Lei, Biwen ;
Song, Qingyu ;
Ying, Haochao ;
Chen, Danny Z. ;
Wu, Jian .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :389-398
[3]   Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds [J].
Cheng, Bowen ;
Sheng, Lu ;
Shi, Shaoshuai ;
Yang, Ming ;
Xu, Dong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8959-8968
[4]   Zillow Indoor Dataset: Annotated Floor Plans With 360° Panoramas and 3D Room Layouts [J].
Cruz, Steve ;
Hutchcroft, Will ;
Li, Yuguang ;
Khosravan, Naji ;
Boyadzhiev, Ivaylo ;
Kang, Sing Bing .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2133-2143
[5]   ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].
Dai, Angela ;
Chang, Angel X. ;
Savva, Manolis ;
Halber, Maciej ;
Funkhouser, Thomas ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443
[6]  
Ding ZP, 2019, LECT NOTES COMPUT SC, V11766, P202, DOI [10.1007/978-3-030-32248-9, 10.1007/978-3-030-32248-9_23]
[7]  
Duan Y, 2022, P IEEECVF C COMPUTER, P16980
[8]   Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts [J].
Hou, Ji ;
Graham, Benjamin ;
Niesner, Matthias ;
Xie, Saining .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15582-15592
[9]  
Ku J, 2018, IEEE INT C INT ROBOT, P5750, DOI 10.1109/IROS.2018.8594049
[10]   Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds [J].
Kuang, Hongwu ;
Wang, Bei ;
An, Jianping ;
Zhang, Ming ;
Zhang, Zehan .
SENSORS, 2020, 20 (03)