Occlusion Is Underrated: An Occlusion Attention Strategy Assembled in 3-D Object Detectors

Cited by: 1

Authors:

He, Yufei [1]

Wu, Yan [1]

Mo, Yujian [1]

Hu, Yinghao [1]

Zhang, Yuwei [1]

Wang, Jijun [1]

Affiliations:

[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China

Source:

IEEE SENSORS JOURNAL | 2024 / Vol. 24 / No. 10

Keywords:

Uncertainty; Point cloud compression; Three-dimensional displays; Data augmentation; Task analysis; Feature extraction; Detectors; 3-D object detection; data augmentation; occluded scenes; uncertainty estimation;

DOI:

10.1109/JSEN.2024.3384401

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808; 0809

Abstract:

LiDAR sensors provide rich geometric information for 3-D scene understanding, and LiDAR point clouds are widely used as the sole input for 3-D object detection. However, owing to the intrinsic properties of the sensor, point clouds scanned by LiDAR are always sparse and incomplete, and objects are occluded to different extents, which degrades detection accuracy. Existing methods either overlook occlusion or tackle it only implicitly. In this article, we emphasize the universality of occlusion in point clouds and propose a novel occlusion-attention strategy, which aims to increase the model's sensitivity to occlusion and maintain strong performance in occluded scenes. The proposed method simulates different types and levels of occlusion and explores the relationship between the uncertainty caused by occlusion and the prediction distribution. The major changes are: 1) data augmentation designed specifically for occluded scenes, which forces the feature extractor to learn effective features despite the damage, and 2) an uncertainty estimation module that models each prediction as a distribution instead of a deterministic label. We incorporate the proposed methods into various classical 3-D base detectors and demonstrate performance gains on the KITTI dataset, which demonstrates the particularity of occlusion structure and the necessity of uncertainty estimation.
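The two components named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify how occlusion is simulated or which distribution is predicted. The sketch assumes a sensor at the origin, an axis-aligned object box, and a Gaussian prediction with a learned log-variance (the form used in the uncertainty-regression literature the record cites, such as Kendall and Gal). The names `simulate_occlusion` and `gaussian_nll` are hypothetical.

```python
import numpy as np


def simulate_occlusion(points, box_min, box_max, occluded_fraction):
    """Drop the points of one object that fall behind a simulated occluder.

    Assumptions (not from the paper): the sensor sits at the origin, the
    object box is axis-aligned, and the object does not straddle the
    +/-pi azimuth wrap-around.
    """
    points = np.asarray(points, dtype=float)
    xyz = points[:, :3]
    in_box = np.all((xyz >= np.asarray(box_min)) & (xyz <= np.asarray(box_max)), axis=1)
    if occluded_fraction <= 0 or not in_box.any():
        return points
    # Azimuth of every point as seen from the sensor.
    azimuth = np.arctan2(xyz[:, 1], xyz[:, 0])
    lo, hi = azimuth[in_box].min(), azimuth[in_box].max()
    # The occluder hides the lowest `occluded_fraction` of the object's azimuth span.
    cutoff = lo + occluded_fraction * (hi - lo)
    occluded = in_box & (azimuth < cutoff)
    return points[~occluded]


def gaussian_nll(pred_mean, pred_log_var, target):
    """Per-element Gaussian negative log-likelihood (constant term dropped).

    Predicting a log-variance alongside the mean lets the model report
    higher uncertainty where a regression target is hard to estimate.
    """
    pred_mean = np.asarray(pred_mean, dtype=float)
    pred_log_var = np.asarray(pred_log_var, dtype=float)
    target = np.asarray(target, dtype=float)
    return 0.5 * (np.exp(-pred_log_var) * (target - pred_mean) ** 2 + pred_log_var)


if __name__ == "__main__":
    cloud = np.array([
        [10.0, -1.0, 0.0],
        [10.0, -0.5, 0.0],
        [10.0, 0.5, 0.0],
        [10.0, 1.0, 0.0],
        [20.0, 5.0, 0.0],  # background point outside the object box
    ])
    kept = simulate_occlusion(cloud, [9, -2, -1], [11, 2, 1], occluded_fraction=0.5)
    print(kept.shape)
    print(gaussian_nll(0.0, 0.0, 2.0), gaussian_nll(0.0, np.log(4.0), 2.0))
```

In this sketch, a larger predicted variance lowers the penalty on a large residual but adds a log-variance cost, so the model cannot simply predict high uncertainty everywhere.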


Pages: 16502-16509

Number of pages: 8

References (30 in total)

[1] TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers [J]. Bai, Xuyang; Hu, Zeyu; Zhu, Xinge; Huang, Qingqiu; Chen, Yilun; Fu, Hangbo; Tai, Chiew-Lan. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 1080-1089

[2] Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving [J]. Choi, Jiwoong; Chun, Dayoung; Kim, Hyun; Lee, Hyuk-Jae. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 502-511

[3] Dong ZC, 2024, Arxiv, DOI arXiv:2310.07591

[4] AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [J]. Du, Liang; Ye, Xiaoqing; Tan, Xiao; Johns, Edward; Chen, Bo; Ding, Errui; Xue, Xiangyang; Feng, Jianfeng. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11): 8097-8109

[5] InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting [J]. Fang, Hao-Shu; Sun, Jianhua; Wang, Runzhong; Gou, Minghao; Li, Yong-Lu; Lu, Cewu. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 682-691

[6] Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation [J]. Follmann, Patrick; Koenig, Rebecca; Haertinger, Philipp; Klostermann, Michael; Boettger, Tobias. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019: 1328-1336

[7] Harakeh A, 2020, IEEE INT CONF ROBOT, P87, DOI 10.1109/ICRA40945.2020.9196544

[8] Bounding Box Regression with Uncertainty for Accurate Object Detection [J]. He, Yihui; Zhu, Chenchen; Wang, Jianren; Savvides, Marios; Zhang, Xiangyu. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 2883-2892

[9] Hu PY, 2020, PROC CVPR IEEE, P10998, DOI 10.1109/CVPR42600.2020.01101

[10] Kendall Alex, 2017, Advances in Neural Information Processing Systems, V30
