DO-SA&R: Distant Object Augmented Set Abstraction and Regression for Point-Based 3D Object Detection

被引：1

作者：

He, Xuan ^{[1
]}

Wang, Zian ^{[1
]}

Lin, Jiacheng ^{[1
]}

Nai, Ke ^{[2
]}

Yuan, Jin ^{[1
]}

Li, Zhiyong ^{[1
,3
,4
]}

机构：

[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China

[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China

[3] Hunan Univ, Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China

[4] Hunan Univ, Sch Robot, Changsha 410012, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Training; Detectors; Autonomous vehicles; Point-based 3D object detection; scene understanding; autonomous driving;

D O I：

10.1109/TIP.2023.3326394

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Point-based 3D detection approaches usually suffer from the severe point sampling imbalance problem between foreground and background. We observe that prior works have attempted to alleviate this imbalance by emphasizing foreground sampling. However, even adequate foreground sampling may be extremely unbalanced between nearby and distant objects, yielding unsatisfactory performance in detecting distant objects. To tackle this issue, this paper first proposes a novel method named Distant Object Augmented Set Abstraction and Regression (DO-SA&R) to enhance distant object detection, which is vital for the timely response of decision-making systems like autonomous driving. Technically, our approach first designs DO-SA with novel distant object augmented farthest point sampling (DO-FPS) to emphasize sampling on distant objects by leveraging both object-dependent and depth-dependent information. Then, we propose distant object augmented regression to reweight all the instance boxes for strengthening regression training on distant objects. In practice, the proposed DO-SA&R can be easily embedded into the existing modules, yielding consistent performance improvements, especially on detecting distant objects. Extensive experiments are conducted on the popular KITTI, nuScenes and Waymo datasets, and DO-SA&R demonstrates superior performance, especially for distant object detection. Our code is available at https://github.com/mikasa3lili/DO-SAR.

引用

页码：5852 / 5864

页数：13

共 50 条

[21] Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection
Liu, Zhanwen
Cheng, Juanru
Fan, Jin
Lin, Shan
Wang, Yang
Zhao, Xiangmo
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 707 - 717
[22] Transformer-Based Optimized Multimodal Fusion for 3D Object Detection in Autonomous Driving
Alaba, Simegnew Yihunie
Ball, John E.
IEEE ACCESS, 2024, 12 : 50165 - 50176
[23] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
Zhang, Xinyu
Wang, Li
Zhang, Guoxin
Lan, Tianwei
Zhang, Haoming
Zhao, Lijun
Li, Jun
Zhu, Lei
Liu, Huaping
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[24] Multimodal 3D Object Detection Based on Sparse Interaction in Internet of Vehicles
Li, Hui
Ge, Tongao
Bai, Keqiang
Nie, Gaofeng
Xu, Lingwei
Ai, Xiaoxue
Cao, Song
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 2174 - 2186
[25] VRVP: Valuable Region and Valuable Point Anchor-Free 3D Object Detection
Deng, Pengzhen
Zhou, Li
Chen, Jie
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 33 - 40
[26] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Hongsheng Li
International Journal of Computer Vision, 2023, 131 : 531 - 551
[27] Local-to-Global Semantic Learning for Multi-View 3D Object Detection From Point Cloud
Qiao, Renzhong
Ji, Hongbing
Zhu, Zhigang
Zhang, Wenbo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9371 - 9385
[28] CL3D: Camera-LiDAR 3D Object Detection With Point Feature Enhancement and Point-Guided Fusion
Lin, Chunmian
Tian, Daxin
Duan, Xuting
Zhou, Jianshan
Zhao, Dezong
Cao, Dongpu
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18040 - 18050
[29] Enhancing Grid-Based 3D Object Detection in Autonomous Driving With Improved Dimensionality Reduction
Huang, Dihe
Chen, Ying
Ding, Yikang
Liu, Yong
Nie, Qiang
Wang, Chengjie
Li, Zhiheng
IEEE ACCESS, 2023, 11 : 35243 - 35254
[30] AGO-Net: Association-Guided 3D Point Cloud Object Detection Network
Du, Liang
Ye, Xiaoqing
Tan, Xiao
Johns, Edward
Chen, Bo
Ding, Errui
Xue, Xiangyang
Feng, Jianfeng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8097 - 8109

← 1 2 3 4 5 →