DO-SA&R: Distant Object Augmented Set Abstraction and Regression for Point-Based 3D Object Detection

Cited by: 1
Authors
He, Xuan [1]
Wang, Zian [1]
Lin, Jiacheng [1]
Nai, Ke [2]
Yuan, Jin [1]
Li, Zhiyong [1, 3, 4]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
[3] Hunan Univ, Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China
[4] Hunan Univ, Sch Robot, Changsha 410012, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Training; Detectors; Autonomous vehicles; Point-based 3D object detection; scene understanding; autonomous driving;
DOI
10.1109/TIP.2023.3326394
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Point-based 3D detection approaches usually suffer from a severe point-sampling imbalance between foreground and background. Prior works have attempted to alleviate this imbalance by emphasizing foreground sampling. However, even adequate foreground sampling may be extremely unbalanced between nearby and distant objects, yielding unsatisfactory performance in detecting distant objects. To tackle this issue, this paper proposes a novel method named Distant Object Augmented Set Abstraction and Regression (DO-SA&R) to enhance distant object detection, which is vital for the timely response of decision-making systems such as autonomous driving. Technically, our approach first designs DO-SA with a novel distant object augmented farthest point sampling (DO-FPS) to emphasize sampling on distant objects by leveraging both object-dependent and depth-dependent information. Then, we propose distant object augmented regression, which reweights all the instance boxes to strengthen regression training on distant objects. In practice, the proposed DO-SA&R can be easily embedded into existing modules, yielding consistent performance improvements, especially in detecting distant objects. Extensive experiments are conducted on the popular KITTI, nuScenes and Waymo datasets, and DO-SA&R demonstrates superior performance, especially for distant object detection. Our code is available at https://github.com/mikasa3lili/DO-SAR.
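The abstract does not give the DO-FPS formula, so the following is only a minimal sketch of the general idea of weighted farthest point sampling, not the authors' implementation. The function names `do_weights` and `weighted_fps`, the use of a per-point foreground score, and the specific weighting `(1 + fg_score) * (1 + depth / max_depth)` are all assumptions made for illustration.

```python
import numpy as np


def do_weights(points, fg_score):
    """Hypothetical per-point weight combining an object-dependent term
    (foreground score in [0, 1]) and a depth-dependent term (horizontal range).
    This formula is an illustrative assumption, not the paper's."""
    depth = np.linalg.norm(points[:, :2], axis=1)
    return (1.0 + fg_score) * (1.0 + depth / depth.max())


def weighted_fps(points, weights, k):
    """Farthest point sampling in which each point's squared distance to the
    already-selected set is scaled by its weight before taking the argmax."""
    n = points.shape[0]
    selected = np.empty(k, dtype=np.int64)
    min_dist = np.full(n, np.inf)
    # Assumption: seed with the highest-weight point.
    selected[0] = int(np.argmax(weights))
    for i in range(1, k):
        last = points[selected[i - 1]]
        dist = np.sum((points - last) ** 2, axis=1)
        min_dist = np.minimum(min_dist, dist)
        # Selected points have min_dist == 0, so they are not picked again.
        selected[i] = int(np.argmax(min_dist * weights))
    return selected


rng = np.random.default_rng(0)
pts = rng.normal(size=(100, 3))      # synthetic point cloud
fg = rng.uniform(size=100)           # synthetic foreground scores
w = do_weights(pts, fg)
idx = weighted_fps(pts, w, 16)
```

With uniform weights this reduces to ordinary farthest point sampling; larger weights on distant or foreground points make them more likely to be kept at each step.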
Pages: 5852-5864
Number of pages: 13
Related papers
50 in total
  • [21] Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection
    Liu, Zhanwen
    Cheng, Juanru
    Fan, Jin
    Lin, Shan
    Wang, Yang
    Zhao, Xiangmo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 707 - 717
  • [22] Transformer-Based Optimized Multimodal Fusion for 3D Object Detection in Autonomous Driving
    Alaba, Simegnew Yihunie
    Ball, John E.
    IEEE ACCESS, 2024, 12 : 50165 - 50176
  • [23] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
    Zhang, Xinyu
    Wang, Li
    Zhang, Guoxin
    Lan, Tianwei
    Zhang, Haoming
    Zhao, Lijun
    Li, Jun
    Zhu, Lei
    Liu, Huaping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [24] Multimodal 3D Object Detection Based on Sparse Interaction in Internet of Vehicles
    Li, Hui
    Ge, Tongao
    Bai, Keqiang
    Nie, Gaofeng
    Xu, Lingwei
    Ai, Xiaoxue
    Cao, Song
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 2174 - 2186
  • [25] VRVP: Valuable Region and Valuable Point Anchor-Free 3D Object Detection
    Deng, Pengzhen
    Zhou, Li
    Chen, Jie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 33 - 40
  • [26] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shaoshuai Shi
    Li Jiang
    Jiajun Deng
    Zhe Wang
    Chaoxu Guo
    Jianping Shi
    Xiaogang Wang
    Hongsheng Li
    International Journal of Computer Vision, 2023, 131 : 531 - 551
  • [27] Local-to-Global Semantic Learning for Multi-View 3D Object Detection From Point Cloud
    Qiao, Renzhong
    Ji, Hongbing
    Zhu, Zhigang
    Zhang, Wenbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9371 - 9385
  • [28] CL3D: Camera-LiDAR 3D Object Detection With Point Feature Enhancement and Point-Guided Fusion
    Lin, Chunmian
    Tian, Daxin
    Duan, Xuting
    Zhou, Jianshan
    Zhao, Dezong
    Cao, Dongpu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18040 - 18050
  • [29] Enhancing Grid-Based 3D Object Detection in Autonomous Driving With Improved Dimensionality Reduction
    Huang, Dihe
    Chen, Ying
    Ding, Yikang
    Liu, Yong
    Nie, Qiang
    Wang, Chengjie
    Li, Zhiheng
    IEEE ACCESS, 2023, 11 : 35243 - 35254
  • [30] AGO-Net: Association-Guided 3D Point Cloud Object Detection Network
    Du, Liang
    Ye, Xiaoqing
    Tan, Xiao
    Johns, Edward
    Chen, Bo
    Ding, Errui
    Xue, Xiangyang
    Feng, Jianfeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8097 - 8109