CAMRL: A Joint Method of Channel Attention and Multidimensional Regression Loss for 3D Object Detection in Automated Vehicles

Cited by: 36
Authors
Gao, Honghao [1, 2]
Fang, Danqing [1 ]
Xiao, Junsheng [1 ]
Hussain, Walayat [3 ]
Kim, Jung Yoon [2 ]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea
[3] Victoria Univ, Victoria Univ Business Sch, Melbourne, Vic 3011, Australia
Keywords
Three-dimensional displays; Object detection; Cameras; Estimation; Roads; Solid modeling; Detectors; Automated vehicle; 3D object detection; depth estimation; multidimensional regression loss; channel attention; FOCAL LOSS; VISION
DOI
10.1109/TITS.2022.3219474
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Fully automated vehicles collect information about their road environments to adjust their driving actions, such as braking and slowing down. The development of artificial intelligence (AI) and the Internet of Things (IoT) has improved the cognitive abilities of vehicles, allowing them to detect traffic signs, pedestrians, and obstacles for increasing the intelligence of these transportation systems. Three-dimensional (3D) object detection in front-view images taken by vehicle cameras is important for both object detection and depth estimation. In this paper, a joint channel attention and multidimensional regression loss method for 3D object detection in automated vehicles (called CAMRL) is proposed to improve the average precision of 3D object detection by focusing on the model's ability to infer the locations and sizes of objects. First, channel attention is introduced to effectively learn the yaw angles from the road images captured by vehicle cameras. Second, a multidimensional regression loss algorithm is designed to further optimize the size and position parameters during the training process. Third, the intrinsic parameters of the camera and the depth estimate of the model are combined to reduce the object depth computation error, allowing us to calculate the distance between an object and the camera after the object's size is confirmed. As a result, objects are detected, and their depth estimations are validated. Then, the vehicle can determine when and how to stop if an object is nearby. Finally, experiments conducted on the KITTI dataset demonstrate that our method is effective and performs better than other baseline methods, especially in terms of 3D object detection and bird's-eye view (BEV) evaluation.
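The abstract's third step, recovering an object's distance from the camera intrinsics once its size is known, can be illustrated with the standard pinhole-camera relation z = f * H / h. The snippet below is only a minimal sketch of that relation, not the CAMRL implementation: the function name `depth_from_height` and all numeric values are hypothetical and chosen for illustration.

```python
def depth_from_height(focal_px: float, real_height_m: float, pixel_height: float) -> float:
    """Pinhole-camera depth estimate: z = f * H / h.

    focal_px      -- focal length in pixels (a camera intrinsic parameter)
    real_height_m -- physical object height in metres (e.g. a predicted 3D box height)
    pixel_height  -- height of the object's projection in the image, in pixels
    """
    if pixel_height <= 0:
        raise ValueError("pixel_height must be positive")
    return focal_px * real_height_m / pixel_height


# Illustrative values only (assumptions, not taken from the paper or from KITTI).
focal_px = 720.0       # hypothetical focal length in pixels
car_height_m = 1.5     # hypothetical estimated object height
box_height_px = 54.0   # hypothetical projected height in the image

depth_m = depth_from_height(focal_px, car_height_m, box_height_px)
print(f"estimated depth: {depth_m:.1f} m")
```

Under this relation, any error in the estimated object height carries over proportionally into the depth, which is consistent with the abstract's emphasis on optimizing the size parameters before computing distance.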
Pages: 8831-8845
Number of pages: 15