CAMRL: A Joint Method of Channel Attention and Multidimensional Regression Loss for 3D Object Detection in Automated Vehicles

Cited by: 36

Authors:

Gao, Honghao [1,2]

Fang, Danqing [1]

Xiao, Junsheng [1]

Hussain, Walayat [3]

Kim, Jung Yoon [2]

Affiliations:

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

[2] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea

[3] Victoria Univ, Victoria Univ Business Sch, Melbourne, Vic 3011, Australia

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023 / Volume 24 / Issue 08

Keywords:

Three-dimensional displays; Object detection; Cameras; Estimation; Roads; Solid modeling; Detectors; Automated vehicle; 3D object detection; depth estimation; multidimensional regression loss; channel attention; FOCAL LOSS; VISION;

DOI:

10.1109/TITS.2022.3219474

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

Fully automated vehicles collect information about their road environments to adjust their driving actions, such as braking and slowing down. The development of artificial intelligence (AI) and the Internet of Things (IoT) has improved the cognitive abilities of vehicles, allowing them to detect traffic signs, pedestrians, and obstacles for increasing the intelligence of these transportation systems. Three-dimensional (3D) object detection in front-view images taken by vehicle cameras is important for both object detection and depth estimation. In this paper, a joint channel attention and multidimensional regression loss method for 3D object detection in automated vehicles (called CAMRL) is proposed to improve the average precision of 3D object detection by focusing on the model's ability to infer the locations and sizes of objects. First, channel attention is introduced to effectively learn the yaw angles from the road images captured by vehicle cameras. Second, a multidimensional regression loss algorithm is designed to further optimize the size and position parameters during the training process. Third, the intrinsic parameters of the camera and the depth estimate of the model are combined to reduce the object depth computation error, allowing us to calculate the distance between an object and the camera after the object's size is confirmed. As a result, objects are detected, and their depth estimations are validated. Then, the vehicle can determine when and how to stop if an object is nearby. Finally, experiments conducted on the KITTI dataset demonstrate that our method is effective and performs better than other baseline methods, especially in terms of 3D object detection and bird's-eye view (BEV) evaluation.
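
The third step of the abstract combines camera intrinsics with an estimated object size to obtain depth. The abstract does not give the formula, so the following is only a minimal sketch of the standard pinhole-camera relation (depth = focal length × real height / image height), not the CAMRL implementation. The function name `depth_from_height` and the numeric values are illustrative assumptions.

```python
def depth_from_height(focal_length_px: float,
                      object_height_m: float,
                      bbox_height_px: float) -> float:
    """Estimate object depth (metres) with the pinhole camera model.

    Assumption: the object is roughly fronto-parallel, so its real height H
    projects to h pixels with h = f * H / z, hence z = f * H / h.
    This is a textbook relation, not the formula used in the paper.
    """
    if bbox_height_px <= 0:
        raise ValueError("bbox_height_px must be positive")
    return focal_length_px * object_height_m / bbox_height_px


if __name__ == "__main__":
    # Hypothetical values: 720 px focal length, 1.5 m tall car, 54 px tall box.
    print(depth_from_height(720.0, 1.5, 54.0))
```

Under this relation, an error in the estimated object height carries over proportionally to the depth, which is consistent with the abstract's point that distance is calculated only after the object's size is confirmed.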

Pages: 8831-8845

Number of pages: 15
