CAMRL: A Joint Method of Channel Attention and Multidimensional Regression Loss for 3D Object Detection in Automated Vehicles

被引：36

作者：

Gao, Honghao ^{[1
,2
]}

Fang, Danqing ^{[1
]}

Xiao, Junsheng ^{[1
]}

Hussain, Walayat ^{[3
]}

Kim, Jung Yoon ^{[2
]}

机构：

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

[2] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea

[3] Victoria Univ, Victoria Univ Business Sch, Melbourne, Vic 3011, Australia

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 08期

关键词：

Three-dimensional displays; Object detection; Cameras; Estimation; Roads; Solid modeling; Detectors; Automated vehicle; 3D object detection; depth estimation; multidimensional regression loss; channel attention; FOCAL LOSS; VISION;

D O I：

10.1109/TITS.2022.3219474

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Fully automated vehicles collect information about their road environments to adjust their driving actions, such as braking and slowing down. The development of artificial intelligence (AI) and the Internet of Things (IoT) has improved the cognitive abilities of vehicles, allowing them to detect traffic signs, pedestrians, and obstacles for increasing the intelligence of these transportation systems. Three-dimensional (3D) object detection in front-view images taken by vehicle cameras is important for both object detection and depth estimation. In this paper, a joint channel attention and multidimensional regression loss method for 3D object detection in automated vehicles (called CAMRL) is proposed to improve the average precision of 3D object detection by focusing on the model's ability to infer the locations and sizes of objects. First, channel attention is introduced to effectively learn the yaw angles from the road images captured by vehicle cameras. Second, a multidimensional regression loss algorithm is designed to further optimize the size and position parameters during the training process. Third, the intrinsic parameters of the camera and the depth estimate of the model are combined to reduce the object depth computation error, allowing us to calculate the distance between an object and the camera after the object's size is confirmed. As a result, objects are detected, and their depth estimations are validated. Then, the vehicle can determine when and how to stop if an object is nearby. Finally, experiments conducted on the KITTI dataset demonstrate that our method is effective and performs better than other baseline methods, especially in terms of 3D object detection and bird's-eye view (BEV) evaluation.

引用

页码：8831 / 8845

页数：15

共 50 条

[41] PIFNet: 3D Object Detection Using Joint Image and Point Cloud Features for Autonomous Driving
Zheng, Wenqi
Xie, Han
Chen, Yunfan
Roh, Jeongjin
Shin, Hyunchul
APPLIED SCIENCES-BASEL, 2022, 12 (07):
[42] Range-Aware Attention Network for LiDAR-Based 3D Object Detection With Auxiliary Point Density Level Estimation
Lu, Yantao
Hao, Xuetao
Li, Yilan
Chai, Weiheng
Sun, Shiqi
Velipasalar, Senem
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 292 - 305
[43] YOLOv7-3D: A Monocular 3D Traffic Object Detection Method from a Roadside Perspective
Ye, Zixun
Zhang, Hongying
Gu, Jingliang
Li, Xue
APPLIED SCIENCES-BASEL, 2023, 13 (20):
[44] Repair Method of Data Loss in Weld Surface Defect Detection Based on Light Intensity and 3D Geometry
Jiang, Shiyi
Li, Xuexing
Xing, Yanfeng
IEEE ACCESS, 2020, 8 : 205814 - 205820
[45] Automated, Depth Sensor Based Object Detection and Path Planning for Robot-Aided 3D Scanning
Ziegler, Jakob
Gattringer, Hubert
Kaserer, Dominik
Mueller, Andreas
ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2018, 49 : 336 - 343
[46] 3D Object Detection Method for Autonomous Vehicle Based on Sparse Color Point Cloud
Luo Y.
Qin H.
Qiche Gongcheng/Automotive Engineering, 2021, 43 (04): : 492 - 500
[47] Incorporating Spatio-Temporal Information in Frustum-ConvNet for Improved 3D Object Detection in Instrumented Vehicles
Venkatesh, G. M.
O'Connor, Noel E.
Little, Suzanne
2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP), 2022,
[48] Polar-view based object detection algorithm using 3D low-channel lidar
Kwon S.-S.
Park T.-H.
Journal of Institute of Control, Robotics and Systems, 2019, 25 (01) : 56 - 62
[49] 3D Space Object and Road Detection for Autonomous Vehicles Using Monocular Camera Images and Deep Learning Algorithms
Kim, Gi-Woong
Kang, Jae-Young
Journal of Institute of Control, Robotics and Systems, 2024, 30 (07) : 677 - 684
[50] Attention-Guided Feature Extraction and Multiscale Feature Fusion 3D ResNet for Automated Pulmonary Nodule Detection
Zhang, Guanglu
Zhang, Hongjun
Yao, Yuhua
Shen, Qiuhui
IEEE ACCESS, 2022, 10 : 61530 - 61543

← 1 2 3 4 5 →