Object Detection and Monocular Stable Distance Estimation for Road Environments: A Fusion Architecture Using YOLO-RedeCa and Abnormal Jumping Change Filter

被引：3

作者：

Lv, Hejun ^{[1
]}

Du, Yu ^{[1
]}

Ma, Yan ^{[1
]}

Yuan, Ying ^{[1
]}

机构：

[1] Beijing Union Univ, Coll Robot, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 15期

关键词：

automatic driving technique; object detection; monocular distance measurement; Kalman filter;

D O I：

10.3390/electronics13153058

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Enabling rapid and accurate comprehensive environmental perception for vehicles poses a major challenge. Object detection and monocular distance estimation are the two main technologies, though they are often used separately. Thus, it is necessary to strengthen and optimize the interaction between them. Vehicle motion or object occlusions can cause sudden variations in the positions or sizes of detection boxes within temporal data, leading to fluctuations in distance estimates. So, we propose a method to integrate a detector based on YOLOv5-RedeCa, a Bot-Sort tracker and an anomaly jumping change filter. This combination allows for more accurate detection and tracking of objects. The anomaly jump filter smooths distance variations caused by sudden changes in detection box sizes. Our method increases accuracy while reducing computational demands, showing outstanding performance on several datasets. Notably, on the KITTI dataset, the standard deviation of the continuous ranging results remains consistently low, especially in scenarios with multiple object occlusions or disappearances. These results validate our method's effectiveness and precision in managing dual tasks.

引用

页数：20

共 51 条

[1] Aharon N, 2022, Arxiv, DOI [arXiv:2206.14651, DOI 10.48550/ARXIV.2206.14651]
[2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]
[3] Dai JF, 2016, ADV NEUR IN, V29
[4] Object detection using YOLO: challenges, architectural successors, datasets and applications
Diwan, Tausif
Anirudh, G.
Tembhurne, Jitendra, V
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 9243 - 9275
[5] Eigen D, 2014, ADV NEUR IN, V27
[6] Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture
Eigen, David
Fergus, Rob
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2650 - 2658
[7] The PASCAL Visual Object Classes Challenge: A Retrospective
Everingham, Mark
Eslami, S. M. Ali
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
[8] A geometric approach to shape from defocus
Favaro, P
Soatto, S
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (03) : 406 - 417
[9] Fu C.-Y., 2017, arXiv
[10] TMSO-Net: Texture adaptive multi-scale observation for light field image depth estimation
Fu, Congrui
Yuan, Hui
Xu, Hongji
Zhang, Hao
Shen, Liquan
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90

← 1 2 3 4 5 6 →