Object Detection and Monocular Stable Distance Estimation for Road Environments: A Fusion Architecture Using YOLO-RedeCa and Abnormal Jumping Change Filter

被引:3
作者
Lv, Hejun [1 ]
Du, Yu [1 ]
Ma, Yan [1 ]
Yuan, Ying [1 ]
机构
[1] Beijing Union Univ, Coll Robot, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
关键词
automatic driving technique; object detection; monocular distance measurement; Kalman filter;
D O I
10.3390/electronics13153058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enabling rapid and accurate comprehensive environmental perception for vehicles poses a major challenge. Object detection and monocular distance estimation are the two main technologies, though they are often used separately. Thus, it is necessary to strengthen and optimize the interaction between them. Vehicle motion or object occlusions can cause sudden variations in the positions or sizes of detection boxes within temporal data, leading to fluctuations in distance estimates. So, we propose a method to integrate a detector based on YOLOv5-RedeCa, a Bot-Sort tracker and an anomaly jumping change filter. This combination allows for more accurate detection and tracking of objects. The anomaly jump filter smooths distance variations caused by sudden changes in detection box sizes. Our method increases accuracy while reducing computational demands, showing outstanding performance on several datasets. Notably, on the KITTI dataset, the standard deviation of the continuous ranging results remains consistently low, especially in scenarios with multiple object occlusions or disappearances. These results validate our method's effectiveness and precision in managing dual tasks.
引用
收藏
页数:20
相关论文
共 51 条
  • [1] Aharon N, 2022, Arxiv, DOI [arXiv:2206.14651, DOI 10.48550/ARXIV.2206.14651]
  • [2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]
  • [3] Dai JF, 2016, ADV NEUR IN, V29
  • [4] Object detection using YOLO: challenges, architectural successors, datasets and applications
    Diwan, Tausif
    Anirudh, G.
    Tembhurne, Jitendra, V
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 9243 - 9275
  • [5] Eigen D, 2014, ADV NEUR IN, V27
  • [6] Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture
    Eigen, David
    Fergus, Rob
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2650 - 2658
  • [7] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
  • [8] A geometric approach to shape from defocus
    Favaro, P
    Soatto, S
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (03) : 406 - 417
  • [9] Fu C.-Y., 2017, arXiv
  • [10] TMSO-Net: Texture adaptive multi-scale observation for light field image depth estimation
    Fu, Congrui
    Yuan, Hui
    Xu, Hongji
    Zhang, Hao
    Shen, Liquan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90