Object Detection and Monocular Stable Distance Estimation for Road Environments: A Fusion Architecture Using YOLO-RedeCa and Abnormal Jumping Change Filter

被引:3
作者
Lv, Hejun [1 ]
Du, Yu [1 ]
Ma, Yan [1 ]
Yuan, Ying [1 ]
机构
[1] Beijing Union Univ, Coll Robot, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
关键词
automatic driving technique; object detection; monocular distance measurement; Kalman filter;
D O I
10.3390/electronics13153058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enabling rapid and accurate comprehensive environmental perception for vehicles poses a major challenge. Object detection and monocular distance estimation are the two main technologies, though they are often used separately. Thus, it is necessary to strengthen and optimize the interaction between them. Vehicle motion or object occlusions can cause sudden variations in the positions or sizes of detection boxes within temporal data, leading to fluctuations in distance estimates. So, we propose a method to integrate a detector based on YOLOv5-RedeCa, a Bot-Sort tracker and an anomaly jumping change filter. This combination allows for more accurate detection and tracking of objects. The anomaly jump filter smooths distance variations caused by sudden changes in detection box sizes. Our method increases accuracy while reducing computational demands, showing outstanding performance on several datasets. Notably, on the KITTI dataset, the standard deviation of the continuous ranging results remains consistently low, especially in scenarios with multiple object occlusions or disappearances. These results validate our method's effectiveness and precision in managing dual tasks.
引用
收藏
页数:20
相关论文
共 51 条
  • [21] DepthTransfer: Depth Extraction from Video Using Non-Parametric Sampling
    Karsch, Kevin
    Liu, Ce
    Kang, Sing Bing
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (11) : 2144 - 2158
  • [22] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [23] Semi-Supervised Deep Learning for Monocular Depth Map Prediction
    Kuznietsov, Yevhen
    Stuckle, Jorg
    Leibe, Bastian
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2215 - 2223
  • [24] Li CY, 2022, Arxiv, DOI [arXiv:2209.02976, DOI 10.48550/ARXIV.2209.02976]
  • [25] Fast-BEV: A Fast and Strong Birds-Eye View Perception Baseline
    Li, Yangguang
    Huang, Bin
    Chen, Zeren
    Cui, Yufeng
    Liang, Feng
    Shen, Mingzhu
    Liu, Fenggang
    Xie, Enze
    Sheng, Lu
    Ouyang, Wanli
    Shao, Jing
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8665 - 8679
  • [26] DepthFormer: Exploiting Long-range Correlation and Local Information for Accurate Monocular Depth Estimation
    Li, Zhenyu
    Chen, Zehui
    Liu, Xianming
    Jiang, Junjun
    [J]. MACHINE INTELLIGENCE RESEARCH, 2023, 20 (06) : 837 - 854
  • [27] Li ZX, 2024, Arxiv, DOI arXiv:1712.00960
  • [28] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
  • [29] Single Image Depth Estimation From Predicted Semantic Labels
    Liu, Beyang
    Gould, Stephen
    Koller, Daphne
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1253 - 1260
  • [30] DPDFormer: A Coarse-to-Fine Model for Monocular Depth Estimation
    Liu, Chunpu
    Yang, Guanglei
    Zuo, Wangmeng
    Zang, Tianyi
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (05)