Object Detection and Monocular Stable Distance Estimation for Road Environments: A Fusion Architecture Using YOLO-RedeCa and Abnormal Jumping Change Filter

Cited by: 3

Authors:

Lv, Hejun [1]

Du, Yu [1]

Ma, Yan [1]

Yuan, Ying [1]

Affiliations:

[1] Beijing Union Univ, Coll Robot, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China

Source:

ELECTRONICS | 2024, Vol. 13, Issue 15

Keywords:

automatic driving technique; object detection; monocular distance measurement; Kalman filter

DOI:

10.3390/electronics13153058

Chinese Library Classification (CLC) number:

TP [Automation and Computer Technology]

Discipline classification code:

0812

Abstract:

Enabling rapid, accurate, and comprehensive environmental perception for vehicles is a major challenge. Object detection and monocular distance estimation are the two main technologies involved, but they are often used separately, so the interaction between them needs to be strengthened and optimized. Vehicle motion or object occlusion can cause sudden variations in the positions or sizes of detection boxes across frames, leading to fluctuations in distance estimates. We therefore propose a method that integrates a detector based on YOLOv5-RedeCa, a BoT-SORT tracker, and an abnormal jumping change filter. This combination allows more accurate detection and tracking of objects, while the abnormal jumping change filter smooths distance variations caused by sudden changes in detection box size. Our method increases accuracy while reducing computational demands, and performs strongly on several datasets. Notably, on the KITTI dataset, the standard deviation of the continuous ranging results remains consistently low, especially in scenarios with multiple object occlusions or disappearances. These results validate the method's effectiveness and precision in handling both tasks.
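
The abstract does not describe the internals of the jump filter, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a scalar Kalman filter (the keywords mention a Kalman filter) over the per-object distance, plus a simple gate that skips a measurement when it differs from the current estimate by more than a threshold. The class name `JumpGatedKalman1D` and every parameter and default value (`process_var`, `meas_var`, `max_jump`, `max_rejects`) are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class JumpGatedKalman1D:
    """Scalar Kalman filter that skips measurements flagged as abnormal jumps.

    All names and default values are illustrative assumptions,
    not values taken from the paper.
    """
    process_var: float = 0.05   # assumed variance of true distance change per frame (m^2)
    meas_var: float = 0.5       # assumed variance of one raw distance estimate (m^2)
    max_jump: float = 3.0       # assumed gate: largest plausible frame-to-frame change (m)
    max_rejects: int = 5        # consecutive rejections after which the jump is accepted
    x: Optional[float] = None   # current smoothed distance (m)
    p: float = 1.0              # variance of the current estimate
    rejects: int = 0            # number of consecutive rejected measurements

    def update(self, z: float) -> float:
        """Feed one raw distance measurement and return the smoothed distance."""
        if self.x is None:
            self.x = z
            return self.x
        # Predict: distance is modelled as a random walk, so only uncertainty grows.
        self.p += self.process_var
        # Gate: a large innovation is treated as an abnormal jump.
        if abs(z - self.x) > self.max_jump:
            self.rejects += 1
            if self.rejects < self.max_rejects:
                return self.x  # hold the previous estimate
            # The change persisted, so re-initialise on the new level.
            self.x, self.p, self.rejects = z, 1.0, 0
            return self.x
        self.rejects = 0
        # Correct: standard scalar Kalman update.
        k = self.p / (self.p + self.meas_var)
        self.x += k * (z - self.x)
        self.p *= 1.0 - k
        return self.x


if __name__ == "__main__":
    # Hypothetical per-frame distances (m) for one tracked object;
    # 35.0 imitates a spike caused by a sudden detection-box change.
    raw = [20.0, 20.2, 19.9, 35.0, 20.1, 20.0]
    flt = JumpGatedKalman1D()
    for z in raw:
        print(f"raw={z:5.1f}  smoothed={flt.update(z):6.2f}")
```

In a pipeline like the one the abstract outlines, one such filter would be kept per tracker ID, so that each object's distance history is smoothed independently. The `max_rejects` reset is a design choice added here so that a genuine, persistent change in distance is eventually accepted rather than rejected forever.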

Pages: 20

References: 51 entries in total
