A Deep Learning-Based Hybrid Framework for Object Detection and Recognition in Autonomous Driving

被引:67
作者
Li, Yanfen [1 ]
Wang, Hanxiang [1 ]
Dang, L. Minh [1 ]
Nguyen, Tan N. [2 ]
Han, Dongil [1 ]
Lee, Ahyun [3 ]
Jang, Insung [3 ]
Moon, Hyeonjoon [1 ]
机构
[1] Sejong Univ, Dept Comp Sci & Engn, Seoul 05006, South Korea
[2] Sejong Univ, Dept Architectural Engn, Seoul 05006, South Korea
[3] Elect & Telecommun Res Inst ETRI, City & Geospatial ICT Res Sect, Daejeon 34129, South Korea
来源
IEEE ACCESS | 2020年 / 8卷
基金
新加坡国家研究基金会;
关键词
Object detection; Task analysis; Risk management; Autonomous vehicles; Feature extraction; Pose estimation; Computational modeling; Deep learning; intention recognition; object detection; risk assessment;
D O I
10.1109/ACCESS.2020.3033289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a key technology of intelligent transportation system, the intelligent vehicle is the carrier of comprehensive integration of many technologies. Although vision-based autonomous driving has shown excellent prospects, there is still a problem of how to analyze the complicated traffic situation by the collected data. Recently, autonomous driving has been formulated as many tasks separately by using different models, such as object detection task and intention recognition task. In this study, a vision-based system was developed to detect and identity various objects and predict the intention of pedestrians in the traffic scene. The main contributions of this research are (1) an optimized model was presented to detect 10 kinds of objects based on the structure of YOLOv4; (2) a fine-tuned Part Affinity Fields approach was proposed to estimate the pose of pedestrians; (3) Explainable Artificial Intelligence (XAI) technology is added to explain and assist the estimation results in the risk assessment phase; (4) an elaborate self-driving dataset that includes several different subsets for each corresponding task was introduced; and (5) an end-to-end system containing multiple models with high accuracy was developed. Experimental results proved that the total parameters of optimized YOLOv4 are reduced by 74% which satisfies the real-time capability. In addition, the detection precision of the optimized YOLOv4 achieved an improvement of 2.6%; compared to the state-of-the-art.
引用
收藏
页码:194228 / 194239
页数:12
相关论文
共 31 条
[1]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]  
[Anonymous], 2018, ARXIV180607421
[3]  
Bochkovskiy A., 2020, ARXIV200410934
[4]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[5]   Multi-task learning for dangerous object detection in autonomous driving [J].
Chen, Yaran ;
Zhao, Dongbin ;
Lv, Le ;
Zhang, Qichao .
INFORMATION SCIENCES, 2018, 432 :559-571
[6]   Sensor-based and vision-based human activity recognition: A comprehensive survey [J].
Dang, L. Minh ;
Min, Kyungbok ;
Wang, Hanxiang ;
Piran, Md. Jalil ;
Lee, Cheol Hee ;
Moon, Hyeonjoon .
PATTERN RECOGNITION, 2020, 108 (108)
[7]   Intention Recognition of Pedestrians and Cyclists by 2D Pose Estimation [J].
Fang, Zhijie ;
Lopez, Antonio M. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (11) :4773-4783
[8]   Using k-poselets for detecting people and localizing their keypoints [J].
Gkioxari, Georgia ;
Hariharan, Bharath ;
Girshick, Ross ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :CP32-CP32
[9]  
Han Y., 2020, P IEEE CVF C COMP VI, P998
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778