Driver Attention Detection Based on Improved YOLOv5

被引：6

作者：

Wang, Zhongzhou ^{[1
]}

Yao, Keming ^{[1
]}

Guo, Fuao ^{[1
]}

机构：

[1] Jiangsu Univ Technol, Coll Elect Informat Engn, Changzhou 213000, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 11期

关键词：

deep learning; YOLOv5; attention detection; distracted behavior detection; multi-scale feature extraction; Swin Transformer;

D O I：

10.3390/app13116645

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In response to negative impacts such as personal and property safety hazards caused by drivers being distracted while driving on the road, this article proposes a driver's attention state-detection method based on the improved You Only Look Once version five (YOLOv5). Both fatigue and distracted behavior can cause a driver's attention to be diverted during the driving process. Firstly, key facial points of the driver are located, and the aspect ratio of the eyes and mouth is calculated. Through the examination of relevant information and repeated experimental verification, threshold values for the aspect ratio of the eyes and mouth under fatigue conditions, corresponding to closed eyes and yawning, are established. By calculating the aspect ratio of the driver's eyes and mouth, it is possible to accurately detect whether the driver is in a state of fatigue. Secondly, distracted abnormal behavior is detected using an improved YOLOv5 model. The backbone network feature extraction element is modified by adding specific modules to obtain different receptive fields through multiple convolution operations on the input feature map, thereby enhancing the feature extraction ability of the network. The introduction of Swin Transformer modules in the feature fusion network replaces the Bottleneck modules in the C3 module, reducing the computational complexity of the model while increasing its receptive field. Additionally, the network connection in the feature fusion element has been modified to enhance its ability to fuse information from feature maps of different sizes. Three datasets were created of distracting behaviors commonly observed during driving: smoking, drinking water, and using a mobile phone. These datasets were used to train and test the model. After testing, the mAP (mean average precision) has improved by 2.4% compared to the model before improvement. Finally, through comparison and ablation experiments, the feasibility of this method has been verified, which can effectively detect fatigue and distracted abnormal behavior.

引用

页数：14

共 27 条

[1] Drowsiness Detection System Based on PERCLOS and Facial Physiological Signal [J].

Chang, Robert Chen-Hao ;

Wang, Chia-Yu ;

Chen, Wei-Ting ;

Chiu, Cheng-Di .

SENSORS, 2022, 22 (14)

[2] Fatigue Driving Detection Method Based on Combination of BP Neural Network and Time Cumulative Effect [J].

Chen, Jian ;

Yan, Ming ;

Zhu, Feng ;

Xu, Jing ;

Li, Hai ;

Sun, Xiaoguang .

SENSORS, 2022, 22 (13)

[3] Multi-modal Biometrics Based Implicit Driver Identification System Using Multi-TF Images of ECG and EMG [J].

Choi, Gyuho ;

Ziyang, Gong ;

Wu, Jingyi ;

Esposito, Christian ;

Choi, Chang .

COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 159

[4] Eye Aspect Ratio for Real-Time Drowsiness Detection to Improve Driver Safety [J].

Dewi, Christine ;

Chen, Rung-Ching ;

Chang, Chun-Wei ;

Wu, Shih-Hung ;

Jiang, Xiaoyi ;

Yu, Hui .

ELECTRONICS, 2022, 11 (19)

[5] A Multimodal Fusion Fatigue Driving Detection Method Based on Heart Rate and PERCLOS [J].

Du, Guanglong ;

Zhang, Linlin ;

Su, Kang ;

Wang, Xueqian ;

Teng, Shaohua ;

Liu, Peter X. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) :21810-21820

[6] Driver Distraction Detection Based on the True Driver's Focus of Attention [J].

Huang, Tao ;

Fu, Rui .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) :19374-19386

[7] Real-time fatigue driving detection system based on multi-module fusion [J].

Jia, Huijie ;

Xiao, Zhongjun ;

Ji, Peng .

COMPUTERS & GRAPHICS-UK, 2022, 108 :22-33

[8] Feature Alignment FPN for Oriented Object Detection in Remote Sensing Images [J].

Li, Zhiqing ;

Li, Erzhu ;

Xu, Tianyu ;

Samat, Alim ;

Liu, Wei .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20

[9] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].

Liu, Ze ;

Lin, Yutong ;

Cao, Yue ;

Hu, Han ;

Wei, Yixuan ;

Zhang, Zheng ;

Lin, Stephen ;

Guo, Baining .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002

[10] Automatic recognition of pavement cracks from combined GPR B-scan and C-scan images using multiscale feature fusion deep neural networks [J].

Liu, Zhen ;

Gu, Xingyu ;

Chen, Jiaqi ;

Wang, Danyu ;

Chen, Yihan ;

Wang, Lutai .

AUTOMATION IN CONSTRUCTION, 2023, 146

← 1 2 3 →