Two-stream video-based deep learning model for crashes and near-crashes

被引：1

作者：

Shi, Liang ^{[1
,2
]}

Guo, Feng ^{[1
]}

机构：

[1] Virginia Polytech Inst & State Univ, Dept Stat, Blacksburg, VA 24061 USA

[2] Virginia Polytech Inst & State Univ, Virginia Tech Transportat Inst, Blacksburg, VA 24061 USA

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2024年 / 166卷

关键词：

Crash prediction; Front-view video driving data; Deep learning; TimeSFormer; Optical flow; XGBoost; Naturalistic driving study;

D O I：

10.1016/j.trc.2024.104794

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

The use of videos for effective crash and near-crash prediction can significantly enhance the development of safety countermeasures and emergency response. This paper presents a two- stream hybrid model with temporal and spatial streams for crash and near-crash identification based on front-view video driving data. The novel temporal stream integrates optical flow and TimeSFormer, utilizing divided-space-time attention. The spatial stream employs TimeSFormer with space attention to complement spatial information that is not captured by the optical flow. An XGBoost classifier merges the two streams through late fusion. The proposed approach utilizes data from the Second Strategic Highway Research Program Naturalistic Driving Study, which encompasses 1922 crashes, 6960 near-crashes, and 8611 normal driving segments. The results demonstrate excellent performance, achieving an overall accuracy of 0.894. The F1 scores for crashes, near-crashes, and normal driving segments were 0.760, 0.892, and 0.923, respectively, indicating strong predictive power for all three categories. The proposed approach offers a highly effective and scalable solution for identifying crashes and near-crashes using front-view video driving data and has broad applications in the field of traffic safety.

引用

页数：14

共 46 条

[1] ViViT: A Video Vision Transformer [J].

Arnab, Anurag ;

Dehghani, Mostafa ;

Heigold, Georg ;

Sun, Chen ;

Lucic, Mario ;

Schmid, Cordelia .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6816-6826

[2] Safety critical event prediction through unified analysis of driver and vehicle volatilities: Application of deep learning methods [J].

Arvin, Ramin ;

Khattak, Asad J. ;

Qi, Hairong .

ACCIDENT ANALYSIS AND PREVENTION, 2021, 151

[3] DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation [J].

Bao, Wentao ;

Yu, Qi ;

Kong, Yu .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :7599-7608

[4] Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning [J].

Bao, Wentao ;

Yu, Qi ;

Kong, Yu .

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2682-2690

[5]

Bertasius G, 2021, PR MACH LEARN RES, V139

[6] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].

Carreira, Joao ;

Zisserman, Andrew .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733

[7] Anticipating Accidents in Dashcam Videos [J].

Chan, Fu-Hsiang ;

Chen, Yu-Ting ;

Xiang, Yu ;

Sun, Min .

COMPUTER VISION - ACCV 2016, PT IV, 2017, 10114 :136-153

[8]

Chen K, 2019, Arxiv, DOI [arXiv:1906.07155, DOI 10.48550/ARXIV.1906.07155]

[9]

Chen T., 2015, R PACKAGE VERSION 04, V1

[10]

Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]

← 1 2 3 4 5 →