Finding and understanding pedal misapplication crashes using a deep learning natural language model

Cited by: 4

Authors:

Bareiss, Max [1]

Smith, Colin [1]

Gabler, Hampton C. [1]

Affiliation:

[1] Virginia Tech, Dept Biomed Engn, Blacksburg, VA USA

Source:

TRAFFIC INJURY PREVENTION | 2021 / Vol. 22

Keywords:

Pedal misapplication; NMVCCS; deep learning; BERT; NLP;

DOI:

10.1080/15389588.2021.1982616

Chinese Library Classification number:

R1 [Preventive Medicine, Hygiene];

Discipline classification code:

1004 ; 120402 ;

Abstract:

Objective: The objective of this study was to develop a system which used the BERT natural language understanding model to identify pedal misapplication (PM) crashes from their crash narratives and validate the accuracy of the system.

Methods: The training dataset used for this study was 11 cases from the NMVCCS study and 952 cases from the North Carolina state crash database. Cases for this study were selected from their respective full datasets using a keyword search algorithm containing terms indicative of a pedal-related mistake. A BERT language model was used to classify each case narrative as either no pedal misapplication, PM by vehicle 1, PM by vehicle 2, or PM by vehicle 3. After training, the language model was used to determine the incidence of pedal misapplication in a test dataset of 8,668 North Carolina and NMVCCS cases and these results were compared to a manual review of the dataset. After manual review, 2,969 cases were pedal misapplications.

Results: The model's AUC ROC performance at detecting PM was quantified on the entire testing dataset to evaluate the power of the system to generalize to case narratives unseen at training time. The AUC ROC value was 0.9835, indicating strong generalization to all crash narratives. By choosing the optimal threshold using the ROC curve, the system correctly identified PM in 95.7% of crash narratives. When pedal misapplication was correctly identified, the correct vehicle was identified in 95.9% of cases. A total of 3,062 pedal misapplications were identified. The model labeled cases 353 times faster than a researcher.

Conclusions: The strong performance of the model suggests that the automated interpretation of case narratives can be used for future research studies without any manual review. This would save time and enable the use of datasets where manual review would be infeasible. The automated extraction of information from crash narratives using deep learning natural language models has not been demonstrated previously in the literature, to the best of the authors' knowledge. This technique can be applied to large, infrequently used datasets of crash narratives and extended to extract useful vehicle, occupant, or environment information to make these datasets amenable to traditional statistical analyses.
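
The abstract reports an AUC ROC value and a decision threshold chosen from the ROC curve, but does not say which criterion defined "optimal". The sketch below is one common way to do this, assuming Youden's J statistic (true positive rate minus false positive rate) as the criterion. The function names and the toy scores and labels are illustrative assumptions, not the authors' code or data.

```python
from typing import List, Tuple

# Each ROC point is (threshold, true positive rate, false positive rate).
RocPoint = Tuple[float, float, float]


def roc_points(scores: List[float], labels: List[int]) -> List[RocPoint]:
    """Build ROC points, predicting PM when score >= threshold.

    Thresholds are the distinct scores in descending order, so both
    rates are non-decreasing along the returned list.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative label")
    points = []
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((threshold, tp / positives, fp / negatives))
    return points


def auc_roc(points: List[RocPoint]) -> float:
    """Area under the ROC curve by the trapezoid rule, starting at (0, 0)."""
    area = 0.0
    prev_tpr, prev_fpr = 0.0, 0.0
    for _, tpr, fpr in points:
        area += (fpr - prev_fpr) * (tpr + prev_tpr) / 2.0
        prev_tpr, prev_fpr = tpr, fpr
    return area


def youden_threshold(points: List[RocPoint]) -> float:
    """Threshold maximizing J = TPR - FPR (assumed optimality criterion)."""
    return max(points, key=lambda p: p[1] - p[2])[0]


# Toy example (made-up values): model scores for "PM" and manual-review labels.
toy_scores = [0.95, 0.90, 0.80, 0.60, 0.40, 0.30, 0.20, 0.10]
toy_labels = [1, 1, 1, 0, 0, 1, 0, 0]

toy_points = roc_points(toy_scores, toy_labels)
print(auc_roc(toy_points))           # 0.875
print(youden_threshold(toy_points))  # 0.8
```

Other criteria, such as the ROC point closest to the top-left corner, can select a different threshold on the same curve.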


Pages: S169-S172

Page count: 4
