Crash narrative classification: Identifying agricultural crashes using machine learning with curated keywords
被引:3
作者:
Kim, Jisung
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Transportat Inst, Mobil Div, Transportat Planning, College Stn, TX USATexas A&M Transportat Inst, Mobil Div, Transportat Planning, College Stn, TX USA
Kim, Jisung
[1
]
Trueblood, Amber Brooke
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Transportat Inst, Ctr Transportat Safety, Crash Analyt Team, College Stn, TX USATexas A&M Transportat Inst, Mobil Div, Transportat Planning, College Stn, TX USA
Trueblood, Amber Brooke
[2
]
Kum, Hye-Chung
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Univ, Sch Publ Hlth, Dept Hlth Policy & Management, Populat Informat Lab, College Stn, TX USATexas A&M Transportat Inst, Mobil Div, Transportat Planning, College Stn, TX USA
Kum, Hye-Chung
[3
]
Shipp, Eva M.
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Transportat Inst, Ctr Transportat Safety, Crash Analyt Team, College Stn, TX USATexas A&M Transportat Inst, Mobil Div, Transportat Planning, College Stn, TX USA
Shipp, Eva M.
[2
]
机构:
[1] Texas A&M Transportat Inst, Mobil Div, Transportat Planning, College Stn, TX USA
[2] Texas A&M Transportat Inst, Ctr Transportat Safety, Crash Analyt Team, College Stn, TX USA
[3] Texas A&M Univ, Sch Publ Hlth, Dept Hlth Policy & Management, Populat Informat Lab, College Stn, TX USA
Objective Traditionally, structured or coded data fields from a crash report are the basis for identifying crashes involving different types of vehicles, such as farm equipment. However, using only the structured data can lead to misclassification of vehicle or crash type. The objective of the current article is to examine the use of machine learning methods for identifying agricultural crashes based on the crash narrative and to transfer the application of models to different settings (e.g., future years of data, other states). Methods Different data representations (e.g., bag-of-words [BoW], bag-of-keywords [BoK]) and document classification algorithms (e.g., support vector machine [SVM], multinomial naive Bayes classifier [MNB]) were explored using Texas and Louisiana crash narratives across different time periods. Results The BoK-support vector classifier (SVC), BoK-MNB, and BoW-SVC models trained with Texas data were better predictive models than the baseline rule-based algorithm on the future year test data, with F1 scores of 0.88, 0.89, 0.85 vs. 0.84. The BoK-MNB trained with Louisiana data performed the closest to the baseline rule-based algorithm on the future year test data (F1 scores, 0.91 baseline rule-based algorithm vs. 0.89 BoK-MNB). The BoK-SVC and BoK-MNB models trained with Texas and Louisiana data were better productive models for Texas future year test data with F1 scores 0.89 and 0.90 vs. 0.84. The BoK-MNB model trained with both states' data was a better predictive model for the Louisiana future year test data, F1 score 0.94 vs. 0.91. Conclusions The findings of this study support that machine learning methodologies can potentially reduce the amount of human power required to develop key word lists and manually review narratives.
机构:
Univ Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
Univ Iowa, Dept Emergency Med, Iowa City, IA 52242 USAUniv Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
Harland, Karisa K.
Greenan, Mitchell
论文数: 0引用数: 0
h-index: 0
机构:
Univ Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USAUniv Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
Greenan, Mitchell
Ramirez, Marizen
论文数: 0引用数: 0
h-index: 0
机构:
Univ Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USAUniv Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
机构:
Univ Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
Univ Iowa, Dept Emergency Med, Iowa City, IA 52242 USAUniv Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
Harland, Karisa K.
Greenan, Mitchell
论文数: 0引用数: 0
h-index: 0
机构:
Univ Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USAUniv Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA
Greenan, Mitchell
Ramirez, Marizen
论文数: 0引用数: 0
h-index: 0
机构:
Univ Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USAUniv Iowa, Coll Publ Hlth, Dept Occupat & Environm Hlth, Injury Prevent Res Ctr, Iowa City, IA 52242 USA