Comparison and Validation of Injury Risk Classifiers for Advanced Automated Crash Notification Systems

被引:18
作者
Kusano, Kristofer [1 ]
Gabler, Hampton C. [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
关键词
advanced automated crash notification; machine learning; injury risk; logistic regression; STATISTICAL-ANALYSIS; FIELD TRIAGE; TRAUMA; OVERTRIAGE; COST;
D O I
10.1080/15389588.2014.927577
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Objective: The odds of death for a seriously injured crash victim are drastically reduced if he or she received care at a trauma center. Advanced automated crash notification (AACN) algorithms are postcrash safety systems that use data measured by the vehicles during the crash to predict the likelihood of occupants being seriously injured. The accuracy of these models are crucial to the success of an AACN. The objective of this study was to compare the predictive performance of competing injury risk models and algorithms: logistic regression, random forest, AdaBoost, naive Bayes, support vector machine, and classification k-nearest neighbors. Methods: This study compared machine learning algorithms to the widely adopted logistic regression modeling approach. Machine learning algorithms have not been commonly studied in the motor vehicle injury literature. Machine learning algorithms may have higher predictive power than logistic regression, despite the drawback of lacking the ability to perform statistical inference. To evaluate the performance of these algorithms, data on 16,398 vehicles involved in non-rollover collisions were extracted from the NASS-CDS. Vehicles with any occupants having an Injury Severity Score (ISS) of 15 or greater were defined as those requiring victims to be treated at a trauma center. The performance of each model was evaluated using cross-validation. Cross-validation assesses how a model will perform in the future given new data not used for model training. The crash Delta V (change in velocity during the crash), damage side (struck side of the vehicle), seat belt use, vehicle body type, number of events, occupant age, and occupant sex were used as predictors in each model. Results and Conclusions: Logistic regression slightly outperformed the machine learning algorithms based on sensitivity and specificity of the models. Previous studies on AACN risk curves used the same data to train and test the power of the models and as a result had higher sensitivity compared to the cross-validated results from this study. Future studies should account for future data; for example, by using cross-validation or risk presenting optimistic predictions of field performance. Past algorithms have been criticized for relying on age and sex, being difficult to measure by vehicle sensors, and inaccuracies in classifying damage side. The models with accurate damage side and including age/sex did outperform models with less accurate damage side and without age/sex, but the differences were small, suggesting that the success of AACN is not reliant on these predictors.
引用
收藏
页码:S126 / S133
页数:8
相关论文
共 27 条
  • [1] Using logistic regression to estimate the influence of accident factors on accident severity
    Al-Ghamdi, AS
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2002, 34 (06) : 729 - 741
  • [2] Bahouth G.T., 2004, Topics in Emergency Medicine, V26, P157
  • [3] INJURY SEVERITY SCORE - METHOD FOR DESCRIBING PATIENTS WITH MULTIPLE INJURIES AND EVALUATING EMERGENCY CARE
    BAKER, SP
    ONEILL, B
    HADDON, W
    LONG, WB
    [J]. JOURNAL OF TRAUMA-INJURY INFECTION AND CRITICAL CARE, 1974, 14 (03): : 187 - 196
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] Champion H., 2004, TOPICS EMERG MED, V26, P143
  • [6] Davis J., 2006, P 23 INT C MACH LEAR, P233, DOI [10.1145/1143844.1143874, DOI 10.1145/1143844.1143874]
  • [7] LARGE COST SAVINGS REALIZED FROM THE 2006 FIELD TRIAGE GUIDELINE: REDUCTION IN OVERTRIAGE IN U.S. TRAUMA CENTERS
    Faul, Mark
    Wald, Marlena M.
    Sullivent, Ernest E.
    Sasser, Scott M.
    Kapil, Vikas
    Lerner, E. Brooke
    Hunt, Richard C.
    [J]. PREHOSPITAL EMERGENCY CARE, 2012, 16 (02) : 222 - 229
  • [8] A decision-theoretic generalization of on-line learning and an application to boosting
    Freund, Y
    Schapire, RE
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) : 119 - 139
  • [9] Gabler H., 2008, Event data recorders : a decade of innovation
  • [10] Garavaglia S, 1998, NE SAS US GROUP C NE