Predicting Procedure Step Performance From Operator and Text Features: A Critical First Step Toward Machine Learning-Driven Procedure Design

被引:1
作者
McDonald, Anthony D. [1 ]
Ade, Nilesh [1 ]
Peres, S. Camille [1 ]
机构
[1] Texas A&M Univ, College Stn, TX USA
关键词
machine learning; procedure design; operator performance; random forest; decision tree; READABILITY FORMULA; WRITTEN PROCEDURES; FEATURE-SELECTION; DECISION TREES; MODELS; SAFETY; RECALCULATION; MANAGEMENT; WORKING; RULES;
D O I
10.1177/0018720820958588
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Objective The goal of this study is to assess machine learning for predicting procedure performance from operator and procedure characteristics. Background Procedures are vital for the performance and safety of high-risk industries. Current procedure design guidelines are insufficient because they rely on subjective assessments and qualitative analyses that struggle to integrate and quantify the diversity of factors that influence procedure performance. Method We used data from a 25-participant study with four procedures, conducted on a high-fidelity oil extraction simulation to develop logistic regression (LR), random forest (RF), and decision tree (DT) algorithms that predict procedure step performance from operator, step, readability, and natural language processing-based features. Features were filtered using the Boruta approach. The algorithms were trained and optimized with a repeated 10-fold cross-validation. After training, inference was performed using variable importance and partial dependence plots. Results The RF, DT, and LR algorithms with all features had an area under the receiver operating characteristic curve (AUC) of 0.78, 0.77, and 0.75, respectively, and significantly outperformed the LR with only operator features (LROP), with an AUC of 0.61. The most important features were experience, familiarity, total words, and character-based metrics. The partial dependence plots showed that steps with fewer words, abbreviations, and characters were correlated with correct step performance. Conclusion Machine learning algorithms are a promising approach for predicting step-level procedure performance, with acknowledged limitations on interpolating to nonobserved data, and may help guide procedure design after validation with additional data on further tasks. Application After validation, the inferences from these models can be used to generate procedure design alternatives.
引用
收藏
页码:701 / 717
页数:17
相关论文
共 87 条
[1]   Development of a procedure writers' guide framework: Integrating the procedure life cycle and reflecting on current industry practices [J].
Ahmed, Lubna ;
Quddus, Noor ;
Kannan, Pranav ;
Peres, S. Camille ;
Mannan, M. Sam .
INTERNATIONAL JOURNAL OF INDUSTRIAL ERGONOMICS, 2020, 76
[2]  
ANDERSON J, 1983, J READING, V26, P490
[3]   ACQUISITION OF COGNITIVE SKILL [J].
ANDERSON, JR .
PSYCHOLOGICAL REVIEW, 1982, 89 (04) :369-406
[4]  
[Anonymous], DATA OBJECT
[5]  
Bamberger Richard., 1984, Lesen, verstehen, lernen, schreiben: die Schwierigkeitsstufen von Texten in deutscher Sprache
[6]  
Baron R, 2009, FAILURE FOLLOW PROCE
[7]  
Bates S., 2012, 919 RR HLTH SAF EX L
[8]   Quantifying human and organizational factors in accident management using decision trees:: the HORAAM method [J].
Baumont, G ;
Ménage, F ;
Schneiter, JR ;
Spurgin, A ;
Vogel, A .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2000, 70 (02) :113-124
[9]  
Benoit K., 2018, J OPEN SOURCE SOFTWA, V3, P774, DOI [DOI 10.21105/JOSS.00774, 10.21105/joss.00774]
[10]   Industrial and occupational ergonomics in the petrochemical process industry: A regression trees approach [J].
Bevilacqua, M. ;
Ciarapica, F. E. ;
Giacchetta, G. .
ACCIDENT ANALYSIS AND PREVENTION, 2008, 40 (04) :1468-1479