Machine learning for phenotyping opioid overdose events

被引:24
作者
Badger, Jonathan [1 ,2 ]
LaRose, Eric [1 ]
Mayer, John [1 ]
Bashiri, Fereshteh [1 ]
Page, David [2 ,3 ]
Peissig, Peggy [1 ]
机构
[1] Marshfield Clin Fdn Med Res & Educ, Res Inst, Marshfield, WI 54449 USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[3] Univ Wisconsin, Dept Comp Sci, 1210 W Dayton St, Madison, WI 53706 USA
关键词
Machine learning; Opioid; Phenotype; Overdose; Electronic health record; INDUCED RESPIRATORY DEPRESSION; UNITED-STATES; HIGH-THROUGHPUT; RISK; DRUG; EPIDEMIC; DEATHS; TRENDS; ABUSE; HEROIN;
D O I
10.1016/j.jbi.2019.103185
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: To develop machine learning models for classifying the severity of opioid overdose events from clinical data. Materials and methods: Opioid overdoses were identified by diagnoses codes from the Marshfield Clinic population and assigned a severity score via chart review to form a gold standard set of labels. Three primary feature sets were constructed from disparate data sources surrounding each event and used to train machine learning models for phenotyping. Results: Random forest and penalized logistic regression models gave the best performance with cross-validated mean areas under the ROC curves (AUCs) for all severity classes of 0.893 and 0.882 respectively. Features derived from a common data model outperformed features collected from disparate data sources for the same cohort of patients (AUCs 0.893 versus 0.837, p value = 0.002). The addition of features extracted from free text to machine learning models also increased AUCs from 0.827 to 0.893 (p value < 0.0001). Key word features extracted using natural language processing (NLP) such as 'Narcan' and 'Endotracheal Tube' are important for classifying overdose event severity. Conclusion: Random forest models using features derived from a common data model and free text can be effective for classifying opioid overdose events.
引用
收藏
页数:11
相关论文
共 36 条
[1]   Learning statistical models of phenotypes using noisy labeled training data [J].
Agarwal, Vibhu ;
Podchiyska, Tanya ;
Banda, Juan M. ;
Goel, Veena ;
Leung, Tiffany I. ;
Minty, Evan P. ;
Sweeney, Timothy E. ;
Gyang, Elsie ;
Shah, Nigam H. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (06) :1166-1173
[2]  
[Anonymous], 2017, Annual Surveillance Report of Drug-Related Risks and Outcomes, DOI https://www.cdc.gov/drugoverdose/pdf/pubs/2017-cdc-drug-surveillance-report.pdf
[3]  
Banda Juan M, 2017, AMIA Jt Summits Transl Sci Proc, V2017, P48
[4]   DRUG THERAPY Management of Opioid Analgesic Overdose [J].
Boyer, Edward W. .
NEW ENGLAND JOURNAL OF MEDICINE, 2012, 367 (02) :146-155
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Cawley GC, 2010, J MACH LEARN RES, V11, P2079
[7]   Applying active learning to high-throughput phenotyping algorithms for electronic health records data [J].
Chen, Yukun ;
Carroll, Robert J. ;
Hinz, Eugenia R. McPeek ;
Shah, Anushi ;
Eyler, Anne E. ;
Denny, Joshua C. ;
Xu, Hua .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) :E253-E259
[8]   The opioid abuse and misuse epidemic: Implications for pharmacists in hospitals and health systems [J].
Cobaugh, Daniel J. ;
Gainor, Carl ;
Gaston, Cynthia L. ;
Kwong, Tai C. ;
Magnani, Barbarajean ;
McPherson, Mary Lynn ;
Painter, Jacob T. ;
Krenzelok, Edward P. .
AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2014, 71 (18) :1539-1554
[9]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[10]   Underlying Factors in Drug Overdose Deaths [J].
Dowell, Deborah ;
Noonan, Rita K. ;
Houry, Debra .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (23) :2295-2296