Expert-augmented machine learning

被引:68
作者
Gennatas, Efstathios D. [1 ,10 ]
Friedman, Jerome H.
Ungar, Lyle H. [2 ]
Pirracchio, Romain [3 ]
Eaton, Eric [2 ]
Reichmann, Lara G. [4 ]
Interian, Yannet [4 ]
Luna, Jose Marcio [5 ]
Simone, Charles B., II [6 ]
Auerbach, Andrew [7 ]
Delgado, Elier [8 ]
van der Laan, Mark J. [9 ]
Solberg, Timothy D. [1 ]
Valdes, Gilmer [1 ]
机构
[1] Univ Calif San Francisco, Dept Radiat Oncol, San Francisco, CA 94143 USA
[2] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[3] Univ Calif San Francisco, Dept Anesthesia & Perioperat Care, San Francisco, CA 94143 USA
[4] Univ San Francisco, Data Inst, San Francisco, CA 94105 USA
[5] Univ Penn, Dept Radiat Oncol, Philadelphia, PA 19104 USA
[6] New York Proton Ctr, Dept Radiat Oncol, New York, NY 10035 USA
[7] Univ Calif San Francisco, Div Hosp Med, San Francisco, CA 94143 USA
[8] Innova Montreal Inc, Montreal, PQ J4W 2P2, Canada
[9] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[10] Stanford Univ, Dept Radiat Oncol, Stanford, CA 94305 USA
关键词
machine learning; medicine; computational medicine; ACUTE PHYSIOLOGY SCORE; SAPS; PREDICTION; MORTALITY; APACHE;
D O I
10.1073/pnas.1906831117
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications.
引用
收藏
页码:4571 / 4577
页数:7
相关论文
共 24 条
[1]   Elevation of blood urea nitrogen is predictive of long-term mortality in critically ill patients independent of "normal" creatinine [J].
Beier, Kevin ;
Eppanapally, Sabitha ;
Bazick, Heidi S. ;
Chang, Domingo ;
Mahadevappa, Karthik ;
Gibbons, Fiona K. ;
Christopher, Kenneth B. .
CRITICAL CARE MEDICINE, 2011, 39 (02) :305-313
[2]   Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission [J].
Caruana, Rich ;
Lou, Yin ;
Gehrke, Johannes ;
Koch, Paul ;
Sturm, Marc ;
Elhadad, Noemie .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1721-1730
[3]   Predicting dire outcomes of patients with community acquired pneumonia [J].
Cooper, GF ;
Abraham, V ;
Aliferis, CF ;
Aronis, JM ;
Buchanan, BG ;
Caruana, R ;
Fine, MJ ;
Janosky, JE ;
Livingston, G ;
Mitchell, T ;
Monti, S ;
Spirtes, P .
JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (05) :347-366
[4]   Outcome prediction in a surgical ICU using automatically calculated SAPS II scores [J].
Engel, JM ;
Junger, A ;
Bottger, S ;
Benson, M ;
Michel, A ;
Rohrig, R ;
Jost, A ;
Hempelmann, G .
ANAESTHESIA AND INTENSIVE CARE, 2003, 31 (05) :548-554
[5]   PREDICTIVE LEARNING VIA RULE ENSEMBLES [J].
Frieman, Jerome H. ;
Popescu, Bogdan E. .
ANNALS OF APPLIED STATISTICS, 2008, 2 (03) :916-954
[6]  
Gennatas E. D., THESIS
[7]   Prognosis research strategy (PROGRESS) 4: Stratified medicine research [J].
Hingorani, Aroon D. ;
van der Windt, Danielle A. ;
Riley, Richard D. ;
Abrams, Keith ;
Moons, Karel G. M. ;
Steyerberg, Ewout W. ;
Schroter, Sara ;
Sauerbrei, Willi ;
Altman, Douglas G. ;
Hemingway, Harry .
BMJ-BRITISH MEDICAL JOURNAL, 2013, 346
[8]   The MIMIC Code Repository: enabling reproducibility in critical care research [J].
Johnson, Alistair E. W. ;
Stone, David J. ;
Celi, Leo A. ;
Pollard, Tom J. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (01) :32-39
[9]   MIMIC-III, a freely accessible critical care database [J].
Johnson, Alistair E. W. ;
Pollard, Tom J. ;
Shen, Lu ;
Lehman, Li-wei H. ;
Feng, Mengling ;
Ghassemi, Mohammad ;
Moody, Benjamin ;
Szolovits, Peter ;
Celi, Leo Anthony ;
Mark, Roger G. .
SCIENTIFIC DATA, 2016, 3
[10]   APACHE - ACUTE PHYSIOLOGY AND CHRONIC HEALTH EVALUATION - A PHYSIOLOGICALLY BASED CLASSIFICATION-SYSTEM [J].
KNAUS, WA ;
ZIMMERMAN, JE ;
WAGNER, DP ;
DRAPER, EA ;
LAWRENCE, DE .
CRITICAL CARE MEDICINE, 1981, 9 (08) :591-597