On the Safety of Machine Learning: Cyber-Physical Systems, Decision Sciences, and Data Products

被引:112
作者
Varshney, Kush R. [1 ]
Alemzadeh, Homa [2 ]
机构
[1] IBM Thomas J Watson Res Ctr, Dept Data Sci, Yorktown Hts, NY USA
[2] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA USA
关键词
cyber-physical systems; data products; decision science; machine learning; safety;
D O I
10.1089/big.2016.0051
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Machine learning algorithms increasingly influence our decisions and interact with us in all parts of our daily lives. Therefore, just as we consider the safety of power plants, highways, and a variety of other engineered socio-technical systems, we must also take into account the safety of systems involving machine learning. Heretofore, the definition of safety has not been formalized in a machine learning context. In this article, we do so by defining machine learning safety in terms of risk, epistemic uncertainty, and the harm incurred by unwanted outcomes. We then use this definition to examine safety in all sorts of applications in cyber-physical systems, decision sciences, and data products. We find that the foundational principle of modern statistical machine learning, empirical risk minimization, is not always a sufficient objective. We discuss how four different categories of strategies for achieving safety in engineering, including inherently safe design, safety reserves, safe fail, and procedural safeguards can be mapped to a machine learning context. We then discuss example techniques that can be adopted in each category, such as considering interpretability and causality of predictive models, objective functions beyond expected prediction accuracy, human involvement for labeling difficult or rare examples, and user experience design of software and open data.
引用
收藏
页码:246 / 255
页数:10
相关论文
共 54 条
[1]   Adverse Events in Robotic Surgery: A Retrospective Study of 14 Years of FDA Data [J].
Alemzadeh, Homa ;
Raman, Jaishankar ;
Leveson, Nancy ;
Kalbarczyk, Zbigniew ;
Iyer, Ravishankar K. .
PLOS ONE, 2016, 11 (04)
[2]   Systems-Theoretic Safety Assessment of Robotic Telesurgical Systems [J].
Alemzadeh, Homa ;
Chen, Daniel ;
Lewis, Andrew ;
Kalbarczyk, Zbigniew ;
Raman, Jaishankar ;
Leveson, Nancy ;
Iyer, Ravishankar .
COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2015, 2015, 9337 :213-227
[3]  
Alemzadeh Homa, 2016, THESIS
[4]  
[Anonymous], 2013, International Conference on Machine Learning (ICML)
[5]  
[Anonymous], CAUSAL FALLING RULE
[6]  
[Anonymous], 2015, Machine learning methods for estimating heterogeneous causal eects
[7]  
[Anonymous], 2016, ICML WORKSHOP HUMAN
[8]  
[Anonymous], CALIFORNIA LAW REV
[9]   Beat the Machine: Challenging Humans to Find a Predictive Model's "Unknown Unknowns" [J].
Attenberg, Joshua ;
Ipeirotis, Panos ;
Provost, Foster .
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2015, 6 (01)
[10]   A Chance-Constrained Programming Approach to Preoperative Planning of Robotic Cardiac Surgery Under Task-Level Uncertainty [J].
Azimian, Hamidreza ;
Naish, Michael D. ;
Kiaii, Bob ;
Patel, Rajni V. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (02) :612-622