Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses From a Practical Point of View

被引:340
作者
Loyola-Gonzalez, Octavio [1 ]
机构
[1] Tecnol Monterrey, Puebla 72453, Mexico
关键词
Black-box; white-box; explainable artificial intelligence; deep learning; BAYESIAN NETWORK; CONTRAST PATTERNS; ASSOCIATION RULES; EMERGING PATTERNS; NEURAL-NETWORKS; CLASS IMBALANCE; DECISION TREES; MODELS; SUPPORT; QUALITY;
D O I
10.1109/ACCESS.2019.2949286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, in the international scientific community of machine learning, there exists an enormous discussion about the use of black-box models or explainable models; especially in practical problems. On the one hand, a part of the community defends that black-box models are more accurate than explainable models in some contexts, like image preprocessing. On the other hand, there exist another part of the community alleging that explainable models are better than black-box models because they can obtain comparable results and also they can explain these results in a language close to a human expert by using patterns. In this paper, advantages and weaknesses for each approach are shown; taking into account a state-of-the-art review for both approaches, their practical applications, trends, and future challenges. This paper shows that both approaches are suitable for solving practical problems, but experts in machine learning need to understand the input data, the problem to solve, and the best way for showing the output data before applying a machine learning model. Also, we propose some ideas for fusing both, explainable and black-box, approaches to provide better solutions to experts in real-world domains. Additionally, we show one way to measure the effectiveness of the applied machine learning model by using expert opinions jointly with statistical methods. Throughout this paper, we show the impact of using explainable and black-box models on the security and medical applications.
引用
收藏
页码:154096 / 154113
页数:18
相关论文
共 169 条
[71]  
Fisher J. L., 1990, Proceedings. 10th International Conference on Pattern Recognition (Cat. No.90CH2898-5), P567, DOI 10.1109/ICPR.1990.118166
[72]  
Freitas A, 2007, LECT NOTES COMPUT SC, V4654, P303
[73]   Edited AdaBoost by weighted kNN [J].
Gao, Yunlong ;
Gao, Feng .
NEUROCOMPUTING, 2010, 73 (16-18) :3079-3088
[74]   Finding the best diversity generation procedures for mining contrast patterns [J].
Garcia-Borroto, Milton ;
Fco. Martinez-Trinidad, Jose ;
Ariel Carrasco-Ochoa, Jesus .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (11) :4859-4866
[75]   Fuzzy emerging patterns for classifying hard domains [J].
Garcia-Borroto, Milton ;
Fco Martinez-Trinidad, Jose ;
Ariel Carrasco-Ochoa, Jesus .
KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (02) :473-489
[76]  
Geng L., 2007, QUALITY MEASURES DAT, V43, P3, DOI [DOI 10.1007/978-3-540-44918-8_1, 10.1007/978-3-540-44918-8_1]
[77]   Interestingness measures for data mining: A survey [J].
Geng, Liqiang ;
Hamilton, Howard J. .
ACM COMPUTING SURVEYS, 2006, 38 (03) :3
[78]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[79]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672, DOI DOI 10.1145/3422622
[80]   Principles of QSAR models validation: internal and external [J].
Gramatica, Paola .
QSAR & COMBINATORIAL SCIENCE, 2007, 26 (05) :694-701