A Multidisciplinary Survey and Framework for Design and Evaluation of Explainable AI Systems

被引：292

作者：

Mohseni, Sina ^{[1
,3
]}

Zarei, Niloofar ^{[1
,3
]}

Ragan, Eric D. ^{[2
]}

机构：

[1] Texas A&M Univ, College Stn, TX 77843 USA

[2] Univ Florida, E301 CSE Bldg, Gainesville, FL 32611 USA

[3] B208 Langford Bldg,3137 TAMU, College Stn, TX 77840 USA

来源：

ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS | 2021年 / 11卷 / 3-4期

基金：

美国国家科学基金会;

关键词：

Explainable artificial intelligence (XAI); human-computer interaction (HCI); machine learning; explanation; transparency; VISUAL ANALYTICS; MENTAL MODELS; PART; EXPLANATION; TRUST; INTERPRETABILITY; ACCOUNTABILITY; VISUALIZATION; TRANSPARENCY; PREDICTION;

D O I：

10.1145/3387166

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The need for interpretable and accountable intelligent systems grows along with the prevalence of artificial intelligence (AI) applications used in everyday life. Explainable AI (XAI) systems are intended to selfexplain the reasoning behind system decisions and predictions. Researchers from different disciplines work together to define, design, and evaluate explainable systems. However, scholars from different disciplines focus on different objectives and fairly independent topics of XAI research, which poses challenges for identifying appropriate design and evaluation methodology and consolidating knowledge across efforts. To this end, this article presents a survey and framework intended to share knowledge and experiences of XAI design and evaluation methods across multiple disciplines. Aiming to support diverse design goals and evaluation methods in XAI research, after a thorough review of XAI related papers in the fields of machine learning, visualization, and human-computer interaction, we present a categorization of XAI design goals and evaluation methods. Our categorization presents the mapping between design goals for different XAI user groups and their evaluation methods. From our findings, we develop a framework with step-by-step design guidelines paired with evaluation methods to close the iterative design and evaluation cycles in multidisciplinary XAI teams. Further, we provide summarized ready-to-use tables of evaluation methods and recommendations for different goals in XAI research.

引用

页数：45

共 225 条

[71] Designs for explaining intelligent agents
Haynes, Steven R.
Cohen, Mark A.
Ritter, Frank E.
[J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2009, 67 (01) : 90 - 110
[72] Agency plus automation: Designing artificial intelligence into interactive systems
Heer, Jeffrey
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (06) : 1844 - 1850
[73] Women Also Snowboard: Overcoming Bias in Captioning Models
Hendricks, Lisa Anne
Burns, Kaylee
Saenko, Kate
Darrell, Trevor
Rohrbach, Anna
[J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 793 - 811
[74] Herlocker J. L., 2000, CSCW 2000. ACM 2000 Conference on Computer Supported Cooperative Work, P241, DOI 10.1145/358916.358995
[75] Herman B., 2017, The promise and peril of human evaluation for model interpretability
[76] Hoffman R.R., 2018, Metrics for explainable AI: challenges and prospects
[77] Hoffman R.R., 2017, Macrocognition Metrics and Scenarios, P35
[78] Explaining Explanation, Part 4: A Deep Dive on Deep Nets
Hoffman, Robert
Miller, Tim
Mueller, Shane T.
Klein, Gary
Clancey, William J.
[J]. IEEE INTELLIGENT SYSTEMS, 2018, 33 (03) : 87 - 95
[79] Explaining Explanation, Part 2: Empirical Foundations
Hoffman, Robert R.
Mueller, Shane T.
Klein, Gary
[J]. IEEE INTELLIGENT SYSTEMS, 2017, 32 (04) : 78 - 86
[80] Explaining Explanation, Part 1: Theoretical Foundations
Hoffman, Robert R.
Klein, Gary
[J]. IEEE INTELLIGENT SYSTEMS, 2017, 32 (03) : 68 - 73

← 3 4 5 6 7 8 9 10 11 12 →