A Multidisciplinary Survey and Framework for Design and Evaluation of Explainable AI Systems

被引:292
作者
Mohseni, Sina [1 ,3 ]
Zarei, Niloofar [1 ,3 ]
Ragan, Eric D. [2 ]
机构
[1] Texas A&M Univ, College Stn, TX 77843 USA
[2] Univ Florida, E301 CSE Bldg, Gainesville, FL 32611 USA
[3] B208 Langford Bldg,3137 TAMU, College Stn, TX 77840 USA
基金
美国国家科学基金会;
关键词
Explainable artificial intelligence (XAI); human-computer interaction (HCI); machine learning; explanation; transparency; VISUAL ANALYTICS; MENTAL MODELS; PART; EXPLANATION; TRUST; INTERPRETABILITY; ACCOUNTABILITY; VISUALIZATION; TRANSPARENCY; PREDICTION;
D O I
10.1145/3387166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The need for interpretable and accountable intelligent systems grows along with the prevalence of artificial intelligence (AI) applications used in everyday life. Explainable AI (XAI) systems are intended to selfexplain the reasoning behind system decisions and predictions. Researchers from different disciplines work together to define, design, and evaluate explainable systems. However, scholars from different disciplines focus on different objectives and fairly independent topics of XAI research, which poses challenges for identifying appropriate design and evaluation methodology and consolidating knowledge across efforts. To this end, this article presents a survey and framework intended to share knowledge and experiences of XAI design and evaluation methods across multiple disciplines. Aiming to support diverse design goals and evaluation methods in XAI research, after a thorough review of XAI related papers in the fields of machine learning, visualization, and human-computer interaction, we present a categorization of XAI design goals and evaluation methods. Our categorization presents the mapping between design goals for different XAI user groups and their evaluation methods. From our findings, we develop a framework with step-by-step design guidelines paired with evaluation methods to close the iterative design and evaluation cycles in multidisciplinary XAI teams. Further, we provide summarized ready-to-use tables of evaluation methods and recommendations for different goals in XAI research.
引用
收藏
页数:45
相关论文
共 225 条
  • [71] Designs for explaining intelligent agents
    Haynes, Steven R.
    Cohen, Mark A.
    Ritter, Frank E.
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2009, 67 (01) : 90 - 110
  • [72] Agency plus automation: Designing artificial intelligence into interactive systems
    Heer, Jeffrey
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (06) : 1844 - 1850
  • [73] Women Also Snowboard: Overcoming Bias in Captioning Models
    Hendricks, Lisa Anne
    Burns, Kaylee
    Saenko, Kate
    Darrell, Trevor
    Rohrbach, Anna
    [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 793 - 811
  • [74] Herlocker J. L., 2000, CSCW 2000. ACM 2000 Conference on Computer Supported Cooperative Work, P241, DOI 10.1145/358916.358995
  • [75] Herman B., 2017, The promise and peril of human evaluation for model interpretability
  • [76] Hoffman R.R., 2018, Metrics for explainable AI: challenges and prospects
  • [77] Hoffman R.R., 2017, Macrocognition Metrics and Scenarios, P35
  • [78] Explaining Explanation, Part 4: A Deep Dive on Deep Nets
    Hoffman, Robert
    Miller, Tim
    Mueller, Shane T.
    Klein, Gary
    Clancey, William J.
    [J]. IEEE INTELLIGENT SYSTEMS, 2018, 33 (03) : 87 - 95
  • [79] Explaining Explanation, Part 2: Empirical Foundations
    Hoffman, Robert R.
    Mueller, Shane T.
    Klein, Gary
    [J]. IEEE INTELLIGENT SYSTEMS, 2017, 32 (04) : 78 - 86
  • [80] Explaining Explanation, Part 1: Theoretical Foundations
    Hoffman, Robert R.
    Klein, Gary
    [J]. IEEE INTELLIGENT SYSTEMS, 2017, 32 (03) : 68 - 73