Local Interpretations for Explainable Natural Language Processing: A Survey

Cited by: 8
Authors
Luo, Siwen [1]
Ivison, Hamish [2]
Han, Soyeon Caren [3]
Poon, Josiah [4]
Affiliations
[1] Univ Western Australia, 35 Stirling Hwy, Perth, WA 6009, Australia
[2] Univ Washington, 3800 E Stevens Way NE, Seattle, WA 98195 USA
[3] Univ Melbourne, 700 Swanston St, Melbourne, Vic 3010, Australia
[4] Univ Sydney, 1 Cleveland St, Darlington, NSW 2008, Australia
Keywords
Deep neural networks; explainable AI; local interpretation; natural language processing; prediction
DOI
10.1145/3649450
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models. This work investigates various methods to improve the interpretability of deep neural networks for Natural Language Processing (NLP) tasks, including machine translation and sentiment analysis. We provide a comprehensive discussion on the definition of the term interpretability and its various aspects at the beginning of this work. The methods collected and summarised in this survey are only associated with local interpretation and are specifically divided into three categories: (1) interpreting the model's predictions through related input features; (2) interpreting through natural language explanation; (3) probing the hidden states of models and word representations.
Pages: 36
Related papers
50 in total
  • [21] Adversarial attack and defense technologies in natural language processing: A survey
    Qiu, Shilin
    Liu, Qihe
    Zhou, Shijie
    Huang, Wen
    NEUROCOMPUTING, 2022, 492 : 278 - 307
  • [22] Identification of Causal Dependencies by using Natural Language Processing: A Survey
    Nazaruka, Erika
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019 : 603 - 613
  • [23] Pre-trained models for natural language processing: A survey
    XiPeng Qiu
    TianXiang Sun
    YiGe Xu
    YunFan Shao
    Ning Dai
    XuanJing Huang
    Science China Technological Sciences, 2020, 63 : 1872 - 1897
  • [24] A survey on detecting mental disorders with natural language processing: Literature review, trends and challenges
    Montejo-Raez, Arturo
    Molina-Gonzalez, M. Dolores
    Jimenez-Zafra, Salud Maria
    Garcia-Cumbreras, Miguel Angel
    Garcia-Lopez, Luis Joaquin
    COMPUTER SCIENCE REVIEW, 2024, 53
  • [25] Understanding poetry using natural language processing tools: a survey
    De Sisto, Mirella
    Hernandez-Lorenzo, Laura
    de la Rosa, Javier
    Ros, Salvador
    Gonzalez-Blanco, Elena
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2024, 39 (02) : 500 - 521
  • [26] Explainable Prediction of Machine-Tool Breakdowns Based on Combination of Natural Language Processing and Classifiers
    Ben Ayed, Maha
    Soualhi, Moncef
    Mairot, Nicolas
    Giampiccolo, Sylvain
    Ketata, Raouf
    Zerhouni, Noureddine
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 4, INTELLISYS 2023, 2024, 825 : 105 - 121
  • [27] Natural Language Processing for Smart Healthcare
    Zhou, Binggui
    Yang, Guanghua
    Shi, Zheng
    Ma, Shaodan
    IEEE REVIEWS IN BIOMEDICAL ENGINEERING, 2024, 17 : 4 - 18
  • [28] Gender Bias in Natural Language Processing and Computer Vision: A Comparative Survey
    Bartl, Marion
    Mandal, Abhishek
    Leavy, Susan
    Little, Suzanne
    ACM COMPUTING SURVEYS, 2025, 57 (06)
  • [29] Survey on Mathematical Word Problem Solving Using Natural Language Processing
    Ughade, Shounaak
    Kumbhar, Satish
    PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019
  • [30] Literature Survey of statistical, deep and reinforcement learning in Natural Language Processing
    Kaushik, Pranav
    Sharma, Akanksha Rai
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017 : 350 - 354