Local Interpretations for Explainable Natural Language Processing: A Survey

被引：8

作者：

Luo, Siwen ^{[1
]}

Ivison, Hamish ^{[2
]}

Han, Soyeon Caren ^{[3
]}

Poon, Josiah ^{[4
]}

机构：

[1] Univ Western Australia, 35 Stirling Hwy, Perth, WA 6009, Australia

[2] Univ Washington, 3800 E Stevens Way NE, Seattle, WA 98195 USA

[3] Univ Melbourne, 700 Swanston St, Melbourne, Vic 3010, Australia

[4] Univ Sydney, 1 Cleveland St, Darlington, NSW 2008, Australia

来源：

ACM COMPUTING SURVEYS | 2024年 / 56卷 / 09期

关键词：

Deep neural networks; explainable AI; local interpretation; natural language processing; PREDICTION;

D O I：

10.1145/3649450

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models. This work investigates various methods to improve the interpretability of deep neural networks for Natural Language Processing (NLP) tasks, including machine translation and sentiment analysis. We provide a comprehensive discussion on the definition of the term interpretability and its various aspects at the beginning of this work. The methods collected and summarised in this survey are only associated with local interpretation and are specifically divided into three categories: (1) interpreting the model's predictions through related input features; (2) interpreting through natural language explanation; (3) probing the hidden states of models and word representations.

引用

页数：36

共 50 条

[21] Adversarial attack and defense technologies in natural language processing: A survey
Qiu, Shilin
Liu, Qihe
Zhou, Shijie
Huang, Wen
NEUROCOMPUTING, 2022, 492 : 278 - 307
[22] Identification of Causal Dependencies by using Natural Language Processing: A Survey
Nazaruka, Erika
PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019, : 603 - 613
[23] Pre-trained models for natural language processing: A survey
XiPeng Qiu
TianXiang Sun
YiGe Xu
YunFan Shao
Ning Dai
XuanJing Huang
Science China Technological Sciences, 2020, 63 : 1872 - 1897
[24] A survey on detecting mental disorders with natural language processing: Literature review, trends and challenges
Montejo-Raez, Arturo
Molina-Gonzalez, M. Dolores
Jimenez-Zafra, Salud Maria
Garcia-Cumbreras, Miguel Angel
Garcia-Lopez, Luis Joaquin
COMPUTER SCIENCE REVIEW, 2024, 53
[25] Understanding poetry using natural language processing tools: a survey
De Sisto, Mirella
Hernandez-Lorenzo, Laura
de la Rosa, Javier
Ros, Salvador
Gonzalez-Blanco, Elena
DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2024, 39 (02) : 500 - 521
[26] Explainable Prediction of Machine-Tool Breakdowns Based on Combination of Natural Language Processing and Classifiers
Ben Ayed, Maha
Soualhi, Moncef
Mairot, Nicolas
Giampiccolo, Sylvain
Ketata, Raouf
Zerhouni, Noureddine
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 4, INTELLISYS 2023, 2024, 825 : 105 - 121
[27] Natural Language Processing for Smart Healthcare
Zhou, Binggui
Yang, Guanghua
Shi, Zheng
Ma, Shaodan
IEEE REVIEWS IN BIOMEDICAL ENGINEERING, 2024, 17 : 4 - 18
[28] Gender Bias in Natural Language Processing and Computer Vision: A Comparative Survey
Bartl, Marion
Mandal, Abhishek
Leavy, Susan
Little, Suzanne
ACM COMPUTING SURVEYS, 2025, 57 (06)
[29] Survey on Mathematical Word Problem Solving Using Natural Language Processing
Ughade, Shounaak
Kumbhar, Satish
PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019,
[30] Literature Survey of statistical, deep and reinforcement learning in Natural Language Processing
Kaushik, Pranav
Sharma, Akanksha Rai
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 350 - 354

← 1 2 3 4 5 →