Local Interpretations for Explainable Natural Language Processing: A Survey

被引:8
作者
Luo, Siwen [1 ]
Ivison, Hamish [2 ]
Han, Soyeon Caren [3 ]
Poon, Josiah [4 ]
机构
[1] Univ Western Australia, 35 Stirling Hwy, Perth, WA 6009, Australia
[2] Univ Washington, 3800 E Stevens Way NE, Seattle, WA 98195 USA
[3] Univ Melbourne, 700 Swanston St, Melbourne, Vic 3010, Australia
[4] Univ Sydney, 1 Cleveland St, Darlington, NSW 2008, Australia
关键词
Deep neural networks; explainable AI; local interpretation; natural language processing; PREDICTION;
D O I
10.1145/3649450
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models. This work investigates various methods to improve the interpretability of deep neural networks for Natural Language Processing (NLP) tasks, including machine translation and sentiment analysis. We provide a comprehensive discussion on the definition of the term interpretability and its various aspects at the beginning of this work. The methods collected and summarised in this survey are only associated with local interpretation and are specifically divided into three categories: (1) interpreting the model's predictions through related input features; (2) interpreting through natural language explanation; (3) probing the hidden states of models and word representations.
引用
收藏
页数:36
相关论文
共 50 条
  • [31] DEEP LEARNING IN NATURAL LANGUAGE PROCESSING: A STATE-OF-THE-ART SURVEY
    Chai, Junyi
    Li, Anming
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 535 - 540
  • [32] Token-modification adversarial attacks for natural language processing: A survey
    Roth, Tom
    Gao, Yansong
    Abuadbba, Alsharif
    Nepal, Surya
    Liu, Wei
    AI COMMUNICATIONS, 2024, 37 (04) : 655 - 676
  • [33] Survey of Adversarial Attack, Defense and Robustness Analysis for Natural Language Processing
    Zheng H.
    Chen J.
    Zhang Y.
    Zhang X.
    Ge C.
    Liu Z.
    Ouyang Y.
    Ji S.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (08): : 1727 - 1750
  • [34] A Critical Survey on the use of Fuzzy Sets in Speech and Natural Language Processing
    Carvalho, Joao P.
    Batista, Fernando
    Coheur, Luisa
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [35] Application of Natural Language Processing in Nephrology Research
    Farrell, Douglas
    Chan, Lili
    CLINICAL JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2023, 18 (06): : 806 - 808
  • [36] Exploiting deep representations for natural language processing
    Zi-Yi Dou
    Xing Wang
    Shuming Shi
    Zhaopeng Tu
    NEUROCOMPUTING, 2020, 386 (386) : 1 - 7
  • [37] Natural language based financial forecasting: a survey
    Xing, Frank Z.
    Cambria, Erik
    Welsch, Roy E.
    ARTIFICIAL INTELLIGENCE REVIEW, 2018, 50 (01) : 49 - 73
  • [38] The fusion of fuzzy theories and natural language processing: A state-of-the-art survey
    Liu, Ming
    Zhang, Hongjun
    Xu, Zeshui
    Ding, Kun
    APPLIED SOFT COMPUTING, 2024, 162
  • [39] Explainable Automatic Industrial Carbon Footprint Estimation From Bank Transaction Classification Using Natural Language Processing
    Gonzalez-Gonzalez, Jaime
    Garcia-Mendez, Silvia
    De Arriba-Perez, Francisco
    Gonzalez-Castano, Francisco J.
    Barba-Seara, Oscar
    IEEE ACCESS, 2022, 10 : 126326 - 126338
  • [40] Explainable multimodal prediction of treatment-resistance in patients with depression leveraging brain morphometry and natural language processing
    Kim, Narae
    Kim, Narae
    Park, Chulhyoung
    Gan, Sujin
    Son, Sang Joon
    Park, Rae Woong
    Park, Bumhee
    PSYCHIATRY RESEARCH, 2024, 334