Relevance-based Infilling for Natural Language Counterfactuals

被引:0
|
作者
Betti, Lorenzo [1 ,2 ]
Abrate, Carlo [3 ,4 ]
Bonchi, Francesco [3 ,5 ]
Kaltenbrunner, Andreas [1 ,6 ]
机构
[1] ISI Fdn, Turin, Italy
[2] Cent European Univ, Dept Network & Data Sci, Vienna, Austria
[3] CENTAI, Turin, Italy
[4] Sapienza Univ, Rome, Italy
[5] Eurecat, Barcelona, Spain
[6] Univ Oberta Catalunya, Barcelona, Spain
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
NLP; masked language model; explainability; counterfactuals;
D O I
10.1145/3583780.3615029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counterfactual explanations are a natural way for humans to gain understanding and trust in the outcomes of complex machine learning algorithms. In the context of natural language processing, generating counterfactuals is particularly challenging as it requires the generated text to be fluent, grammatically correct, and meaningful. In this study, we improve the current state of the art for the generation of such counterfactual explanations for text classifiers. Our approach, named RELITC (Relevance-based Infilling for Textual Counterfactuals), builds on the idea of masking a fraction of text tokens based on their importance in a given prediction task and employs a novel strategy, based on the entropy of their associated probability distributions, to determine the infilling order of these tokens. Our method uses less time than competing methods to generate counterfactuals that require less changes, are closer to the original text and preserve its content better, while being competitive in terms of fluency. We demonstrate the effectiveness of the method on four different datasets and show the quality of its outcomes in a comparison with human generated counterfactuals.(1)
引用
收藏
页码:88 / 98
页数:11
相关论文
共 50 条
  • [1] Robust relevance-based language models
    Li, Xiaoyan
    PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INTERNET, AND INFORMATION TECHNOLOGY, 2006, : 341 - 348
  • [2] Relevance-based language modelling for recommender systems
    Parapar, Javier
    Bellogin, Alejandro
    Castells, Pablo
    Barreiro, Alvaro
    INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (04) : 966 - 980
  • [3] Relevance-based Word Embedding
    Zamani, Hamed
    Croft, W. Bruce
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 505 - 514
  • [4] Relevance-Based Entity Embedding
    Zeng, Weixin
    Zhao, Xiang
    Tang, Jiuyang
    Liao, Jinzhi
    Wang, Chang-Dong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 300 - 304
  • [5] Relevance-based curriculum for technical education
    Hora, M
    Somkuwar, V
    GLOBAL CONGRESS ON ENGINEERING EDUCATION INCORPORATING: 5TH WORLD CONFERENCE ON ENGINEERING EDUCATION/4TH EAST-WEST CONGRESS ON ENGINEERING EDUCATION/1998 INTERNATIONAL CONGRESS OF ENGINEERING DEANS AND INDUSTRY LEADERS, CONGRESS PROCEEDINGS, 1998, : 386 - 389
  • [6] Generating Realistic Natural Language Counterfactuals
    Robeer, Marcel
    Bex, Floris
    Feelders, Ad
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3611 - 3625
  • [7] A relevance-based approach to poetry in translation
    Dahlgren, M
    PERSPECTIVES-STUDIES IN TRANSLATOLOGY, 2000, 8 (02): : 97 - 108
  • [8] Relevance-based content extraction of HTML documents
    吴麒
    陈兴蜀
    朱锴
    王春晖
    Journal of Central South University, 2012, 19 (07) : 1921 - 1926
  • [9] Relevance-Based Selectivity: The Case of Implicit Learning
    Eitam, Baruch
    Glicksohn, Arit
    Shoval, Roy
    Cohen, Asher
    Schul, Yaacov
    Hassin, Ran R.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2013, 39 (06) : 1508 - 1515
  • [10] Relevance-based abstraction identification: technique and evaluation
    Gacitua, Ricardo
    Sawyer, Pete
    Gervasi, Vincenzo
    REQUIREMENTS ENGINEERING, 2011, 16 (03) : 251 - 265