Revisiting Softmax for Uncertainty Approximation in Text Classification

被引:3
|
作者
Holm, Andreas Nugaard [1 ]
Wright, Dustin [1 ]
Augenstein, Isabelle [1 ]
机构
[1] Univ Copenhagen, Dept Comp Sci, DK-1172 Copenhagen, Denmark
关键词
text classification; uncertainty quantification; efficiency;
D O I
10.3390/info14070420
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Uncertainty approximation in text classification is an important area with applications in domain adaptation and interpretability. One of the most widely used uncertainty approximation methods is Monte Carlo (MC) dropout, which is computationally expensive as it requires multiple forward passes through the model. A cheaper alternative is to simply use a softmax based on a single forward pass without dropout to estimate model uncertainty. However, prior work has indicated that these predictions tend to be overconfident. In this paper, we perform a thorough empirical analysis of these methods on five datasets with two base neural architectures in order to identify the trade-offs between the two. We compare both softmax and an efficient version of MC dropout on their uncertainty approximations and downstream text classification performance, while weighing their runtime (cost) against performance (benefit). We find that, while MC dropout produces the best uncertainty approximations, using a simple softmax leads to competitive, and in some cases better, uncertainty estimation for text classification at a much lower computational cost, suggesting that softmax can in fact be a sufficient uncertainty estimate when computational resources are a concern.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Uncertainty Quantification for Text Classification
    Zhang, Dell
    Sensoy, Murat
    Makrehchi, Masoud
    Taneva-Popova, Bilyana
    Gui, Lin
    He, Yulan
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3426 - 3429
  • [2] Uncertainty Quantification for Text Classification
    Zhang, Dell
    Sensoy, Murat
    Makrehchi, Masoud
    Taneva-Popova, Bilyana
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 362 - 369
  • [3] Text Classification Research Based on Improved SoftMax Regression Algorithm
    She, Xiangyang
    Zhu, Yinglong
    2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 273 - 276
  • [4] Deep learning uncertainty quantification for clinical text classification
    Peluso, Alina
    Danciu, Ioana
    Yoon, Hong-Jun
    Yusof, Jamaludin Mohd
    Bhattacharya, Tanmoy
    Spannaus, Adam
    Schaefferkoetter, Noah
    Durbin, Eric B.
    Wu, Xiao-Cheng
    Stroup, Antoinette
    Doherty, Jennifer
    Schwartz, Stephen
    Wiggins, Charles
    Coyle, Linda
    Penberthy, Lynne
    Tourassi, Georgia D.
    Gao, Shang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149
  • [5] Benchmarking Scalable Predictive Uncertainty in Text Classification
    Van Landeghem, Jordy
    Blaschko, Matthew
    Anckaert, Bertrand
    Moens, Marie-Francine
    IEEE ACCESS, 2022, 10 : 43703 - 43737
  • [6] Efficient Uncertainty Quantification for Multilabel Text Classification
    Yu, Jialin
    Cristea, Alexandra, I
    Harit, Anoushka
    Sun, Zhongtian
    Aduragba, Olanrewaju Tahir
    Shi, Lei
    Al Moubayed, Noura
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [7] Uncertainty-Aware Reliable Text Classification
    Hu, Yibo
    Khan, Latifur
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 628 - 636
  • [8] Explaining Prediction Uncertainty in Text Classification: The DUX Approach
    Andersen, Jakob Smedegaard
    Zukunft, Olaf
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 57 - 62
  • [9] XPROAX-Local explanations for text classification with progressive neighborhood approximation
    Cai, Yi
    Zimek, Arthur
    Ntoutsi, Eirini
    2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [10] Revisiting two-stage feature selection based on coverage policies for text classification
    Mendez-Molina, Arquimides
    Li Ona-Garcia, Ana
    Ariel Carrasco-Ochoa, Jesus
    Martinez-Trinidad, Jose Fco.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 2949 - 2957