Uncertainty Quantification for Text Classification

被引:4
|
作者
Zhang, Dell [1 ]
Sensoy, Murat [2 ]
Makrehchi, Masoud [3 ]
Taneva-Popova, Bilyana [4 ]
Gui, Lin [5 ]
He, Yulan [5 ,6 ]
机构
[1] Thomson Reuters Labs, London, England
[2] Amazon Alexa AI, London, England
[3] Thomson Reuters Labs, Toronto, ON, Canada
[4] Thomson Reuters Labs, Zug, Switzerland
[5] Kings Coll London, London, England
[6] Alan Turing Inst, London, England
来源
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年
基金
英国工程与自然科学研究理事会;
关键词
uncertainty quantification; text classification; language models;
D O I
10.1145/3539618.3594243
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This full-day tutorial introduces modern techniques for practical uncertainty quantification specifically in the context of multi-class and multi-label text classification. First, we explain the usefulness of estimating aleatoric uncertainty and epistemic uncertainty for text classification models. Then, we describe several state-of-the-art approaches to uncertainty quantification and analyze their scalability to big text data: Virtual Ensemble in GBDT, Bayesian Deep Learning (including Deep Ensemble, Monte-Carlo Dropout, Bayes by Backprop, and their generalization Epistemic Neural Networks), Evidential Deep Learning (including Prior Networks and Posterior Networks), as well as Distance Awareness (including Spectral-normalized Neural Gaussian Process and Deep Deterministic Uncertainty). Next, we talk about the latest advances in uncertainty quantification for pre-trained language models (including asking language models to express their uncertainty, interpreting uncertainties of text classifiers built on large-scale language models, uncertainty estimation in text generation, calibration of language models, and calibration for in-context learning). After that, we discuss typical application scenarios of uncertainty quantification in text classification (including in-domain calibration, cross-domain robustness, and novel class detection). Finally, we list popular performance metrics for the evaluation of uncertainty quantification effectiveness in text classification. Practical hands-on examples/exercises are provided to the attendees for them to experiment with different uncertainty quantification methods on a few real-world text classification datasets such as CLINC150.
引用
收藏
页码:3426 / 3429
页数:4
相关论文
共 50 条
  • [1] Uncertainty Quantification for Text Classification
    Zhang, Dell
    Sensoy, Murat
    Makrehchi, Masoud
    Taneva-Popova, Bilyana
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 362 - 369
  • [2] Deep learning uncertainty quantification for clinical text classification
    Peluso, Alina
    Danciu, Ioana
    Yoon, Hong-Jun
    Yusof, Jamaludin Mohd
    Bhattacharya, Tanmoy
    Spannaus, Adam
    Schaefferkoetter, Noah
    Durbin, Eric B.
    Wu, Xiao-Cheng
    Stroup, Antoinette
    Doherty, Jennifer
    Schwartz, Stephen
    Wiggins, Charles
    Coyle, Linda
    Penberthy, Lynne
    Tourassi, Georgia D.
    Gao, Shang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149
  • [3] Efficient Uncertainty Quantification for Multilabel Text Classification
    Yu, Jialin
    Cristea, Alexandra, I
    Harit, Anoushka
    Sun, Zhongtian
    Aduragba, Olanrewaju Tahir
    Shi, Lei
    Al Moubayed, Noura
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] Revisiting Softmax for Uncertainty Approximation in Text Classification
    Holm, Andreas Nugaard
    Wright, Dustin
    Augenstein, Isabelle
    INFORMATION, 2023, 14 (07)
  • [5] Uncertainty Quantification for Extreme Classification
    Jiang, Jyun-Yu
    Chang, Wei-Cheng
    Zhang, Jiong
    Hsieh, Cho-Jui
    Yu, Hsiang-Fu
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1649 - 1659
  • [6] Benchmarking Scalable Predictive Uncertainty in Text Classification
    Van Landeghem, Jordy
    Blaschko, Matthew
    Anckaert, Bertrand
    Moens, Marie-Francine
    IEEE ACCESS, 2022, 10 : 43703 - 43737
  • [7] Uncertainty-Aware Reliable Text Classification
    Hu, Yibo
    Khan, Latifur
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 628 - 636
  • [8] Uncertainty Quantification and Estimation in Medical Image Classification
    Yang, Sidi
    Fevens, Thomas
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 671 - 683
  • [9] Explaining Prediction Uncertainty in Text Classification: The DUX Approach
    Andersen, Jakob Smedegaard
    Zukunft, Olaf
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 57 - 62
  • [10] Risk-aware classification via uncertainty quantification
    Sensoy, Murat
    Kaplan, Lance M.
    Julier, Simon
    Saleki, Maryam
    Cerutti, Federico
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265