Uncertainty Quantification for Text Classification

被引：4

作者：

Zhang, Dell ^{[1
]}

Sensoy, Murat ^{[2
]}

Makrehchi, Masoud ^{[3
]}

Taneva-Popova, Bilyana ^{[4
]}

Gui, Lin ^{[5
]}

He, Yulan ^{[5
,6
]}

机构：

[1] Thomson Reuters Labs, London, England

[2] Amazon Alexa AI, London, England

[3] Thomson Reuters Labs, Toronto, ON, Canada

[4] Thomson Reuters Labs, Zug, Switzerland

[5] Kings Coll London, London, England

[6] Alan Turing Inst, London, England

来源：

PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年

基金：

英国工程与自然科学研究理事会;

关键词：

uncertainty quantification; text classification; language models;

D O I：

10.1145/3539618.3594243

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This full-day tutorial introduces modern techniques for practical uncertainty quantification specifically in the context of multi-class and multi-label text classification. First, we explain the usefulness of estimating aleatoric uncertainty and epistemic uncertainty for text classification models. Then, we describe several state-of-the-art approaches to uncertainty quantification and analyze their scalability to big text data: Virtual Ensemble in GBDT, Bayesian Deep Learning (including Deep Ensemble, Monte-Carlo Dropout, Bayes by Backprop, and their generalization Epistemic Neural Networks), Evidential Deep Learning (including Prior Networks and Posterior Networks), as well as Distance Awareness (including Spectral-normalized Neural Gaussian Process and Deep Deterministic Uncertainty). Next, we talk about the latest advances in uncertainty quantification for pre-trained language models (including asking language models to express their uncertainty, interpreting uncertainties of text classifiers built on large-scale language models, uncertainty estimation in text generation, calibration of language models, and calibration for in-context learning). After that, we discuss typical application scenarios of uncertainty quantification in text classification (including in-domain calibration, cross-domain robustness, and novel class detection). Finally, we list popular performance metrics for the evaluation of uncertainty quantification effectiveness in text classification. Practical hands-on examples/exercises are provided to the attendees for them to experiment with different uncertainty quantification methods on a few real-world text classification datasets such as CLINC150.

引用

页码：3426 / 3429

页数：4

共 50 条

[1] Uncertainty Quantification for Text Classification
Zhang, Dell
Sensoy, Murat
Makrehchi, Masoud
Taneva-Popova, Bilyana
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 362 - 369
[2] Deep learning uncertainty quantification for clinical text classification
Peluso, Alina
Danciu, Ioana
Yoon, Hong-Jun
Yusof, Jamaludin Mohd
Bhattacharya, Tanmoy
Spannaus, Adam
Schaefferkoetter, Noah
Durbin, Eric B.
Wu, Xiao-Cheng
Stroup, Antoinette
Doherty, Jennifer
Schwartz, Stephen
Wiggins, Charles
Coyle, Linda
Penberthy, Lynne
Tourassi, Georgia D.
Gao, Shang
JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149
[3] Efficient Uncertainty Quantification for Multilabel Text Classification
Yu, Jialin
Cristea, Alexandra, I
Harit, Anoushka
Sun, Zhongtian
Aduragba, Olanrewaju Tahir
Shi, Lei
Al Moubayed, Noura
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[4] Revisiting Softmax for Uncertainty Approximation in Text Classification
Holm, Andreas Nugaard
Wright, Dustin
Augenstein, Isabelle
INFORMATION, 2023, 14 (07)
[5] Uncertainty Quantification for Extreme Classification
Jiang, Jyun-Yu
Chang, Wei-Cheng
Zhang, Jiong
Hsieh, Cho-Jui
Yu, Hsiang-Fu
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1649 - 1659
[6] Benchmarking Scalable Predictive Uncertainty in Text Classification
Van Landeghem, Jordy
Blaschko, Matthew
Anckaert, Bertrand
Moens, Marie-Francine
IEEE ACCESS, 2022, 10 : 43703 - 43737
[7] Uncertainty-Aware Reliable Text Classification
Hu, Yibo
Khan, Latifur
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 628 - 636
[8] Uncertainty Quantification and Estimation in Medical Image Classification
Yang, Sidi
Fevens, Thomas
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 671 - 683
[9] Explaining Prediction Uncertainty in Text Classification: The DUX Approach
Andersen, Jakob Smedegaard
Zukunft, Olaf
PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 57 - 62
[10] Risk-aware classification via uncertainty quantification
Sensoy, Murat
Kaplan, Lance M.
Julier, Simon
Saleki, Maryam
Cerutti, Federico
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265

← 1 2 3 4 5 →