Uncertainty Quantification for Text Classification

被引:4
|
作者
Zhang, Dell [1 ]
Sensoy, Murat [2 ]
Makrehchi, Masoud [3 ]
Taneva-Popova, Bilyana [4 ]
Gui, Lin [5 ]
He, Yulan [5 ,6 ]
机构
[1] Thomson Reuters Labs, London, England
[2] Amazon Alexa AI, London, England
[3] Thomson Reuters Labs, Toronto, ON, Canada
[4] Thomson Reuters Labs, Zug, Switzerland
[5] Kings Coll London, London, England
[6] Alan Turing Inst, London, England
来源
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年
基金
英国工程与自然科学研究理事会;
关键词
uncertainty quantification; text classification; language models;
D O I
10.1145/3539618.3594243
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This full-day tutorial introduces modern techniques for practical uncertainty quantification specifically in the context of multi-class and multi-label text classification. First, we explain the usefulness of estimating aleatoric uncertainty and epistemic uncertainty for text classification models. Then, we describe several state-of-the-art approaches to uncertainty quantification and analyze their scalability to big text data: Virtual Ensemble in GBDT, Bayesian Deep Learning (including Deep Ensemble, Monte-Carlo Dropout, Bayes by Backprop, and their generalization Epistemic Neural Networks), Evidential Deep Learning (including Prior Networks and Posterior Networks), as well as Distance Awareness (including Spectral-normalized Neural Gaussian Process and Deep Deterministic Uncertainty). Next, we talk about the latest advances in uncertainty quantification for pre-trained language models (including asking language models to express their uncertainty, interpreting uncertainties of text classifiers built on large-scale language models, uncertainty estimation in text generation, calibration of language models, and calibration for in-context learning). After that, we discuss typical application scenarios of uncertainty quantification in text classification (including in-domain calibration, cross-domain robustness, and novel class detection). Finally, we list popular performance metrics for the evaluation of uncertainty quantification effectiveness in text classification. Practical hands-on examples/exercises are provided to the attendees for them to experiment with different uncertainty quantification methods on a few real-world text classification datasets such as CLINC150.
引用
收藏
页码:3426 / 3429
页数:4
相关论文
共 50 条
  • [31] Hercules: Deep Hierarchical Attentive Multilevel Fusion Model With Uncertainty Quantification for Medical Image Classification
    Abdar, Moloud
    Fahami, Mohammad Amin
    Rundo, Leonardo
    Radeva, Petia
    Frangi, Alejandro F.
    Acharya, U. Rajendra
    Khosravi, Abbas
    Lam, Hak-Keung
    Jung, Alexander
    Nahavandi, Saeid
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 274 - 285
  • [32] Converting numerical classification into text classification
    Macskassy, SA
    Hirsh, H
    Banerjee, A
    Dayanik, AA
    ARTIFICIAL INTELLIGENCE, 2003, 143 (01) : 51 - 77
  • [33] Optimal uncertainty quantification with model uncertainty and legacy data
    Kamga, P. -H. T.
    Li, B.
    McKerns, M.
    Nguyen, L. H.
    Ortiz, M.
    Owhadi, H.
    Sullivan, T. J.
    JOURNAL OF THE MECHANICS AND PHYSICS OF SOLIDS, 2014, 72 : 1 - 19
  • [34] On the Statistical Formalism of Uncertainty Quantification
    Berger, James O.
    Smith, Leonard A.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 6, 2019, 6 : 433 - 460
  • [35] Uncertainty quantification in game theory
    de Cursi, Eduardo Souza
    CHAOS SOLITONS & FRACTALS, 2021, 143
  • [36] On uncertainty quantification in vibroacoustic problems
    Sepahvand, K.
    Marburg, S.
    EURODYN 2014: IX INTERNATIONAL CONFERENCE ON STRUCTURAL DYNAMICS, 2014, : 3259 - 3263
  • [37] Multifidelity approaches for uncertainty quantification
    Biehler J.
    Mäck M.
    Nitzler J.
    Hanss M.
    Koutsourelakis P.-S.
    Wall W.A.
    GAMM Mitteilungen, 2019, 42 (02)
  • [38] CONVEX OPTIMAL UNCERTAINTY QUANTIFICATION
    Han, Shuo
    Tao, Molei
    Topcu, Ufuk
    Owhadi, Houman
    Murray, Richard M.
    SIAM JOURNAL ON OPTIMIZATION, 2015, 25 (03) : 1368 - 1387
  • [39] On uncertainty quantification in hydrogeology and hydrogeophysics
    Linde, Niklas
    Ginsbourger, David
    Irving, James
    Nobile, Fabio
    Doucet, Arnaud
    ADVANCES IN WATER RESOURCES, 2017, 110 : 166 - 181
  • [40] Uncertainty quantification for reservoir geomechanics
    Pereira, Leonardo Cabral
    Sanchez, Marcelo
    do Nascimento Guimaraes, Leonardo Jose
    GEOMECHANICS FOR ENERGY AND THE ENVIRONMENT, 2016, 8 : 76 - 84