Failure Prediction in 2D Document Information Extraction with Calibrated Confidence Scores

被引:0
作者
Kivimaki, Juhani [1 ]
Lebedev, Aleksey [2 ]
Nurminen, Jukka K. [1 ]
机构
[1] Univ Helsinki, Helsinki, Finland
[2] Basware Inc, Espoo, Finland
来源
2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC | 2023年
关键词
machine learning; uncertainty estimation; confidence calibration; failure prediction; information extraction;
D O I
10.1109/COMPSAC57700.2023.00033
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Modern machine learning models can achieve impressive results in many tasks, but often fail to express reliably how confident they are with their predictions. In an industrial setting, the end goal is usually not a prediction of a model, but a decision based on that prediction. It is often not sufficient to generate high-accuracy predictions on average. One also needs to estimate the uncertainty and risks involved when making related decisions. Thus, having reliable and calibrated uncertainty estimates is highly useful for any model used in automated decision-making. In this paper, we present a case study, where we propose a novel method to improve the uncertainty estimates of an in-production machine learning model operating in an industrial setting with real-life data. This model is used by Basware, a Finnish software company, to extract information from invoices in the form of machine-readable PDFs. The solution we propose is shown to produce calibrated confidence estimates, which outperform legacy estimates on several relevant metrics, increasing coverage of automated invoices from 65.6% to 73.2% with no increase in error rate.
引用
收藏
页码:193 / 202
页数:10
相关论文
共 46 条
  • [1] Adams R.P., 2012, 25 INT C NEURAL INFP, P2951, DOI DOI 10.5555/2999325.2999464.47
  • [2] Alexandari AM, 2020, PR MACH LEARN RES, V119
  • [3] Amodei D, 2016, Arxiv, DOI arXiv:1606.06565
  • [4] Baldwin C.Y., 2000, DESIGN RULES POWER M, V1
  • [5] Barber D., 2012, Bayesian Reasoning and Machine Learning.
  • [6] Bensch O, 2021, Arxiv, DOI arXiv:2106.14624
  • [7] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [8] Chen Tongfei, 2019, P MACHINE LEARNING R, V89
  • [9] Corbière C, 2019, ADV NEUR IN, V32
  • [10] Culotta Aron., 2004, P HLTNAACL 04, P109