Confidence Calibration in a Multiyear Geopolitical Forecasting Competition

Cited by: 36
Authors
Moore, Don A. [1]
Swift, Samuel A. [2]
Minster, Angela [3]
Mellers, Barbara [3]
Ungar, Lyle [3]
Tetlock, Philip [3]
Yang, Heather H. J. [4]
Tenney, Elizabeth R. [5]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Betterment LLC, New York, NY 10010 USA
[3] Univ Penn, Philadelphia, PA 19104 USA
[4] MIT, Cambridge, MA 02139 USA
[5] Univ Utah, Salt Lake City, UT 84112 USA
Keywords
confidence; overconfidence; forecasting; prediction; PROBABILITY JUDGMENT; OVERCONFIDENCE; ACCURACY; ERROR; INFORMATION; PERFORMANCE; AVERAGE; BIAS; UNDERCONFIDENCE; PREDICTIONS
DOI
10.1287/mnsc.2016.2525
Chinese Library Classification (CLC) number
C93 [Management Science]
Discipline classification codes
12; 1201; 1202; 120202
Abstract
This research examines the development of confidence and accuracy over time in the context of forecasting. Although overconfidence has been studied in many contexts, little research examines its progression over long periods of time or in consequential policy domains. This study employs a unique data set from a geopolitical forecasting tournament spanning three years in which thousands of forecasters predicted the outcomes of hundreds of events. We sought to apply insights from research to structure the questions, interactions, and elicitations to improve forecasts. Indeed, forecasters' confidence roughly matched their accuracy. As information came in, accuracy increased. Confidence increased at approximately the same rate as accuracy, and good calibration persisted. Nevertheless, there was evidence of a small amount of overconfidence (3%), especially on the most confident forecasts. Training helped reduce overconfidence, and team collaboration improved forecast accuracy. Together, teams and training reduced overconfidence to 1%. Our results provide reason for tempered optimism regarding confidence calibration and its development over time in consequential field contexts.
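As a rough illustration of the overconfidence figures quoted in the abstract, the gap between confidence and accuracy can be sketched as mean confidence minus hit rate. This is a minimal sketch under assumptions not stated in this record: questions are treated as binary, confidence is taken as the probability placed on the favored option, and the function name `overconfidence` and the sample data are hypothetical.

```python
from statistics import mean


def overconfidence(forecasts):
    """Return mean confidence minus hit rate.

    forecasts: list of (p, outcome) pairs, where p is the probability
    assigned to the event and outcome is 1 if it occurred, else 0.
    Assumption: binary questions only; not the authors' exact procedure.
    """
    confidences = []
    hits = []
    for p, outcome in forecasts:
        # Confidence is the probability placed on the option judged more likely.
        confidences.append(max(p, 1.0 - p))
        # A hit means the favored option is the one that actually occurred.
        favored = 1 if p >= 0.5 else 0
        hits.append(1.0 if favored == outcome else 0.0)
    # Positive values indicate overconfidence, negative values underconfidence.
    return mean(confidences) - mean(hits)


# Hypothetical forecasts, for illustration only.
sample = [(0.9, 1), (0.8, 1), (0.7, 0), (0.2, 0), (0.6, 1)]
gap = overconfidence(sample)
print(f"confidence minus accuracy: {gap:+.2f}")
```

A value near zero corresponds to good calibration in this simplified sense; a value of 0.03 would correspond to the 3% overconfidence mentioned above.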
Pages: 3552-3565
Page count: 14
Related papers
50 in total
  • [31] Computer model calibration with confidence and consistency
    Plumlee, Matthew
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (03) : 519 - 545
  • [32] The M4 Competition: 100,000 time series and 61 forecasting methods
    Makridakis, Spyros
    Spiliotis, Evangelos
    Assimakopoulos, Vassilios
    INTERNATIONAL JOURNAL OF FORECASTING, 2020, 36 (01) : 54 - 74
  • [33] The Value of Precision in Probability Assessment: Evidence from a Large-Scale Geopolitical Forecasting Tournament
    Friedman, Jeffrey A.
    Baker, Joshua D.
    Mellers, Barbara A.
    Tetlock, Philip E.
    Zeckhauser, Richard
    INTERNATIONAL STUDIES QUARTERLY, 2018, 62 (02) : 410 - 422
  • [34] Use of Radar Quantitative Precipitation Estimates (QPEs) for Improved Hydrological Model Calibration and Flood Forecasting
    Wijayarathne, Dayal
    Coulibaly, Paulin
    Boodoo, Sudesh
    Sills, David
    JOURNAL OF HYDROMETEOROLOGY, 2021, 22 (08) : 2033 - 2053
  • [35] Corporate sustainability practices: An interplay of uncertainty, geopolitical risk and competition
    Bhue, Rajesh
    Gartia, Umakanta
    Panda, Ajaya Kumar
    Tiwari, Aviral Kumar
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2025, 376
  • [36] Measuring the Confidence of Single-Point Traffic Forecasting Models: Techniques, Experimental Comparison, and Guidelines Toward Their Actionability
    Lana, Ibai
    Olabarrieta, Ignacio
    Del Ser, Javier
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 11180 - 11199
  • [37] Accuracy, Confidence, and Calibration: How Young Children and Adults Assess Credibility
    Tenney, Elizabeth R.
    Small, Jenna E.
    Kondrad, Robyn L.
    Jaswal, Vikram K.
    Spellman, Barbara A.
    DEVELOPMENTAL PSYCHOLOGY, 2011, 47 (04) : 1065 - 1077
  • [38] Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration
    Chen, Tianshui
    Wang, Weihang
    Pu, Tao
    Qin, Jinghui
    Yang, Zhijing
    Liu, Jie
    Lin, Liang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4811 - 4823
  • [39] Clustering, Knowledge Sharing, and Intrabrand Competition: A Multiyear Analysis of an Evolving Franchise System
    Butt, Moeen Naseer
    Antia, Kersi D.
    Murtha, Brian R.
    Kashyap, Vishal
    JOURNAL OF MARKETING, 2018, 82 (01) : 74 - 92
  • [40] Product market competition and analyst forecasting activity: International evidence
    Haw, In-Mu
    Hu, Bingbing
    Lee, Jay Junghun
    JOURNAL OF BANKING & FINANCE, 2015, 56 : 48 - 60