Deconstructing Cross-Entropy for Probabilistic Binary Classifiers

被引:75
作者
Ramos, Daniel [1 ]
Franco-Pedroso, Javier [1 ]
Lozano-Diez, Alicia [1 ]
Gonzalez-Rodriguez, Joaquin [1 ]
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, AuDIaS Audio Data Intelligence & Speech, Calle Francisco Tomas & Valiente 11, Madrid 28049, Spain
关键词
Bayesian; cross-entropy; probabilistic; classifier; discrimination; calibration; ECE plot; CALIBRATION; SYSTEMS;
D O I
10.3390/e20030208
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In this work, we analyze the cross-entropy function, widely used in classifiers both as a performance measure and as an optimization objective. We contextualize cross-entropy in the light of Bayesian decision theory, the formal probabilistic framework for making decisions, and we thoroughly analyze its motivation, meaning and interpretation from an information-theoretical point of view. In this sense, this article presents several contributions: First, we explicitly analyze the contribution to cross-entropy of (i) prior knowledge; and (ii) the value of the features in the form of a likelihood ratio. Second, we introduce a decomposition of cross-entropy into two components: discrimination and calibration. This decomposition enables the measurement of different performance aspects of a classifier in a more precise way; and justifies previously reported strategies to obtain reliable probabilities by means of the calibration of the output of a discriminating classifier. Third, we give different information-theoretical interpretations of cross-entropy, which can be useful in different application scenarios, and which are related to the concept of reference probabilities. Fourth, we present an analysis tool, the Empirical Cross-Entropy (ECE) plot, a compact representation of cross-entropy and its aforementioned decomposition. We show the power of ECE plots, as compared to other classical performance representations, in two diverse experimental examples: a speaker verification system, and a forensic case where some glass findings are present.
引用
收藏
页数:20
相关论文
共 49 条
[1]   Evaluation of trace evidence in the form of multivariate data (vol 53, pg 109, 2004) [J].
Aitken, CGG ;
Lucy, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 :665-666
[2]   Evaluation of trace evidence in the form of multivariate data [J].
Aitken, CGG ;
Lucy, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 :109-122
[3]   Expressing evaluative opinions: A position statement [J].
Aitken, Colin ;
Berger, Charles E. H. ;
Buckleton, John S. ;
Champod, Christophe ;
Curran, James ;
Dawid, A. P. ;
Evett, Ian W. ;
Gill, Peter ;
Gonzalez-Rodriguez, Joaquin ;
Jackson, Graham ;
Kloosterman, Ate ;
Lovelock, Tina ;
Lucy, David ;
Margot, Pierre ;
McKenna, Louise ;
Meuwly, Didier ;
Neumann, Cedric ;
Daeid, Niamh Nic ;
Nordgaard, Anders ;
Puch-Solis, Roberto ;
Rasmusson, Birgitta ;
Redmayne, Mike ;
Roberts, Paul ;
Robertson, Bernard ;
Roux, Claude ;
Sjerps, Marjan J. ;
Taroni, Franco ;
Tjin-A-Tsoi, Tjark ;
Vignaux, G. A. ;
Willis, Sheila M. ;
Zadora, Grzegorz .
SCIENCE & JUSTICE, 2011, 51 (01) :1-2
[4]  
[Anonymous], 2018, THESIS
[5]  
[Anonymous], P 8 INT C KNOWL DISC
[6]  
[Anonymous], TECHNICAL REPORT
[7]  
[Anonymous], LECT NOTES COMPUTER
[8]  
[Anonymous], 2012, MACHINE LEARNING PRO
[9]  
Brier GW., 1950, Monthly Weather Review, V78, P1, DOI [DOI 10.1175/1520-0493, 10.1175/1520-0493(, DOI 10.1175/1520-0493(1950)078ANDLT
[10]  
0001:VOFEITANDGT