A composite Bayesian hierarchical model of compositional data with zeros

被引:8
作者
Napier, Gary [1 ]
Neocleous, Tereza [1 ]
Nobile, Agostino [2 ]
机构
[1] Univ Glasgow, Sch Math & Stat, Glasgow, Lanark, Scotland
[2] Univ York, Dept Math, York YO10 5DD, N Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Bayes factor; classification; evidence evaluation; forensic glass; Markov chain Monte Carlo; GLASS FRAGMENTS; TRANSFORMATIONS;
D O I
10.1002/cem.2681
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an effective approach for modelling compositional data with large concentrations of zeros and several levels of variation, applied to a database of elemental compositions of forensic glass of various use types. The procedure consists of the following: (i) partitioning the data set in subsets characterised by the same pattern of presence/absence of chemical elements and (ii) fitting a Bayesian hierarchical model to the transformed compositions in each data subset. We derive expressions for the posterior predictive probability that newly observed fragments of glass are of a certain use type and for computing the evidential value of glass fragments relating to two competing propositions about their source. The model is assessed using cross-validation, and it performs well in both the classification and evidence evaluation tasks. Copyright (c) 2014 John Wiley & Sons, Ltd. We present an effective approach for modelling chemical compositions with large concentrations of zeros. The dataset is partitioned into subsets characterized by the same pattern of presence/absence of chemical elements, and hierarchical models are fitted to transformed compositions in each subset. These are combined into a composite model, which outperforms support vector machines in classification of glass fragments in a simulation study. The composite model also performs well in measuring the evidential value elemental compositions of glass.
引用
收藏
页码:96 / 108
页数:13
相关论文
共 28 条
[1]   Evaluation of trace evidence in the form of multivariate data [J].
Aitken, CGG ;
Lucy, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 :109-122
[2]  
[Anonymous], J ROYAL STAT SOC B
[3]  
[Anonymous], STAT ANAL COMPOSITIO
[4]  
[Anonymous], STAT EVALUATION EVID
[5]  
Brier G. W., 1950, Monthly Weather Review, V78, P1, DOI 10.1175/1520-0493(1950)0782.0.CO
[6]  
2
[7]   A latent Gaussian model for compositional data with zeros [J].
Butler, Adam ;
Glasbey, Chris .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2008, 57 :505-520
[8]   Compositional data analysis for elemental data in forensic science [J].
Campbell, Gareth P. ;
Curran, James M. ;
Miskelly, Gordon M. ;
Coulson, Sally ;
Yaxley, Gregory M. ;
Grunsky, Eric C. ;
Cox, Simon C. .
FORENSIC SCIENCE INTERNATIONAL, 2009, 188 (1-3) :81-90
[9]   EXPLAINING THE GIBBS SAMPLER [J].
CASELLA, G ;
GEORGE, EI .
AMERICAN STATISTICIAN, 1992, 46 (03) :167-174
[10]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46