Reducing uncertainty at the score-to-LR stage in likelihood ratio-based forensic voice comparison using automatic speaker recognition systems

被引:2
作者
Wang, Bruce Xiao [1 ]
Hughes, Vincent [1 ]
机构
[1] Univ York, Dept Language & Linguist Sci, York, N Yorkshire, England
来源
INTERSPEECH 2022 | 2022年
关键词
forensic voice comparison; likelihood-ratio; logistic regression; Bayesian model; uncertainty; POPULATION;
D O I
10.21437/Interspeech.2022-518
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In data-driven forensic voice comparison (FVC), empirical testing of a system is an essential step to demonstrate validity and reliability. Numerous studies have focused on improving system validity, while studies of reliability are comparatively limited. In the present study, simulated scores were generated from i-vector and GMM-UBM automatic speaker recognition systems using real speech data to demonstrate the variability in system reliability as a function of score skewness, sample size, and calibration methods (logistic regression or a Bayesian model). Using logistic regression with small samples of skewed scores, C-llr range is 1.3 for the i-vector system and 0.69 for the GMM-UBM system. When scores follow a normal distribution, C-llr ranges reduce to 0.49 (i-vector) and 0.69 (GMM-UBM). Using the Bayesian model, the C-llr ranges are 0.31 and 0.60 for i-vector and GMM-UBM systems respectively when scores are skewed, and the C-llr range remains stable when scores follow a normal distribution irrespective of sample size. The results suggests that score skewness has a substantial effect on system reliability. With this in mind, in FVC it may be preferable to use an older generation of system which produces less variable results, but slightly weaker discrimination, especially when sample size is small.
引用
收藏
页码:5243 / 5247
页数:5
相关论文
共 25 条
[1]   Evaluation of trace evidence in the form of multivariate data [J].
Aitken, CGG ;
Lucy, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 :109-122
[2]   Sampling variability in forensic likelihood-ratio computation: A simulation study [J].
Ali, Tauseef ;
Spreeuwers, Luuk ;
Veldhuis, Raymond ;
Meuwly, Didier .
SCIENCE & JUSTICE, 2015, 55 (06) :499-508
[3]   The centred parameterization and related quantities of the skew-t distribution [J].
Arellano-Valle, Reinaldo B. ;
Azzalini, Adelchi .
JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 113 :73-90
[4]  
Azzalini A., 2020, R PACKAGE SN SKEW NO
[5]   Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006 [J].
Bruemmer, Niko ;
Burget, Lukas ;
Cernocky, Jan 'Honza' ;
Glembek, Ondrej ;
Grezl, Frantisek ;
Karafiat, Martin ;
van Leeuwen, David A. ;
Matejka, Pavel ;
Schwarz, Petr ;
Strasheim, Albert .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :2072-2084
[6]  
Brümmer N, 2014, INTERSPEECH, P388
[7]   A demonstration of the application of the new paradigm for the evaluation of forensic evidence under conditions reflecting those of a real forensic-voice-comparison case [J].
Enzinger, Ewald ;
Morrison, Geoffrey Stewart ;
Ochoa, Felipe .
SCIENCE & JUSTICE, 2016, 56 (01) :42-57
[8]   CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (02) :254-272
[9]   Exploring the relationship between voice similarity estimates by listeners and by an automatic speaker recognition system incorporating phonetic features [J].
Gerlach, Linda ;
McDougall, Kirsty ;
Kelly, Finnian ;
Alexander, Anil ;
Nolan, Francis .
SPEECH COMMUNICATION, 2020, 124 :85-95
[10]  
Hughes V., 2019, P INT C PHON SCI 4 1, P3080