The effect of sampling variability on systems and individual speakers in likelihood ratio-based forensic voice comparison

被引：6

作者：

Wang, Bruce Xiao ^{[1
]}

Hughes, Vincent ^{[1
]}

Foulkes, Paul ^{[1
]}

机构：

[1] Univ York, Dept Language & Linguist Sci, York YO10 5DD, England

来源：

SPEECH COMMUNICATION | 2022年 / 138卷

关键词：

Forensic phonetics; Likelihood ratio; Sampling variability; Individual behavior; STRENGTH;

D O I：

10.1016/j.specom.2022.01.009

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The likelihood ratio (LR) framework has been widely adopted in voice (and other forensic) evidence evaluation. However, in developing any forensic comparison system, it is necessary to make subjective and pragmatic decisions, which in turn may affect the results that system produces. One such decision relates to not only the size of the samples used for training and testing the system, but also which specific individuals are used in the samples. The current study explores the relationship between sampling variability (i.e. the choice of speakers used for training and testing systems, rather than sample size) and the choice of linguistic features used. The first three formants and f0 from the vocalic portion of the filled pause um were used as input, as well as both vowel and nasal durations. 25 speakers were used in test, training and reference sets respectively. Experiments were carried out using all 31 logically possible combinations of features, and replicated 100 times using different configurations of 25 training and reference speakers. The results show that (a) overall, Cllr mean reduces with more features involved and no clear pattern is observed in Cllr range; meanwhile, considerable fluctuation is observed within individual speakers; (b) while the majority of speakers yield stronger mean LLRs in systems with three or more features, a few speakers can be well-separated using one or two features; (c) sampling variability in the training and reference speakers has limited effect on individual test speakers' LR outputs in same-speaker (SS) comparisons, but a marked effect on different-speaker (DS) LRs.

引用

页码：38 / 49

页数：12

共 51 条

[31] Avoiding overstating the strength of forensic evidence: Shrunk likelihood ratios/Bayes factors [J].

Morrison, Geoffrey Stewart ;

Poh, Norman .

SCIENCE & JUSTICE, 2018, 58 (03) :200-218

[32] What should a forensic practitioner's likelihood ratio be? [J].

Morrison, Geoffrey Stewart ;

Enzinger, Ewald .

SCIENCE & JUSTICE, 2016, 56 (05) :374-379

[33] Special issue on measuring and reporting the precision of forensic likelihood ratios: Introduction to the debate [J].

Morrison, Geoffrey Stewart .

SCIENCE & JUSTICE, 2016, 56 (05) :371-373

[34] Tutorial on logistic-regression calibration and fusion:converting a score to a likelihood ratio [J].

Morrison, Geoffrey Stewart .

AUSTRALIAN JOURNAL OF FORENSIC SCIENCES, 2013, 45 (02) :173-197

[35] A comparison of procedures for the calculation of forensic likelihood ratios from acoustic-phonetic data Multivariate kernel density (MVKD) versus Gaussian mixture model-universal background model (GMM-UBM) [J].

Morrison, Geoffrey Stewart .

SPEECH COMMUNICATION, 2011, 53 (02) :242-256

[36] Forensic voice comparison and the paradigm shift [J].

Morrison, Geoffrey Stewart .

SCIENCE & JUSTICE, 2009, 49 (04) :298-308

[37] Forensic: voice comparison using likelihood ratios based on polynomial curves fitted to the formant trajectories of Australian English |aI| [J].

Morrison, Geoffrey Stewart .

INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2008, 15 (02) :249-266

[38]

Nolan F., 2001, P LAW LANG PROSP RET, P12

[39] The DyViS database: style-controlled recordings of 100 homogeneous speakers for forensic phonetic research [J].

Nolan, Francis ;

McDougall, Kirsty ;

de Jong, Gea ;

Hudson, Toby .

INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2009, 16 (01) :31-57

[40]

R core team, 2020, RSTUDIO INT DEV R

← 1 2 3 4 5 6 →