Data strategies in forensic automatic speaker comparison

被引：3

作者：

van der Vloed, David ^{[1
]}

机构：

[1] Netherlands Forens Inst, Laan Ypenburg 6, NL-2497 GB The Hague, Netherlands

来源：

FORENSIC SCIENCE INTERNATIONAL | 2023年 / 350卷

关键词：

Automatic speaker recognition; Forensic speaker comparison; Forensic voice comparison; Forensic casework; Representative data; RECOGNITION;

D O I：

10.1016/j.forsciint.2023.111790

中图分类号：

DF [法律]; D9 [法律]; R [医药、卫生];

学科分类号：

0301 ; 10 ;

摘要：

Automatic speaker recognition (ASR) is a method used in forensic speaker comparison (FSC) casework. It needs collections of audio data that are representative of the case audio in order to perform reference normalization and to train a score-to-LR function. Audio from a certain minimum number of speakers is needed for each of those purposes to obtain relatively stable performance of ASR. Although it is not possible to set a hard cut-off, for the purpose of this work this number was chosen to be 30 for each, and 60 for both. Lack of representative data from that many speakers and uncertainty about what exactly constitutes representative data are major reasons for not employing ASR in FSC. An experiment was carried out in which a situation was simulated where a practitioner has only 30 speakers available. Several data strategies are tried out to handle the lack of data: leaving out reference normalization, splitting the 30 speakers into two groups of 15 (ignoring the minimum of 30) and a leave 1 or 2 out strategy where all 30 speakers are used for both reference normalization and calibration. They are compared to the baseline situation where the practitioner does have the required 60 speakers. The leave 1 or 2 out strategy with 30 speakers performs on par with baseline, and extension of that strategy to the full 60 speakers even outperforms baseline. This shows that a strategy that halves the data need is viable, lessening the data requirements for ASR in FSC and making the use of ASR possible in more cases.

引用

页数：10

共 50 条

[21] A STUDY OF AUTOMATIC PHONETIC SEGMENTATION FOR FORENSIC VOICE COMPARISON
Huang, Chee Cheun
Epps, Julien
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1853 - 1856
[22] Forensic Speaker Identification: a Tutorial
Univaso, Pedro
IEEE LATIN AMERICA TRANSACTIONS, 2017, 15 (09) : 1754 - 1770
[23] Effect of identical twins on deep speaker embeddings based forensic voice comparison
Abed M.H.
Sztahó D.
International Journal of Speech Technology, 2024, 27 (02) : 341 - 351
[24] Forensic Speaker Verification Using Ordinary Least Squares
Machado, Thyago J.
Filho, Jozue Vieira
de Oliveira, Mario A.
SENSORS, 2019, 19 (20)
[25] Optimization of data-driven filterbank for automatic speaker verification
Sarangi, Susanta
Sahidullah, Md
Saha, Goutam
DIGITAL SIGNAL PROCESSING, 2020, 104
[26] Reducing uncertainty at the score-to-LR stage in likelihood ratio-based forensic voice comparison using automatic speaker recognition systems
Wang, Bruce Xiao
Hughes, Vincent
INTERSPEECH 2022, 2022, : 5243 - 5247
[27] Data Augmentation with ECAPA-TDNN Architecture for Automatic Speaker Recognition
Li, Pinyan
Hoi, Lap Man
Wang, Yapeng
Im, Sio Kei
2023 12TH INTERNATIONAL CONFERENCE ON RENEWABLE ENERGY RESEARCH AND APPLICATIONS, ICRERA, 2023, : 414 - 420
[28] AN AUTOMATIC SPEAKER RECOGNITION SYSTEM
Akrouf, Samir
Mehamel, Abbas
Benhamouda, Nacera
Mostefai, Messaoud
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 719 - 727
[29] Applying Base Value of Fundamental Frequency via the Multivariate Kernel-Density in Forensic Speaker Comparison
da Silva, Ronaldo R.
da Costa, Joao Paulo C. L.
Miranda, Ricardo K.
Del Galdo, Giovanni
2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
[30] The impact in forensic voice comparison of lack of calibration and of mismatched conditions between the known-speaker recording and the relevant-population sample recordings
Morrison, Geoffrey Stewart
FORENSIC SCIENCE INTERNATIONAL, 2018, 283 : E1 - E7

← 1 2 3 4 5 →