Forensic Speaker Verification Using Ordinary Least Squares

被引:7
作者
Machado, Thyago J. [1 ]
Filho, Jozue Vieira [2 ]
de Oliveira, Mario A. [3 ]
机构
[1] Sao Paulo State Univ UNESP, Campus Ilha Solteira, BR-15385000 Sao Paulo, SP, Brazil
[2] Sao Paulo State Univ UNESP, Telecommun & Aeronaut Engn, BR-13876750 Sao Joao Da Boa, Vista Sp, Brazil
[3] Mato Grosso Fed Inst Technol, Automat & Control Engn, BR-78005200 Cuiaba, Brazil
关键词
forensic speaker comparison; forensic phonetics; voice processing; ordinary least squares (OLS); linear predictive coding (LPC); RECOGNITION;
D O I
10.3390/s19204385
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In Brazil, the recognition of speakers for forensic purposes still relies on a subjectivity-based decision-making process through a results analysis of untrustworthy techniques. Owing to the lack of a voice database, speaker verification is currently applied to samples specifically collected for confrontation. However, speaker comparative analysis via contested discourse requires the collection of an excessive amount of voice samples for a series of individuals. Further, the recognition system must inform who is the most compatible with the contested voice from pre-selected individuals. Accordingly, this paper proposes using a combination of linear predictive coding (LPC) and ordinary least squares (OLS) as a speaker verification tool for forensic analysis. The proposed recognition technique establishes confidence and similarity upon which to base forensic reports, indicating verification of the speaker of the contested discourse. Therefore, in this paper, an accurate, quick, alternative method to help verify the speaker is contributed. After running seven different tests, this study preliminarily achieved a hit rate of 100% considering a limited dataset (Brazilian Portuguese). Furthermore, the developed method extracts a larger number of formants, which are indispensable for statistical comparisons via OLS. The proposed framework is robust at certain levels of noise, for sentences with the suppression of word changes, and with different quality or even meaningful audio time differences.
引用
收藏
页数:22
相关论文
共 47 条
  • [1] Convolutional Neural Networks for Speech Recognition
    Abdel-Hamid, Ossama
    Mohamed, Abdel-Rahman
    Jiang, Hui
    Deng, Li
    Penn, Gerald
    Yu, Dong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
  • [2] Homogeneity Measure Impact on Target and Non-target Trials in Forensic Voice Comparison
    Ajili, Moez
    Bonastre, Jean-Francois
    Ben Kheder, Waad
    Rossato, Solange
    Kahn, Juliette
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2844 - 2848
  • [3] DWT features performance analysis for automatic speech recognition of Urdu
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Iqbal, Khalid
    Ali, Sahibzada Muhammad
    [J]. SPRINGERPLUS, 2014, 3 : 1 - 10
  • [4] [Anonymous], 2000, Linear predictive coding
  • [5] Aparna R., ROLE WINDOWING TECHN
  • [6] A FAST METHOD FOR THE NUMERICAL EVALUATION OF CONTINUOUS FOURIER AND LAPLACE TRANSFORMS
    BAILEY, DH
    SWARZTRAUBER, PN
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1994, 15 (05) : 1105 - 1110
  • [7] Becker T, 2008, INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, P1505
  • [8] Braid A.C.M., 2003, FONETICA FORENSE
  • [9] [Булгакова Е.В. Bulgakova E.V.], 2016, [Научно-технический вестник информационных технологий, механики и оптики, Scientific and Technical Journal of Information Technologies Mechanics and Optics, Nauchno-tekhnicheskii vestnik informatsionnykh tekhnologii, mekhaniki i optiki], V16, P284, DOI 10.17586/2226-1494-2016-16-2-284-289
  • [10] Chougala M, 2016, 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), P510, DOI 10.1109/ICEEOT.2016.7755666