Efficient speaker identification using spectral entropy

被引:0
|
作者
Fernando Luque-Suárez
Antonio Camarena-Ibarrola
Edgar Chávez
机构
[1] CICESE,
[2] Universidad Michoacana,undefined
来源
Multimedia Tools and Applications | 2019年 / 78卷
关键词
Speaker recognition; Speaker identification; Entropygrams;
D O I
暂无
中图分类号
学科分类号
摘要
In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.
引用
收藏
页码:16803 / 16815
页数:12
相关论文
共 50 条
  • [41] Speaker verification using the spectral and time parameters of voice signal
    V. N. Sorokin
    A. I. Tsyplikhin
    Journal of Communications Technology and Electronics, 2010, 55 : 1561 - 1574
  • [42] A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions
    Nirupam Shome
    Banala Saritha
    Richik Kashyap
    Rabul Hussain Laskar
    Neural Computing and Applications, 2023, 35 : 18933 - 18947
  • [43] An efficient speaker recognition using quantum neural network
    Kaur, Rupinderdeep
    Sharma, R. K.
    Kumar, Parteek
    MODERN PHYSICS LETTERS B, 2018, 32 (31):
  • [44] Using Approximate Entropy as a Speech Quality Measure for a Speaker Recognition System
    Metzger, Richard A.
    Doherty, John F.
    Jenkins, David M.
    2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [45] A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions
    Shome, Nirupam
    Saritha, Banala
    Kashyap, Richik
    Laskar, Rabul Hussain
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (26): : 18933 - 18947
  • [46] NMF Based System for Speaker Identification
    Costantini, Giovanni
    Cesarini, Valerio
    Paolizzo, Fabio
    2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 620 - 624
  • [47] The case for aural perceptual speaker identification
    Hollien, Harry
    Didla, Grace
    Harnsberger, James D.
    Hollien, Keith A.
    FORENSIC SCIENCE INTERNATIONAL, 2016, 269 : 8 - 20
  • [48] Super-Dirichlet Mixture Models using Differential Line Spectral Frequencies for Text-Independent Speaker Identification
    Ma, Zhanyu
    Leijon, Arne
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2360 - +
  • [49] Speaker Identification Using Semi-supervised Learning
    Fazakis, Nikos
    Karlos, Stamatis
    Kotsiantis, Sotiris
    Sgarbas, Kyriakos
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 389 - 396
  • [50] Novel Approach in Speaker Identification using SVM and GMM
    Bourouba, H.
    Korba, C. A.
    Djemili, Rafik
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2013, 15 (03): : 87 - 95