Efficient speaker identification using spectral entropy

被引:0
|
作者
Fernando Luque-Suárez
Antonio Camarena-Ibarrola
Edgar Chávez
机构
[1] CICESE,
[2] Universidad Michoacana,undefined
来源
Multimedia Tools and Applications | 2019年 / 78卷
关键词
Speaker recognition; Speaker identification; Entropygrams;
D O I
暂无
中图分类号
学科分类号
摘要
In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.
引用
收藏
页码:16803 / 16815
页数:12
相关论文
共 50 条
  • [31] Using Placement and Name for Speaker Identification in Captioning
    Vy, Quoc V.
    Fels, Deborah I.
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS, PT 1, 2010, 6179 : 247 - 254
  • [32] Speaker identification using orthogonal and discriminative features
    Davarpanah, SH
    Mirzaei, A
    Ziaei, A
    IWSSIP 2005: Proceedings of the 12th International Worshop on Systems, Signals & Image Processing, 2005, : 293 - 296
  • [33] Using Avatars for Improving Speaker Identification in Captioning
    Vy, Quoc V.
    Fels, Deborah I.
    HUMAN-COMPUTER INTERACTION - INTERACT 2009, PT II, PROCEEDINGS, 2009, 5727 : 916 - 919
  • [34] An efficient speaker identification framework based on Mask R-CNN classifier parameter optimized using hosted cuckoo optimization (HCO)
    Gaurav
    Bhardwaj S.
    Agarwal R.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (10) : 13613 - 13625
  • [35] EFFICIENT SPEAKER VERIFICATION SYSTEM USING SPEAKER MODEL CLUSTERING FOR T AND Z NORMALIZATIONS
    Ravulakollu, Kiran
    Apsingekar, Vijendra Raj
    De Leon, Phillip L.
    42ND ANNUAL 2008 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS, 2008, : 56 - 62
  • [36] ENERGY-EFFICIENT SPEAKER IDENTIFICATION WITH LOW-PRECISION NETWORKS
    Koppula, Skanda
    Glass, James
    Chandrakasan, Anantha P.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2246 - 2250
  • [37] Efficient Low-Complexity Speaker Identification based on a SpeakerGAN Approach
    Spisiak, Michal
    Jakubec, Maros
    Jarina, Roman
    Kasak, Peter
    2024 34TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA 2024, 2024,
  • [38] EFFICIENT SCORE NORMALIZATION FOR SPEAKER RECOGNITION
    Aronowitz, Hagai
    Aronowitz, Vanessia
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4402 - 4405
  • [39] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
    Lin, Wenyong
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
  • [40] Efficient speaker identification from speech transmitted over Bluetooth networks
    Khalil, Ali
    Elnaby, Mustafa
    Saad, Elsayed
    Al-Nahari, Azzam
    Al-Zubi, Nayel
    El-Bendary, Mohsen
    El-Samie, Fathi
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 409 - 416