Efficient speaker identification using spectral entropy

被引:7
|
作者
Luque-Suarez, Fernando [1 ]
Camarena-Ibarrola, Antonio [2 ]
Chavez, Edgar [1 ]
机构
[1] CICESE, Ensenada, Baja California, Mexico
[2] Univ Michoacana, Morelia, Michoacan, Mexico
关键词
Speaker recognition; Speaker identification; Entropygrams; RECOGNITION;
D O I
10.1007/s11042-018-7035-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.
引用
收藏
页码:16803 / 16815
页数:13
相关论文
共 50 条
  • [41] A Novel Speech Enhancement Method Using Fourier Series Decomposition and Spectral Subtraction for Robust Speaker Identification
    Ali I. Siam
    Heba A. El-khobby
    Mustafa M. Abd Elnaby
    Hatem S. Abdelkader
    Fathi E. Abd El-Samie
    Wireless Personal Communications, 2019, 108 : 1055 - 1068
  • [42] Real-Time Speaker Identification Using Speaker Model Distance
    Zeinali, Hossein
    Sameti, Hossein
    Hadian, Hossein
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 643 - 647
  • [43] EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE
    Zhao, Chen
    Wang, Hongcui
    Hyon, Songgun
    Wei, Jianguo
    Dang, Jianwu
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 345 - 348
  • [44] Speaker Identification for Business-Card-Type Sensors
    Yamaguchi, Shunpei
    Oshima, Ritsuko
    Oshima, Jun
    Shiina, Ryota
    Fujihashi, Takuya
    Saruwatari, Shunsuke
    Watanabe, Takashi
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2021, 2 : 216 - 226
  • [45] Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines
    Chakroun, Rania
    Frikha, Mondher
    Zouari, Leila Beltaifa
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2016), 2017, 557 : 659 - 666
  • [46] Joint Speech Enhancement and Speaker Identification Using Monte Carlo Methods
    Maina, Ciira Wa
    Walsh, John MacLaren
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1359 - 1362
  • [47] EmoBack: Backdoor Attacks Against Speaker Identification Using Emotional Prosody
    Schoof, Coen
    Koffas, Stefanos
    Conti, Mauro
    Picek, Stjepan
    PROCEEDINGS OF THE 2024 WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2024, 2024, : 137 - 148
  • [48] Boosting Speaker Identification Performance Using a Frame Level Based Algorithm
    Djemili, Rafik
    Bourouba, Hocine
    Korba, M. C. Amara
    O'Shaughnessy, Douglas
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA'15), 2015,
  • [49] Forensic Speaker Identification: a Tutorial
    Univaso, Pedro
    IEEE LATIN AMERICA TRANSACTIONS, 2017, 15 (09) : 1754 - 1770
  • [50] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
    Barhoush, Mahdi
    Hallawa, Ahmed
    Schmeink, Anke
    2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33