Efficient speaker identification using spectral entropy

被引:0
|
作者
Fernando Luque-Suárez
Antonio Camarena-Ibarrola
Edgar Chávez
机构
[1] CICESE,
[2] Universidad Michoacana,undefined
来源
Multimedia Tools and Applications | 2019年 / 78卷
关键词
Speaker recognition; Speaker identification; Entropygrams;
D O I
暂无
中图分类号
学科分类号
摘要
In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.
引用
收藏
页码:16803 / 16815
页数:12
相关论文
共 50 条
  • [21] Speaker Identification using Whispered Speech
    Jawarkar, Naresh P.
    Holambe, Raghunath S.
    Basu, Tapan Kumar
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
  • [22] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    M. Milošević
    Ž. Nedeljković
    U. Glavitsch
    Ž. Đurović
    Journal of Communications Technology and Electronics, 2019, 64 : 1256 - 1265
  • [23] A Novel Speech Enhancement Method Using Fourier Series Decomposition and Spectral Subtraction for Robust Speaker Identification
    Siam, Ali, I
    El-khobby, Heba A.
    Abd Elnaby, Mustafa M.
    Abdelkader, Hatem S.
    Abd El-Samie, Fathi E.
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 108 (02) : 1055 - 1068
  • [24] Real-Time Speaker Identification Using Speaker Model Distance
    Zeinali, Hossein
    Sameti, Hossein
    Hadian, Hossein
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 643 - 647
  • [25] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    Milosevic, M.
    Nedeljkovic, Z.
    Glavitsch, U.
    Durovic, Z.
    JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2019, 64 (11) : 1256 - 1265
  • [26] A Novel Speech Enhancement Method Using Fourier Series Decomposition and Spectral Subtraction for Robust Speaker Identification
    Ali I. Siam
    Heba A. El-khobby
    Mustafa M. Abd Elnaby
    Hatem S. Abdelkader
    Fathi E. Abd El-Samie
    Wireless Personal Communications, 2019, 108 : 1055 - 1068
  • [27] EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE
    Zhao, Chen
    Wang, Hongcui
    Hyon, Songgun
    Wei, Jianguo
    Dang, Jianwu
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 345 - 348
  • [28] Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines
    Chakroun, Rania
    Frikha, Mondher
    Zouari, Leila Beltaifa
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2016), 2017, 557 : 659 - 666
  • [29] SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones
    Lu, Hong
    Brush, A. J. Bernheim
    Priyantha, Bodhi
    Karlson, Amy K.
    Liu, Jie
    PERVASIVE COMPUTING, 2011, 6696 : 188 - 205
  • [30] Modulation Spectral Features for Robust Far-Field Speaker Identification
    Falk, Tiago H.
    Chan, Wai-Yip
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 90 - 100