Speaker identification based on spectrogram and local binary patterns

被引:0
|
作者
Li, Yuanyuan [1 ]
Wang, Yunfang [1 ]
Li, Penghua [1 ]
Feng, Huizong [1 ]
机构
[1] Automotive Electronics Engineering Research Center, College of Automation, Chongqing University of Posts and Telecommunications, Chongqing
来源
Journal of Computational Information Systems | 2015年 / 11卷 / 08期
基金
中国国家自然科学基金;
关键词
Dynamic time warping; Local binary patterns; Speaker identification; Spectrogram;
D O I
10.12733/jcis13720
中图分类号
学科分类号
摘要
This paper presents a text-independent, closed-set speaker identification approach based on spectrogram and dynamic time warping (DTW) algorithm. The preprocessed speech signals are divided into some chunks, then calculated to get the magnitude of the frequency spectrum, which creates the spectrograms. The local binary patterns (LBP) operator are used to obtain the LBP vectors being treated as the speech features. The distances between each of the LBP vectors are measured by DTW algorithm, which aims to align two sequences of input LBP vectors by warping the time axis iteratively until an optimal match between the two LBP vectors is found. Through this elastic and robust sequential data matching, the proposed method identifies which one is the target speaker among a closed-set of speakers. The numerical experiments are carried out to verify the theoretical results and clearly show that our identification method has an acceptable accuracy. ©, 2015, Binary Information Press. All right reserved.
引用
收藏
页码:2771 / 2778
页数:7
相关论文
共 50 条
  • [31] FACIAL EXPRESSION RECOGNITION USING CURVELET BASED LOCAL BINARY PATTERNS
    Saha, Ashirbani
    Wu, Q. M. Jonathan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2470 - 2473
  • [32] Learning Binaural Spectrogram Features for Azimuthal Speaker Localization
    Mlynarski, Wiktor
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2938 - 2941
  • [33] Palmprint recognition based on discriminative local binary patterns statistic feature
    Mu, Meiru
    Ruan, Qiuqi
    Shen, Yongsheng
    2010 INTERNATIONAL CONFERENCE ON SIGNAL ACQUISITION AND PROCESSING: ICSAP 2010, PROCEEDINGS, 2010, : 193 - 197
  • [34] A local approach based on a Local Binary Patterns variant texture descriptor for classifying pain states
    Nanni, Loris
    Brahnam, Sheryl
    Lumini, Alessandra
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (12) : 7888 - 7894
  • [35] Speaker identification based on neural networks
    Marhon, Sajid A.
    Al-Aghar, Duaa N. Ubaid
    NEURAL NETWORK WORLD, 2006, 16 (04) : 277 - 290
  • [36] NMF Based System for Speaker Identification
    Costantini, Giovanni
    Cesarini, Valerio
    Paolizzo, Fabio
    2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 620 - 624
  • [37] Speaker Identification Based on Fractal Dimensions
    侯丽敏
    王朔中
    Journal of Shanghai University, 2003, (01) : 60 - 63
  • [38] A Histogram Based Speaker Identification Technique
    Sleit, Azzam
    Serhan, Sami
    Nemir, Loai
    2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 391 - 395
  • [39] An FPGA based VQ for speaker identification
    Elmisery, FA
    Khaleil, AH
    Salama, AE
    El-Geldawi, F
    17th ICM 2005: 2005 International Conference on Microelectronics, Proceedings, 2005, : 130 - 132
  • [40] Environment Based Threshold for Speaker Identification
    Kanrar, Soumen
    2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,