Speaker identification based on spectrogram and local binary patterns

被引:0
|
作者
Li, Yuanyuan [1 ]
Wang, Yunfang [1 ]
Li, Penghua [1 ]
Feng, Huizong [1 ]
机构
[1] Automotive Electronics Engineering Research Center, College of Automation, Chongqing University of Posts and Telecommunications, Chongqing
来源
Journal of Computational Information Systems | 2015年 / 11卷 / 08期
基金
中国国家自然科学基金;
关键词
Dynamic time warping; Local binary patterns; Speaker identification; Spectrogram;
D O I
10.12733/jcis13720
中图分类号
学科分类号
摘要
This paper presents a text-independent, closed-set speaker identification approach based on spectrogram and dynamic time warping (DTW) algorithm. The preprocessed speech signals are divided into some chunks, then calculated to get the magnitude of the frequency spectrum, which creates the spectrograms. The local binary patterns (LBP) operator are used to obtain the LBP vectors being treated as the speech features. The distances between each of the LBP vectors are measured by DTW algorithm, which aims to align two sequences of input LBP vectors by warping the time axis iteratively until an optimal match between the two LBP vectors is found. Through this elastic and robust sequential data matching, the proposed method identifies which one is the target speaker among a closed-set of speakers. The numerical experiments are carried out to verify the theoretical results and clearly show that our identification method has an acceptable accuracy. ©, 2015, Binary Information Press. All right reserved.
引用
收藏
页码:2771 / 2778
页数:7
相关论文
共 50 条
  • [21] Image Retrieval Based on Contourlet Transform and Local Binary Patterns
    Zhang, Qidong
    Wu, Jianhua
    Gao, Liqun
    ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 2673 - 2676
  • [22] ReptoNet: A 3D Log Mel Spectrogram-Based Few-Shot Speaker Identification with Reptile Algorithm
    Saritha, Banala
    Laskar, Mohammad Azharuddin
    Monsley, K. Anish
    Laskar, Rabul Hussain
    Choudhury, Madhuchhanda
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, : 7495 - 7510
  • [23] ON SPECTROGRAM LOCAL MAXIMA
    Flandrin, Patrick
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 3979 - 3983
  • [24] Gammatonegram based Speaker Identification
    Pour, Aref Farhadi
    Asgari, Mohammad
    Hasanabadi, Mohammad Reza
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 52 - 55
  • [25] Influence of binary mask estimation errors on robust speaker identification
    May, Tobias
    SPEECH COMMUNICATION, 2017, 87 : 40 - 48
  • [26] Local Binary Patterns for Graph Characterization
    Jawad, Muhammad
    Aziz, Furqan
    Hancock, Edwin
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1241 - 1246
  • [27] Fast Object Detection Based on Color Histograms and Local Binary Patterns
    Lee, Kwon
    Lee, Chulhee
    Kim, Seon-Ae
    Kim, Young-Hoon
    TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
  • [28] HUMAN DETECTION WITH CONTOUR-BASED LOCAL MOTION BINARY PATTERNS
    Duc Thanh Nguyen
    Ogunbona, Philip
    Li, Wanqing
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [29] Facial expression recognition based on Local Binary Patterns: A comprehensive study
    Shan, Caifeng
    Gong, Shaogang
    McOwan, Peter W.
    IMAGE AND VISION COMPUTING, 2009, 27 (06) : 803 - 816
  • [30] A Fast Matching Algorithm Based on Local Binary Patterns and Graph Transformation
    Zhao X.-Q.
    Yue Z.-D.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2017, 45 (09): : 2156 - 2161