Review of various stages in speaker recognition system, performance measures and recognition toolkits

Cited by: 14
Authors
Pawar, Rupali V. [1]
Jalnekar, Rajesh M. [2]
Chitode, Janardan S. [3]
Affiliations
[1] Sinhgad Coll Engn, Pune, Maharashtra, India
[2] Vishwakarma Inst Technol, Pune, Maharashtra, India
[3] Vishwakarma Inst Technol, Dept E&TC, Pune, Maharashtra, India
Keywords
Pre-processing; Framing; Feature extraction; Generative and discriminative model; Toolkits; Performance measures; Receiver operating characteristics (ROC); Decision error trade off (DET); Equal error rate (EER); SPEECH RECOGNITION; IDENTIFICATION;
DOI
10.1007/s10470-017-1069-1
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Speaker Recognition is a vital application of speech processing. Speaker Recognition performs a task of authenticating or recognizing a speaker based on the unique features captured which characterize the speaker. Characteristics or features which are unique to an individual such as fundamental frequency, speaking style, pitch, and duration are used as distinguishing components of the human speech signal. Exploring these characteristics for various applications with an attempt to implement a robust speaker recognition system has been the impetus behind the research in this domain. This paper makes an attempt to present the available Feature Extraction and Recognition techniques with their merits and demerits. It also discusses the pre-emphasis stage of the speaker recognition system. The standard databases available for speaker recognition along with the criterion for their selection are also reviewed. The paper presents an overview of various toolkits and performance parameters of Automatic Speaker Recognition System.
Pages: 247 - 257
Page count: 11
Related papers
50 in total
  • [31] VQ Based Comparative Analysis of MFCC and BFCC Speaker Recognition System
    Rehman, Faizan Ur
    Kumar, Chandar
    Kumar, Shubash
    Mehmood, Atif
    Zafar, Umair
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2017, : 28 - 32
  • [32] A Review on Hand Gesture Recognition System
    Sonkusare, Jayesh S.
    Chopade, Nilkanth B.
    Sor, Ravindra
    Tade, Sunil L.
    1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 790 - 794
  • [33] Analysis of Distance Measures for Pre-quantization before Feature Extraction in Automatic Speaker Recognition
    Sarkar, Gourav
    Saha, Goutam
    2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 91 - 94
  • [34] A review of the application of staircase scene recognition system in assisted motion
    Kong, Weifeng
    Tan, Zhiying
    Fan, Wenbo
    Tao, Xu
    Wang, Meiling
    Xu, Linsen
    Xu, Xiaobin
    DIGITAL SIGNAL PROCESSING, 2024, 146
  • [35] A Survey on Various Deep Learning Algorithms for an Efficient Facial Expression Recognition System
    Banerjee, Rudranath
    De, Sourav
    Dey, Shouvik
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2023, 23 (03)
  • [36] User performance with speech recognition: A literature review
    Koester, HH
    ASSISTIVE TECHNOLOGY, 2001, 13 (02) : 116 - 130
  • [37] Binaural Classification-Based Speech Segregation and Robust Speaker Recognition System
    Venkatesan, R.
    Ganesh, A. Balaji
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (08) : 3383 - 3411
  • [38] Low-cost speech recognition system for small vocabulary and speaker independent
    Teh, CC
    Jong, CC
    Siek, L
    DESIGN, MODELING AND SIMULATION IN MICROELECTRONICS, 2000, 4228 : 208 - 211
  • [39] A Novel Approach to Low Cost Multi Language Speaker Sign Recognition System
    Kumar, M. Naresh
    Suresh, D.
    Ganesan, P.
    Sathish, B. S.
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (01) : 829 - 835
  • [40] GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system
    Xiao, Runqiu
    Li, Zhuo
    Miao, Xiaoxiao
    Wang, Wenchao
    Zhang, Pengyuan
    ELECTRONICS LETTERS, 2022, 58 (02) : 82 - 85