Review of various stages in speaker recognition system, performance measures and recognition toolkits

被引:14
|
作者
Pawar, Rupali V. [1 ]
Jalnekar, Rajesh M. [2 ]
Chitode, Janardan S. [3 ]
机构
[1] Sinhgad Coll Engn, Pune, Maharashtra, India
[2] Vishwakarma Inst Technol, Pune, Maharashtra, India
[3] Vishwakarma Inst Technol, Dept E&TC, Pune, Maharashtra, India
关键词
Pre-processing; Framing; Feature extraction; Generative and discriminative model; Toolkits; Performance measures; Receiver operating characteristics (ROC); Decision error trade off (DET); Equal error rate (EER); SPEECH RECOGNITION; IDENTIFICATION;
D O I
10.1007/s10470-017-1069-1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speaker Recognition is a vital application of speech processing. Speaker Recognition performs a task of authenticating or recognizing a speaker based on the unique features captured which characterize the speaker. Characteristics or features which are unique to an individual such as fundamental frequency, speaking style, pitch, and duration are used as distinguishing components of the human speech signal. Exploring these characteristics for various applications with an attempt to implement a robust speaker recognition system has been the impetus behind the research in this domain. This paper makes an attempt to present the available Feature Extraction and Recognition techniques with their merits and demerits. It also discusses the pre-emphasis stage of the speaker recognition system. The standard databases available for speaker recognition along with the criterion for their selection are also reviewed. The paper presents an overview of various toolkits and performance parameters of Automatic Speaker Recognition System.
引用
收藏
页码:247 / 257
页数:11
相关论文
共 50 条
  • [41] An improved system for large population text independent Speaker Recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 2127 - 2131
  • [42] The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge
    Cai, Danwei
    Qin, Xiaoyi
    Cai, Weicheng
    Li, Ming
    INTERSPEECH 2019, 2019, : 2493 - 2497
  • [43] Human Action Recognition From Various Data Modalities: A Review
    Sun, Zehua
    Ke, Qiuhong
    Rahmani, Hossein
    Bennamoun, Mohammed
    Wang, Gang
    Liu, Jun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3200 - 3225
  • [44] Evaluating the Performance of a Speech Recognition Based System
    Pandey, Vinod Kumar
    Kopparapu, Sunil Kumar
    ADVANCES IN COMPUTING AND COMMUNICATIONS, PT III, 2011, 192 : 230 - 238
  • [45] WAVELET DETAIL COEFFICIENT AS A NOVEL WAVELET-MFCC FEATURES IN TEXT-DEPENDENT SPEAKER RECOGNITION SYSTEM
    Hidayat, Syahroni
    Tajuddin, Muhammad
    Yusuf, Siti Agrippina Alodia
    Qudsi, Jihadil
    Jaya, Nenet Natasudian
    IIUM ENGINEERING JOURNAL, 2022, 23 (01): : 68 - 81
  • [46] A Systematic Review of Fingerprint Recognition System Development
    Appati, Justice Kwame
    Nartey, Prince Kofi
    Yaokumah, Winfred
    Abdulai, Jamal-Deen
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
  • [47] Holonic multi-agent system model for fuzzy automatic speech/speaker recognition
    Valencia-Jimenez, J. J.
    Fernandez-Caballero, Antonio
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2008, 4953 : 73 - 82
  • [48] Research of neural network classifier in speaker recognition module for automated system of critical use
    Bykov, Mykola M.
    Kovtun, Viacheslav V.
    Smolarz, Andrzej
    Junisbekov, Mukhtar
    Targeusizova, Aliya
    Satymbekov, Maksabek
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH ENERGY PHYSICS EXPERIMENTS 2017, 2017, 10445
  • [49] Text Independent Speaker Recognition System using Back Propagation Network with Wavelet Features
    Albin, A. Jose
    Nandhitha, N. M.
    Roslin, S. Emalda
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [50] Efficient Pre-Quantization Techniques Based on Probability Density for Speaker Recognition System
    Sarkar, Gourav
    Saha, Goutam
    TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 53 - +