Review of various stages in speaker recognition system, performance measures and recognition toolkits

被引:14
|
作者
Pawar, Rupali V. [1 ]
Jalnekar, Rajesh M. [2 ]
Chitode, Janardan S. [3 ]
机构
[1] Sinhgad Coll Engn, Pune, Maharashtra, India
[2] Vishwakarma Inst Technol, Pune, Maharashtra, India
[3] Vishwakarma Inst Technol, Dept E&TC, Pune, Maharashtra, India
关键词
Pre-processing; Framing; Feature extraction; Generative and discriminative model; Toolkits; Performance measures; Receiver operating characteristics (ROC); Decision error trade off (DET); Equal error rate (EER); SPEECH RECOGNITION; IDENTIFICATION;
D O I
10.1007/s10470-017-1069-1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speaker Recognition is a vital application of speech processing. Speaker Recognition performs a task of authenticating or recognizing a speaker based on the unique features captured which characterize the speaker. Characteristics or features which are unique to an individual such as fundamental frequency, speaking style, pitch, and duration are used as distinguishing components of the human speech signal. Exploring these characteristics for various applications with an attempt to implement a robust speaker recognition system has been the impetus behind the research in this domain. This paper makes an attempt to present the available Feature Extraction and Recognition techniques with their merits and demerits. It also discusses the pre-emphasis stage of the speaker recognition system. The standard databases available for speaker recognition along with the criterion for their selection are also reviewed. The paper presents an overview of various toolkits and performance parameters of Automatic Speaker Recognition System.
引用
收藏
页码:247 / 257
页数:11
相关论文
共 50 条
  • [1] Review of various stages in speaker recognition system, performance measures and recognition toolkits
    Rupali V. Pawar
    Rajesh M. Jalnekar
    Janardan S. Chitode
    Analog Integrated Circuits and Signal Processing, 2018, 94 : 247 - 257
  • [2] A review on speaker recognition: Technology and challenges
    Hanifa, Rafizah Mohd
    Isa, Khalid
    Mohamad, Shamsul
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 90
  • [3] RobinNet: A Multimodal Speech Emotion Recognition System With Speaker Recognition for Social Interactions
    Khurana, Yash
    Gupta, Swamita
    Sathyaraj, R.
    Raja, S. P.
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 11 (01) : 478 - 487
  • [4] Performance Comparison of Speaker and Emotion Recognition
    Revathy, A.
    Shanmugapriya, P.
    Mohan, V.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [5] Speaker Recognition in Uncontrolled Environent: A Review
    Karamangala, Narendra
    Kumaraswamy, Ratnaswamy
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (01) : 49 - 65
  • [6] An Isolated Word Speaker Recognition System
    Ozaydin, Selma
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2017, : 70 - 74
  • [7] A Review on Feature Extraction for Speaker Recognition under Degraded Conditions
    Disken, Gokay
    Tufekci, Zekeriya
    Saribulut, Lutfu
    Cevik, Ulus
    IETE TECHNICAL REVIEW, 2017, 34 (03) : 321 - 332
  • [8] VAD, feature extraction and modelling techniques for speaker recognition: a review
    Jainar, Spoorti J.
    Sale, Pritam Limbaji
    Nagaraja, B. G.
    INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2020, 12 (1-2) : 1 - 18
  • [9] Speaker Recognition with Deep Learning Approaches: A Review
    Alenizi, Abdulrahman S.
    Al-Karawi, Khamis A.
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024, 2024, 1000 : 481 - 499
  • [10] Feature Extraction Methods for Speaker Recognition: A Review
    Chaudhary, Gopal
    Srivastava, Smriti
    Bhardwaj, Saurabh
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (12)