Review of various stages in speaker recognition system, performance measures and recognition toolkits

Cited by: 14

Authors:

Pawar, Rupali V. [1]

Jalnekar, Rajesh M. [2]

Chitode, Janardan S. [3]

Affiliations:

[1] Sinhgad Coll Engn, Pune, Maharashtra, India

[2] Vishwakarma Inst Technol, Pune, Maharashtra, India

[3] Vishwakarma Inst Technol, Dept E&TC, Pune, Maharashtra, India

Source:

ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING | 2018 / Vol. 94 / Issue 02

Keywords:

Pre-processing; Framing; Feature extraction; Generative and discriminative model; Toolkits; Performance measures; Receiver operating characteristics (ROC); Decision error trade off (DET); Equal error rate (EER); SPEECH RECOGNITION; IDENTIFICATION;

DOI:

10.1007/s10470-017-1069-1

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Speaker recognition is a vital application of speech processing. It performs the task of authenticating or recognizing a speaker based on captured features that uniquely characterize that speaker. Characteristics unique to an individual, such as fundamental frequency, speaking style, pitch, and duration, are used as distinguishing components of the human speech signal. Exploring these characteristics for various applications, with the aim of implementing a robust speaker recognition system, has been the impetus behind research in this domain. This paper presents the available feature extraction and recognition techniques along with their merits and demerits. It also discusses the pre-emphasis stage of the speaker recognition system. The standard databases available for speaker recognition, along with the criteria for their selection, are also reviewed. The paper presents an overview of various toolkits and performance parameters of automatic speaker recognition systems.


Pages: 247 - 257

Page count: 11

50 items in total

[1] Review of various stages in speaker recognition system, performance measures and recognition toolkits
Rupali V. Pawar
Rajesh M. Jalnekar
Janardan S. Chitode
Analog Integrated Circuits and Signal Processing, 2018, 94 : 247 - 257
[2] A review on speaker recognition: Technology and challenges
Hanifa, Rafizah Mohd
Isa, Khalid
Mohamad, Shamsul
COMPUTERS & ELECTRICAL ENGINEERING, 2021, 90
[3] RobinNet: A Multimodal Speech Emotion Recognition System With Speaker Recognition for Social Interactions
Khurana, Yash
Gupta, Swamita
Sathyaraj, R.
Raja, S. P.
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 11 (01) : 478 - 487
[4] Performance Comparison of Speaker and Emotion Recognition
Revathy, A.
Shanmugapriya, P.
Mohan, V.
2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
[5] Speaker Recognition in Uncontrolled Environment: A Review
Karamangala, Narendra
Kumaraswamy, Ratnaswamy
JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (01) : 49 - 65
[6] An Isolated Word Speaker Recognition System
Ozaydin, Selma
2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2017, : 70 - 74
[7] A Review on Feature Extraction for Speaker Recognition under Degraded Conditions
Disken, Gokay
Tufekci, Zekeriya
Saribulut, Lutfu
Cevik, Ulus
IETE TECHNICAL REVIEW, 2017, 34 (03) : 321 - 332
[8] VAD, feature extraction and modelling techniques for speaker recognition: a review
Jainar, Spoorti J.
Sale, Pritam Limbaji
Nagaraja, B. G.
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2020, 12 (1-2) : 1 - 18
[9] Speaker Recognition with Deep Learning Approaches: A Review
Alenizi, Abdulrahman S.
Al-Karawi, Khamis A.
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024, 2024, 1000 : 481 - 499
[10] Feature Extraction Methods for Speaker Recognition: A Review
Chaudhary, Gopal
Srivastava, Smriti
Bhardwaj, Saurabh
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (12)
