Review of various stages in speaker recognition system, performance measures and recognition toolkits

被引：14

作者：

Pawar, Rupali V. ^{[1
]}

Jalnekar, Rajesh M. ^{[2
]}

Chitode, Janardan S. ^{[3
]}

机构：

[1] Sinhgad Coll Engn, Pune, Maharashtra, India

[2] Vishwakarma Inst Technol, Pune, Maharashtra, India

[3] Vishwakarma Inst Technol, Dept E&TC, Pune, Maharashtra, India

来源：

ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING | 2018年 / 94卷 / 02期

关键词：

Pre-processing; Framing; Feature extraction; Generative and discriminative model; Toolkits; Performance measures; Receiver operating characteristics (ROC); Decision error trade off (DET); Equal error rate (EER); SPEECH RECOGNITION; IDENTIFICATION;

D O I：

10.1007/s10470-017-1069-1

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Speaker Recognition is a vital application of speech processing. Speaker Recognition performs a task of authenticating or recognizing a speaker based on the unique features captured which characterize the speaker. Characteristics or features which are unique to an individual such as fundamental frequency, speaking style, pitch, and duration are used as distinguishing components of the human speech signal. Exploring these characteristics for various applications with an attempt to implement a robust speaker recognition system has been the impetus behind the research in this domain. This paper makes an attempt to present the available Feature Extraction and Recognition techniques with their merits and demerits. It also discusses the pre-emphasis stage of the speaker recognition system. The standard databases available for speaker recognition along with the criterion for their selection are also reviewed. The paper presents an overview of various toolkits and performance parameters of Automatic Speaker Recognition System.

引用

页码：247 / 257

页数：11

共 50 条

[31] VQ Based Comparative Analysis of MFCC and BFCC Speaker Recognition System
Rehman, Faizan Ur
Kumar, Chandar
Kumar, Shubash
Mehmood, Atif
Zafar, Umair
2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2017, : 28 - 32
[32] A Review on Hand Gesture Recognition System
Sonkusare, Jayesh S.
Chopade, Nilkanth. B.
Sor, Ravindra
Tade, Sunil L.
1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 790 - 794
[33] Analysis of Distance Measures for Pre-quantization before Feature Extraction in Automatic Speaker Recognition
Sarkar, Gourav
Saha, Goutam
2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 91 - 94
[34] A review of the application of staircase scene recognition system in assisted motion
Kong, Weifeng
Tan, Zhiying
Fan, Wenbo
Tao, Xu
Wang, Meiling
Xu, Linsen
Xu, Xiaobin
DIGITAL SIGNAL PROCESSING, 2024, 146
[35] A Survey on Various Deep Learning Algorithms for an Efficient Facial Expression Recognition System
Banerjee, Rudranath
De, Sourav
Dey, Shouvik
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2023, 23 (03)
[36] User performance with speech recognition: A literature review
Koester, HH
ASSISTIVE TECHNOLOGY, 2001, 13 (02) : 116 - 130
[37] Binaural Classification-Based Speech Segregation and Robust Speaker Recognition System
Venkatesan, R.
Ganesh, A. Balaji
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (08) : 3383 - 3411
[38] Low-cost speech recognition system for small vocabulary and speaker independent
Teh, CC
Jong, CC
Siek, L
DESIGN, MODELING AND SIMULATION IN MICROELECTRONICS, 2000, 4228 : 208 - 211
[39] A Novel Approach to Low Cost Multi Language Speaker Sign Recognition System
Kumar, M. Naresh
Suresh, D.
Ganesan, P.
Sathish, B. S.
RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (01): : 829 - 835
[40] GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system
Xiao, Runqiu
Li, Zhuo
Miao, Xiaoxiao
Wang, Wenchao
Zhang, Pengyuan
ELECTRONICS LETTERS, 2022, 58 (02) : 82 - 85

← 1 2 3 4 5 →