Statistically Significant Duration-Independent-based Noise-Robust Speaker Verification

被引:0
作者
Nirmal, Asmita [1 ]
Jayaswal, Deepak [2 ]
Kachare, Pramod H. [3 ]
机构
[1] Datta Meghe Coll Engn, Dept Elect & Telecommun Engn, Navi Mumbai, Maharashtra, India
[2] St Francis Inst Technol, Dept Elect & Telecommun Engn, Mumbai, Maharashtra, India
[3] Ramrao Adik Inst Technol, Dept Elect & Telecommun Engn, Navi Mumbai, Maharashtra, India
关键词
Extreme gradient boost; Feature selection; Mel-frequency cepstral coefficients; Speaker verification; FEATURE-SELECTION; FEATURES; MFCC;
D O I
10.33889/IJMEMS.2024.9.1.008
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A speaker verification system models individual speakers using different speech features to improve their robustness. However, redundant features degrade the system's performance. This paper presents Statistically Significant Duration -Independent Mel frequency Cepstral Coefficients (SSDI-MFCC) features with the Extreme Gradient Boost classifier for improving the noiserobustness of speaker models. Eight statistical descriptors are used to generate signal duration -independent features, and a statistically significant feature subset is obtained using a t -test. A redeveloped Librispeech database by adding noises from the AURORA database to simulate real-world test conditions for speaker verification is used for evaluation. The SSDI-MFCC is compared with Principal Component Analysis (PCA) and Genetic Algorithm (GA). The comparative results showed average equal error rate improvements by 4.93 % and 3.48 % with the SSDI-MFCC than GA-MFCC and PCA-MFCC in clean and noisy conditions, respectively. A significant reduction in verification time is observed using SSDI-MFCC than the complete feature set.
引用
收藏
页码:147 / 162
页数:16
相关论文
共 32 条
[1]   Multitaper MFCC and PLP features for speaker verification using i-vectors [J].
Alam, Md Jahangir ;
Kinnunen, Tomi ;
Kenny, Patrick ;
Ouellet, Pierre ;
O'Shaughnessy, Douglas .
SPEECH COMMUNICATION, 2013, 55 (02) :237-251
[2]  
[Anonymous], 2011, ITU-T, P.56
[3]  
[Anonymous], 2004, P SPEAKER LANGUAGE R
[4]   An efficient text-independent speaker verification for short utterance data from Mobile devices [J].
Arora, Sanghamitra V. ;
Vig, Rekha .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) :3049-3074
[5]   A novel metaheuristic method for solving constrained engineering optimization problems: Crow search algorithm [J].
Askarzadeh, Alireza .
COMPUTERS & STRUCTURES, 2016, 169 :1-12
[6]  
Ayyub BilalM., 2016, Probability, statistics, and reliability for engineers and scientists
[7]   Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification [J].
Chakroborty, Sandipan ;
Saha, Goutam .
SPEECH COMMUNICATION, 2010, 52 (09) :693-709
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]  
Cohen A., 2002, PROCEEDING COST 275, P89
[10]   Robust text-independent speaker verification using genetic programming [J].
Day, Peter ;
Nandi, Asoke K. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :285-295