Statistically Significant Duration-Independent-based Noise-Robust Speaker Verification

被引：0

作者：

Nirmal, Asmita ^{[1
]}

Jayaswal, Deepak ^{[2
]}

Kachare, Pramod H. ^{[3
]}

机构：

[1] Datta Meghe Coll Engn, Dept Elect & Telecommun Engn, Navi Mumbai, Maharashtra, India

[2] St Francis Inst Technol, Dept Elect & Telecommun Engn, Mumbai, Maharashtra, India

[3] Ramrao Adik Inst Technol, Dept Elect & Telecommun Engn, Navi Mumbai, Maharashtra, India

来源：

INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES | 2024年 / 9卷 / 01期

关键词：

Extreme gradient boost; Feature selection; Mel-frequency cepstral coefficients; Speaker verification; FEATURE-SELECTION; FEATURES; MFCC;

D O I：

10.33889/IJMEMS.2024.9.1.008

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

A speaker verification system models individual speakers using different speech features to improve their robustness. However, redundant features degrade the system's performance. This paper presents Statistically Significant Duration -Independent Mel frequency Cepstral Coefficients (SSDI-MFCC) features with the Extreme Gradient Boost classifier for improving the noiserobustness of speaker models. Eight statistical descriptors are used to generate signal duration -independent features, and a statistically significant feature subset is obtained using a t -test. A redeveloped Librispeech database by adding noises from the AURORA database to simulate real-world test conditions for speaker verification is used for evaluation. The SSDI-MFCC is compared with Principal Component Analysis (PCA) and Genetic Algorithm (GA). The comparative results showed average equal error rate improvements by 4.93 % and 3.48 % with the SSDI-MFCC than GA-MFCC and PCA-MFCC in clean and noisy conditions, respectively. A significant reduction in verification time is observed using SSDI-MFCC than the complete feature set.

引用

页码：147 / 162

页数：16

共 32 条

[1] Multitaper MFCC and PLP features for speaker verification using i-vectors [J].

Alam, Md Jahangir ;

Kinnunen, Tomi ;

Kenny, Patrick ;

Ouellet, Pierre ;

O'Shaughnessy, Douglas .

SPEECH COMMUNICATION, 2013, 55 (02) :237-251

[2]

[Anonymous], 2011, ITU-T, P.56

[3]

[Anonymous], 2004, P SPEAKER LANGUAGE R

[4] An efficient text-independent speaker verification for short utterance data from Mobile devices [J].

Arora, Sanghamitra V. ;

Vig, Rekha .

MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) :3049-3074

[5] A novel metaheuristic method for solving constrained engineering optimization problems: Crow search algorithm [J].

Askarzadeh, Alireza .

COMPUTERS & STRUCTURES, 2016, 169 :1-12

[6]

Ayyub BilalM., 2016, Probability, statistics, and reliability for engineers and scientists

[7] Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification [J].

Chakroborty, Sandipan ;

Saha, Goutam .

SPEECH COMMUNICATION, 2010, 52 (09) :693-709

[8] XGBoost: A Scalable Tree Boosting System [J].

Chen, Tianqi ;

Guestrin, Carlos .

KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794

[9]

Cohen A., 2002, PROCEEDING COST 275, P89

[10] Robust text-independent speaker verification using genetic programming [J].

Day, Peter ;

Nandi, Asoke K. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :285-295

← 1 2 3 4 →