Speaker identification analysis for SGMM with k-means and fuzzy C-means clustering using SVM statistical technique

被引：1

作者：

Manikandan, K. ^{[1
]}

Chandra, E. ^{[2
]}

机构：

[1] PSG Coll Arts & Sci, Dept Comp Sci, Coimbatore 641015, Tamil Nadu, India

[2] Bharathiar Univ, Dept Comp Sci, Coimbatore, Tamil Nadu, India

来源：

INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS | 2021年 / 25卷 / 03期

关键词：

k-means; fuzzy C-means; SGMFC; speaker identification; SVM; MODEL; RECOGNITION; SPEECH;

D O I：

10.3233/KES-210073

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speaker Identification denotes the speech samples of known speaker and it identifies the best matches of the input model. The SGMFC method is the combination of Sub Gaussian Mixture Model (SGMM) with the Mel-frequency Cepstral Coefficients (MFCC) for feature extraction. The SGMFC method minimizes the error rate, memory footprint and also computational throughput measure needs of a medium-vocabulary speaker identification system, supposed for preparation on a transportable or otherwise. Fuzzy C-means and k-means clustering are used in the SGMM method to attain the improved efficiency and their outcomes with parameters such as precision, sensitivity and specificity are compared.

引用

页码：309 / 314

页数：6

共 30 条

[1]

[Anonymous], 2010, P NAT C COMM NCC

[2] Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications [J].

Apsingekar, Vijendra Raj ;

De Leon, Phillip L. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) :848-853

[3]

Baraldi A, 1999, IEEE T SYST MAN CY B, V29, P778, DOI 10.1109/3477.809032

[4]

Chaudhari UV, 2001, INT CONF ACOUST SPEE, P461, DOI 10.1109/ICASSP.2001.940867

[5] YIN, a fundamental frequency estimator for speech and music [J].

de Cheveigné, A ;

Kawahara, H .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (04) :1917-1930

[6] Front-End Factor Analysis for Speaker Verification [J].

Dehak, Najim ;

Kenny, Patrick J. ;

Dehak, Reda ;

Dumouchel, Pierre ;

Ouellet, Pierre .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) :788-798

[7]

Dharmale N., 2016, CONTROLLING APPL VIA, V3

[8]

Diez M, 2011, LECT NOTES COMPUT SC, V6669, P612

[9]

Dubery S. Kumar, 2013, IJACSA, V4

[10] Speaker identification using instantaneous frequencies [J].

Grimaldi, Marco ;

Cummins, Fred .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (06) :1097-1111

← 1 2 3 →