Text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution

被引：0

作者：

Miyajima, C ^{[1
]}

Hattori, Y

Tokuda, K

Masuko, T

Kobayashi, T

Kitamura, T

机构：

[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 4668555, Japan

[2] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Dept Informat Proc, Yokohama, Kanagawa 2268502, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2001年 / E84D卷 / 07期

关键词：

speaker identification; pitch; multi-space probability distribution; Gaussian mixture model; minimum classification error;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). MSD-GMM allows us to model continuous pitch values of voiced frames and discrete symbols for unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled by a two-stream MSD-GMM. We derive maximum likelihood (ML) estimation formulae and minimum classification error (MCE) training procedure for MSD-GMM parameters. The MSD-GMM speaker models are evaluated for text-independent speaker identification tasks. The experimental results show that the MSD-GMM can efficiently model spectral and pitch features of each speaker and outperforms conventional speaker models. The results also demonstrate the utility of the MCE training of the MSD-GMM parameters and the robustness for the inter-session variability.

引用

页码：847 / 855

页数：9

共 50 条

[41] An optimum end-to-end text-independent speaker identification system using convolutional neural network
Farsiani, Shabnam
Izadkhah, Habib
Lotfi, Shahriar
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
[42] Robust text-independent speaker identification using multiple subband-classifiers in colored noise environment
Vale, E. E.
Cunha, A. A.
Alcaim, A.
PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 275 - 277
[43] HIERARCHICAL MIXTURE CLUSTERING AND ITS APPLICATION TO GMM BASED TEXT INDEPENDENT SPEAKER IDENTIFICATION
Saeidi, R.
Mohammadi, H. R. Sadegh
Ganchev, T.
Rodman, R. D.
2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 770 - +
[44] Text independent speaker identification with finite multivariate generalised Gaussian mixture model and k-means algorithm
Sailaja, V.
Rao, K. Srinivasa
Reddy, K. V. V. S.
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2013, 6 (02) : 119 - 126
[45] Features Extracted Using Frequency-Time Analysis Approach from Nyquist Filter Bank and Gaussian Filter Bank for Text-Independent Speaker Identification
Sen, Nirmalya
Basu, T. K.
BIOMETRICS AND ID MANAGEMENT, 2011, 6583 : 125 - +
[46] Using a small amount of text-independent speech data for a BiLSTM large-scale speaker identification approach
Nammous, Mohammad K.
Saeed, Khalid
Kobojek, Pawel
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (03) : 764 - 770
[47] 2S-Norm: A New Score Normalization for a GMM Based Text-Independent Speaker Identification System
Van Huy Nguyen
ADVANCES IN ENGINEERING RESEARCH AND APPLICATION, 2019, 63 : 9 - 15
[48] Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification
Chakroborty, Sandipan
Saha, Goutam
SPEECH COMMUNICATION, 2010, 52 (09) : 693 - 709
[49] A Quality Measure Method Using Gaussian Mixture Models and Divergence Measure for Speaker Identification
Zheng, Rong
Zhang, Shuwu
Xu, Bo
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2094 - 2097
[50] A Text Independent Handwriting Forgery Detection System Based on Branchlet Features and Gaussian Mixture Models
Fahn, Chin-Shyurng
Lee, Chu-Ping
Chen, Heng-I
2016 14TH ANNUAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2016,

← 1 2 3 4 5 →