Text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution

Cited by: 0

Authors:

Miyajima, C [1]

Hattori, Y

Tokuda, K

Masuko, T

Kobayashi, T

Kitamura, T

Affiliations:

[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 4668555, Japan

[2] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Dept Informat Proc, Yokohama, Kanagawa 2268502, Japan

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2001 / Vol. E84D / No. 07

Keywords:

speaker identification; pitch; multi-space probability distribution; Gaussian mixture model; minimum classification error;

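DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract: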

This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). MSD-GMM allows us to model continuous pitch values of voiced frames and discrete symbols for unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled by a two-stream MSD-GMM. We derive maximum likelihood (ML) estimation formulae and a minimum classification error (MCE) training procedure for MSD-GMM parameters. The MSD-GMM speaker models are evaluated on text-independent speaker identification tasks. The experimental results show that the MSD-GMM can efficiently model the spectral and pitch features of each speaker and outperforms conventional speaker models. The results also demonstrate the utility of MCE training of the MSD-GMM parameters and the robustness to inter-session variability.
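
The idea of scoring voiced and unvoiced frames in one framework can be illustrated with a minimal sketch. It is not the authors' formulation: it covers only a pitch stream (no spectral stream, no ML or MCE training), and it assumes a two-space model in which a voiced frame is scored by a weighted sum of one-dimensional Gaussians over its pitch value, while an unvoiced frame is scored by a single discrete weight. All names (`msd_frame_likelihood`, `identify`, the speaker parameters) are hypothetical and chosen for illustration.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def msd_frame_likelihood(pitch, model):
    """Score one frame; pitch is a float (voiced) or None (unvoiced).

    Assumed model layout: the voiced component weights plus the
    unvoiced weight sum to 1.
    """
    if pitch is None:
        # Unvoiced frame: zero-dimensional space, only the weight counts.
        return model["unvoiced_weight"]
    # Voiced frame: weighted sum of Gaussians over the pitch value.
    return sum(
        w * gaussian_pdf(pitch, m, v)
        for w, m, v in zip(model["weights"], model["means"], model["variances"])
    )


def msd_log_likelihood(pitch_track, model):
    """Sum of per-frame log-likelihoods, assuming independent frames."""
    total = sum(model["weights"]) + model["unvoiced_weight"]
    if abs(total - 1.0) > 1e-9:
        raise ValueError("voiced and unvoiced weights must sum to 1")
    return sum(math.log(msd_frame_likelihood(p, model)) for p in pitch_track)


def identify(pitch_track, speaker_models):
    """Return the name of the speaker model with the highest log-likelihood."""
    return max(
        speaker_models,
        key=lambda name: msd_log_likelihood(pitch_track, speaker_models[name]),
    )


# Hypothetical speaker models over log-pitch values (illustrative numbers only).
speakers = {
    "A": {"weights": [0.7], "means": [5.0], "variances": [0.04], "unvoiced_weight": 0.3},
    "B": {"weights": [0.7], "means": [5.5], "variances": [0.04], "unvoiced_weight": 0.3},
}

# A short track mixing voiced values and unvoiced frames (None).
track = [5.02, None, 4.95, None, 5.10]
best = identify(track, speakers)
```

In this sketch the unvoiced frames contribute the same score to both hypothetical speakers, so the decision rests on the voiced pitch values; with different unvoiced weights per speaker, the voicing pattern itself would also carry speaker information.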

Pages: 847-855

Page count: 9

