Text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution

被引:0
|
作者
Miyajima, C [1 ]
Hattori, Y
Tokuda, K
Masuko, T
Kobayashi, T
Kitamura, T
机构
[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 4668555, Japan
[2] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Dept Informat Proc, Yokohama, Kanagawa 2268502, Japan
来源
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2001年 / E84D卷 / 07期
关键词
speaker identification; pitch; multi-space probability distribution; Gaussian mixture model; minimum classification error;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). MSD-GMM allows us to model continuous pitch values of voiced frames and discrete symbols for unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled by a two-stream MSD-GMM. We derive maximum likelihood (ML) estimation formulae and minimum classification error (MCE) training procedure for MSD-GMM parameters. The MSD-GMM speaker models are evaluated for text-independent speaker identification tasks. The experimental results show that the MSD-GMM can efficiently model spectral and pitch features of each speaker and outperforms conventional speaker models. The results also demonstrate the utility of the MCE training of the MSD-GMM parameters and the robustness for the inter-session variability.
引用
收藏
页码:847 / 855
页数:9
相关论文
共 50 条
  • [1] Improved Text-Independent Speaker Identification and Verification with Gaussian Mixture Models
    Chakroun, Rania
    Frikha, Mondher
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 3 - 10
  • [2] Robust Text-independent Speaker recognition with Short Utterances using Gaussian Mixture Models
    Chakroun, Rania
    Frikha, Mondher
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 2204 - 2209
  • [3] Text-independent speaker identification based on deep Gaussian correlation supervector
    Sun, Linhui
    Gu, Ting
    Xie, Keli
    Chen, Jia
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 449 - 457
  • [4] Text-independent speaker identification based on deep Gaussian correlation supervector
    Linhui Sun
    Ting Gu
    Keli Xie
    Jia Chen
    International Journal of Speech Technology, 2019, 22 : 449 - 457
  • [5] Fused Mel Feature sets based Text-Independent Speaker Identification using Gaussian Mixture Model
    Kumari, R. Shantha Selva
    Nidhyananthan, S. Selva
    Anand, G.
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 319 - 326
  • [6] Text-Independent Speaker Verification Using Variational Gaussian Mixture Model
    Moattar, Mohammad Hossein
    Homayounpour, Mohammad Mehdi
    ETRI JOURNAL, 2011, 33 (06) : 914 - 923
  • [7] Self-Organizing Mixture Models for Text-Independent Speaker Identification
    Bouziane, Ayoub
    Kharroubi, Jamal
    Zarghili, Arsalane
    2014 THIRD IEEE INTERNATIONAL COLLOQUIUM IN INFORMATION SCIENCE AND TECHNOLOGY (CIST'14), 2014, : 345 - 350
  • [8] A novel text-independent speaker identification method based on common Gaussian bases
    Hao, Chen
    Zhao, Rongchun
    2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 72 - 78
  • [9] Text Independent Speaker Identification Using Gaussian Mixture Model
    Ting, Chee-Ming
    Salleh, Sh-Hussain
    Tan, Tian-Swee
    Ariff, A. K.
    ICIAS 2007: INTERNATIONAL CONFERENCE ON INTELLIGENT & ADVANCED SYSTEMS, VOLS 1-3, PROCEEDINGS, 2007, : 194 - 198
  • [10] Text-Independent Speaker Identification Using the Histogram Transform Model
    Ma, Zhanyu
    Yu, Hong
    Tan, Zheng-Hua
    Guo, Jun
    IEEE ACCESS, 2016, 4 : 9733 - 9739