Text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution

Cited by: 0
Authors
Miyajima, C [1]
Hattori, Y
Tokuda, K
Masuko, T
Kobayashi, T
Kitamura, T
Affiliations
[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 4668555, Japan
[2] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Dept Informat Proc, Yokohama, Kanagawa 2268502, Japan
Source
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2001 / Vol. E84D / No. 07
Keywords
speaker identification; pitch; multi-space probability distribution; Gaussian mixture model; minimum classification error;
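DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract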
This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). MSD-GMM allows us to model continuous pitch values of voiced frames and discrete symbols for unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled by a two-stream MSD-GMM. We derive maximum likelihood (ML) estimation formulae and a minimum classification error (MCE) training procedure for MSD-GMM parameters. The MSD-GMM speaker models are evaluated on text-independent speaker identification tasks. The experimental results show that the MSD-GMM can efficiently model the spectral and pitch features of each speaker and outperforms conventional speaker models. The results also demonstrate the utility of MCE training of the MSD-GMM parameters and its robustness to inter-session variability.
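As a rough illustration of the unified voiced/unvoiced modeling mentioned in the abstract, the sketch below scores a pitch sequence under a toy multi-space mixture: each component has a one-dimensional Gaussian for voiced frames and a zero-dimensional "unvoiced" space, combined through a per-component voiced weight. This is an assumption-laden sketch, not the paper's formulation: the function names (`msd_gmm_frame_loglik`, `identify_speaker`), the parameter layout, and the use of `None` to mark unvoiced frames are all hypothetical, only the pitch stream is covered, and parameters are taken as given rather than trained by ML or MCE.

```python
import math

def msd_gmm_frame_loglik(pitch, mix_weights, voiced_weights, means, variances):
    """Log-likelihood of one pitch observation under a toy MSD-GMM.

    pitch is a float (e.g. log F0) for a voiced frame, or None for an
    unvoiced frame. All parameter names are illustrative assumptions.
    """
    total = 0.0
    for c, w_v, mu, var in zip(mix_weights, voiced_weights, means, variances):
        if pitch is None:
            # Unvoiced frame: only the zero-dimensional space contributes.
            total += c * (1.0 - w_v)
        else:
            # Voiced frame: space weight times a 1-D Gaussian density.
            density = math.exp(-0.5 * (pitch - mu) ** 2 / var) / math.sqrt(2.0 * math.pi * var)
            total += c * w_v * density
    return math.log(total)

def identify_speaker(frames, speaker_models):
    """Return the speaker whose model gives the highest total log-likelihood."""
    scores = {
        name: sum(msd_gmm_frame_loglik(f, **params) for f in frames)
        for name, params in speaker_models.items()
    }
    return max(scores, key=scores.get)

# Hypothetical two-speaker example with one mixture component each.
models = {
    "speaker_a": dict(mix_weights=[1.0], voiced_weights=[0.7], means=[5.0], variances=[0.04]),
    "speaker_b": dict(mix_weights=[1.0], voiced_weights=[0.7], means=[4.5], variances=[0.04]),
}
best = identify_speaker([5.0, None, 5.1], models)
```

In a two-stream setting like the one the abstract describes, a spectral-stream log-likelihood would be added to each frame's score; that part is omitted here.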
Pages: 847-855
Number of pages: 9