Speaker recognition based on short utterance compensation method of generative adversarial networks

被引:0
|
作者
Zhangfang Hu
Yaqin Fu
Yuan Luo
Xuan Xu
Zhiguang Xia
Hongwei Zhang
机构
[1] Chongqing University of Posts and Telecommunications,College of Optoelectronic Engineering
来源
International Journal of Speech Technology | 2020年 / 23卷
关键词
Gaussian mixture model–universal background model; Speaker recognition; Generative adversarial network;
D O I
暂无
中图分类号
学科分类号
摘要
On the basis of gaussian mixture model–universal background model (GMM–UBM) in the speaker recognition system, the paper proposes a short utterance sample compensation method based on the generative adversarial network (GAN) to solve the problem of the inadequate corpus data caused by short utterance, which has led to a serious reduction of recognition rate. The presented method compensates the short utterance samples into the speech samples with sufficient speaker identity information by completing the antagonistic training of generator network and discriminator network. In order to avoid the model crash and gradient instability in the process of GAN training, this paper adopts the condition information in the conditional GAN to guide the compensation process of the generator network, and proposes the generator compensation performance measurement training task and the feature tag training task of the discriminator to stabilize training process. Finally, the proposed short utterance compensation method is evaluated on the speaker recognition system based on GMM–UBM. The experimental results indicate that the presented method can effectively reduce the equal error rate of the speaker recognition system in short utterance environment.
引用
收藏
页码:443 / 450
页数:7
相关论文
共 50 条
  • [1] Speaker recognition based on short utterance compensation method of generative adversarial networks
    Hu, Zhangfang
    Fu, Yaqin
    Luo, Yuan
    Xu, Xuan
    Xia, Zhiguang
    Zhang, Hongwei
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 443 - 450
  • [2] A short utterance speaker recognition method with improved cepstrum–CNN
    Yongfeng Li
    Shuaishuai Chang
    QingE Wu
    SN Applied Sciences, 2022, 4
  • [3] A short utterance speaker recognition method with improved cepstrum-CNN
    Li, Yongfeng
    Chang, Shuaishuai
    Wu, QingE
    SN APPLIED SCIENCES, 2022, 4 (12):
  • [4] Short Utterance Speaker Recognition Based on Speech High Frequency Information Compensation and Dynamic Feature Enhancement Methods
    Zi, Yunfei
    Xiong, Shengwu
    ARCHIVES OF ACOUSTICS, 2024, 49 (01) : 37 - 48
  • [5] UNIVERSAL ADVERSARIAL PERTURBATIONS GENERATIVE NETWORK FOR SPEAKER RECOGNITION
    Li, Jiguo
    Zhang, Xinfeng
    Jia, Chuanmin
    Xu, Jizheng
    Zhang, Li
    Wang, Yue
    Ma, Siwei
    Gao, Wen
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [6] An algorithm of face recognition based on generative adversarial networks
    Leonov, Sergey
    Vasilyev, Alexander
    Makovetskii, Artyom
    Diaz-Escobar, J.
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLI, 2018, 10752
  • [7] Speaker Recognition Based on Multimodal Generative Adversarial Nets with Triplet-loss
    Chen Ying
    Chen Huangkang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (02) : 379 - 385
  • [8] Multi-Scale Kernels for Short Utterance Speaker Recognition
    Zhang, Wei-Qiang
    Zhao, Junhong
    Zhang, Wen-Lin
    Liu, Jia
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 414 - +
  • [9] Maize Disease Classification and Recognition Method Based on Super-resolution Generative Adversarial Networks
    Ma, Tiemin
    Qu, Hao
    Gao, Ya
    Wang, Xue
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2024, 55 (11): : 49 - 56and67
  • [10] Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes
    Li, Lantian
    Wang, Dong
    Zhang, Chenhao
    Zheng, Thomas Fang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (06) : 1129 - 1139