Speaker recognition based on short utterance compensation method of generative adversarial networks

Authors
Zhangfang Hu
Yaqin Fu
Yuan Luo
Xuan Xu
Zhiguang Xia
Hongwei Zhang
Affiliation
[1] Chongqing University of Posts and Telecommunications, College of Optoelectronic Engineering
Source
International Journal of Speech Technology | 2020 / Vol. 23
Keywords
Gaussian mixture model–universal background model; Speaker recognition; Generative adversarial network
Abstract
Building on the Gaussian mixture model–universal background model (GMM–UBM) speaker recognition system, this paper proposes a short-utterance sample compensation method based on a generative adversarial network (GAN). The method addresses the shortage of corpus data caused by short utterances, which severely reduces the recognition rate. Through adversarial training of a generator network and a discriminator network, it compensates short-utterance samples into speech samples that carry sufficient speaker identity information. To avoid model collapse and gradient instability during GAN training, the paper uses the condition information of the conditional GAN to guide the generator's compensation process, and proposes two training tasks to stabilize training: a compensation performance measurement task for the generator and a feature tag task for the discriminator. Finally, the proposed method is evaluated on a GMM–UBM speaker recognition system. The experimental results indicate that it effectively reduces the equal error rate of the system under short-utterance conditions.
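The abstract reports results as the equal error rate (EER), the operating point at which the false-accept rate equals the false-reject rate. The following is a minimal sketch of how an EER could be computed from verification scores. It is not the authors' evaluation code: the function name `equal_error_rate`, the threshold sweep over observed scores, and the example scores are all assumptions made for illustration.

```python
import numpy as np


def equal_error_rate(target_scores, impostor_scores):
    """Return (eer, threshold) at the point where FAR and FRR are closest.

    Hypothetical helper: higher scores are assumed to mean "same speaker".
    """
    target = np.asarray(target_scores, dtype=float)
    impostor = np.asarray(impostor_scores, dtype=float)

    # Candidate thresholds: every observed score, in ascending order.
    thresholds = np.sort(np.concatenate([target, impostor]))

    # False-reject rate: genuine trials scored below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False-accept rate: impostor trials scored at or above the threshold.
    far = np.array([(impostor >= t).mean() for t in thresholds])

    # Pick the threshold where the two error rates are closest.
    idx = int(np.argmin(np.abs(far - frr)))
    return (far[idx] + frr[idx]) / 2.0, thresholds[idx]


# Made-up scores, for illustration only (not data from the paper).
target = [2.1, 1.7, 0.9, 1.3, -0.2]
impostor = [-1.5, -0.8, 0.1, -2.0, 1.0]
eer, threshold = equal_error_rate(target, impostor)
print(f"EER = {eer:.2f} at threshold {threshold:.2f}")
```

With few trials the two error curves may not cross exactly, so this sketch averages the two rates at the closest point; evaluation toolkits often interpolate instead.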
Pages: 443-450
Number of pages: 7