Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification

被引:3
|
作者
Hong, Qian-Bei [1 ,2 ]
Wu, Chung-Hsien [3 ]
Wang, Hsin-Min [4 ]
机构
[1] Natl Cheng Kung Univ, Grad Program Multimedia Syst & Intelligent Comp, Tainan, Taiwan
[2] Acad Sinica, Tainan, Taiwan
[3] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
[4] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
关键词
Speaker verification; parent embedding learning; partial adaptive score normalization; RECOGNITION; EMBEDDINGS;
D O I
10.1109/TASLP.2022.3221042
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The ability to generalize to mismatches between training and testing conditions and resist interference from other speakers is crucial for the performance of speaker verification. In this paper, we propose two novel approaches to improve the generalization ability to deal with the mismatched recorded scenarios and languages in test conditions and to reduce the influence of interference from other speakers on the similarity measurement of two speaker embeddings. First, parent embedding learning (PEL) is used for model training, which exploits the generalization ability of the shared structure to improve the representation of speaker embeddings. Second, partial adaptive score normalization (PAS-Norm) is used to reduce the influence of interference from other speakers on embedding-based similarity measures. In the experiments, the speaker embedding models are trained using the VoxCeleb2 dataset, and the performance is evaluated on four other datasets under different conditions, including VoxCeleb1, Librispeech, SITW, and CN-Celeb datasets. In the experiments on VoxCeleb1, evaluation results considering a large number of verification speakers and identity restrictions show that the proposed PEL-based system reduces the EER by 6.0% and 4.9% in these two cases, respectively, compared to the state-of-the-art (SOTA) system. Furthermore, in the experiments evaluating speaker verification in mismatch conditions on SITW and CN-Celeb, the proposed PEL-based system also outperforms the SOTA system. In the language mismatched conditions, the EER is reduced by 8.3%. For the evaluation of the influence of interference from other speakers, the EER is significantly reduced by 24.4% when PAS-Norm is used instead of the baseline AS-Norm score normalization method.
引用
收藏
页码:486 / 499
页数:14
相关论文
共 50 条
  • [31] Deep domain adaptation for anti-spoofing in speaker verification systems
    Himawan, Ivan
    Villavicencio, Fernando
    Sridharan, Sridha
    Fookes, Clinton
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 377 - 402
  • [32] A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification
    Li, Lantian
    Chen, Yixiang
    Wang, Dong
    Zheng, Thomas Fang
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 92 - 96
  • [33] Best Feature Selection for Emotional Speaker Verification in i-vector Representation
    Mackova, Lenka
    Cizmar, Anton
    Juhar, Jozef
    2015 25TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2015, : 209 - 212
  • [34] Lexicon-Based Local Representation for Text-Dependent Speaker Verification
    You, Hanxu
    Li, Wei
    Li, Lianqiang
    Zhu, Jie
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (03): : 587 - 589
  • [35] Improvement of anti-interference ability to thermal noises in a image-checking system for laser shape-meter
    Yang, Xilin
    Fang, Zhongyan
    Jin, Guofan
    Qiu, Zhongyi
    Guangxue Jishu/Optical Technique, 1999, (01): : 28 - 31
  • [36] A Simple Design of Alternating Polarized Array Antenna with Anti-Interference Ability
    Dai Huan-Yao
    Li Yong-Zhen
    Wang Xue-Song
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2012, 6 : 15 - 18
  • [37] Research on Anti-interference Ability of Direct Sequence Spread Spectrum System
    Li Zhendong
    Tan Weifeng
    Kang Chengbin
    Cheng Jingshuang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (01) : 116 - 123
  • [38] An encapsulation strategy of graphene humidity sensor for enhanced anti-interference ability
    Huang, Yuehua
    Zeng, Zhonglin
    Liang, Tao
    Li, Jing
    Liao, Ziqi
    Li, Junjun
    Yang, Tingting
    SENSORS AND ACTUATORS B-CHEMICAL, 2023, 396
  • [39] Anthropogenic impacts on the biodiversity and anti-interference ability of microbial communities in lakes
    Luo, Jiwei
    Zeng, Hui
    Zhou, Qixing
    Hu, Xiangang
    Qu, Qian
    Ouyang, Shaohu
    Wang, Yingying
    SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 820
  • [40] An excitation signal source with anti-interference ability for eddy current testing
    Jiang, Guodong
    Li, Po
    NINTH INTERNATIONAL SYMPOSIUM ON PRECISION ENGINEERING MEASUREMENTS AND INSTRUMENTATION, 2015, 9446