Jointing Multi-task Learning and Gradient Reversal Layer for Far-Field Speaker Verification

Cited by: 1
Authors
Xu, Wei [1]
Wang, Xinghao [1]
Wan, Hao [1,2]
Guo, Xin [3]
Zhao, Junhong [1]
Deng, Feiqi [1]
Kang, Wenxiong [1]
Affiliations
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[2] Guangdong Baiyun Airport Informat Technol Co Ltd, Postdoctoral Innovat Base, Guangzhou, Peoples R China
[3] Guangdong Commun Polytech, Guangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Far-field speaker verification; Multi-task learning; Gradient reversal layer; Dynamic loss weights strategy
DOI
10.1007/978-3-030-86608-2_49
Chinese Library Classification (CLC) code
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Far-field speaker verification is challenging because of the interference caused by varying distances between the speaker and the recorder. In this paper, a distance discriminator, which determines whether two utterances were recorded at the same distance, is used as an auxiliary task to learn distance discrepancy information. Two identical auxiliary tasks are used: one is added before the speaker embedding layer to learn distance discrepancy information via multi-task learning, and the other is added after that layer to suppress the learned discrepancy via a gradient reversal layer. In addition, to avoid conflicts among the optimization directions of the tasks, the loss weight of each task is updated dynamically during training. Experiments on AISHELL Wake-up show relative reductions in equal error rate (EER) of 7% and 10.3% on far-far and near-far speaker verification, respectively, compared with the single-task model, demonstrating the effectiveness of the proposed method.
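The gradient reversal layer mentioned in the abstract can be illustrated with a minimal, framework-free sketch. It is not the authors' implementation, which this record does not show. The class name `GradientReversal`, the `scale` parameter, and the sample vectors are all hypothetical. The sketch only shows the general idea: the layer passes features through unchanged in the forward pass and flips the sign of the gradient in the backward pass, so the layers feeding it are pushed to make the discriminator's task harder rather than easier.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GradientReversal:
    """Identity in the forward pass; negates and scales gradients in the backward pass."""

    scale: float = 1.0  # hypothetical reversal strength, an assumption of this sketch

    def forward(self, features: List[float]) -> List[float]:
        # The features reach the auxiliary discriminator unchanged.
        return list(features)

    def backward(self, upstream_grad: List[float]) -> List[float]:
        # The gradient flowing back toward the embedding layer is sign-flipped.
        return [-self.scale * g for g in upstream_grad]


# Illustrative values only; they do not come from the paper.
grl = GradientReversal(scale=0.5)
embedding = [0.2, -1.0, 3.0]
grad_from_discriminator = [1.0, -2.0, 0.5]

forward_out = grl.forward(embedding)
reversed_grad = grl.backward(grad_from_discriminator)
```

In an automatic differentiation framework the same behavior would normally be expressed as a custom gradient function rather than an explicit `backward` call.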
Pages: 449-457
Number of pages: 9