RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric

Cited by: 8
Authors
Wang, Xingmei [1]
Meng, Jiaxiang [1]
Wen, Bin [1]
Xue, Fuzhao [2]
Affiliations
[1] Harbin Engineering University, College of Computer Science and Technology, Harbin 150001, People's Republic of China
[2] National University of Singapore, School of Computing, Singapore, Singapore
Keywords
Few-shot learning; Speaker recognition; Prototypical networks; Indefinite distance metric; Attention
DOI
10.1016/j.neucom.2021.11.092
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The few-shot speaker recognition task is to identify speakers from a limited number of support samples. We argue that query samples and support samples are both informative for classification. To help Prototypical Networks capture information from query samples, this paper proposes the relation-based indefinite distance metric attentive correction prototype network (RACP). Since the mean prototype deviates from the ideal prototype, we calculate attention scores for each query sample to customize the attention prototype. Then, to compensate for the missing query-sample information, the prototype is further refined with correction data constructed by combining query samples with the global class attention score. Next, the indefinite distance metric of Relation Networks is introduced into Prototypical Networks, and the relation scores between the sample prototypes and the query samples are calculated for the final prediction. Compared with existing methods, RACP considers both query samples and support samples instead of ignoring the query ones. We compare RACP with strong baselines (e.g., GMM-SVM, MAML, Prototypical Networks, Res32, and VGG11). An ablation study and a generalizability study across different scenarios are also conducted on different datasets. Results show that RACP achieves better performance and generalization ability. © 2021 Elsevier B.V. All rights reserved.
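The contrast the abstract draws between the mean prototype and a query-specific attention prototype can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption made for illustration: the dot-product scoring, the softmax weighting, and the function names `mean_prototype` and `attention_prototype` are hypothetical, and the correction-data step and the learned relation module are not modelled.

```python
import math

def mean_prototype(support):
    """Plain Prototypical Networks prototype: element-wise mean of support embeddings."""
    dim = len(support[0])
    return [sum(v[i] for v in support) / len(support) for i in range(dim)]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_prototype(support, query):
    """Query-specific prototype (assumed form, not taken from the paper):
    support embeddings weighted by a softmax over their dot products with the query."""
    scores = [sum(a * b for a, b in zip(s, query)) for s in support]
    weights = softmax(scores)
    dim = len(query)
    return [sum(w * s[i] for w, s in zip(weights, support)) for i in range(dim)]

# Toy 2-D embeddings for one class with two support samples (made-up values).
support = [[1.0, 0.0], [0.0, 1.0]]
query = [2.0, 0.0]

proto_mean = mean_prototype(support)              # equal weight on both supports
proto_attn = attention_prototype(support, query)  # shifted toward the support nearest the query
```

In this toy case the attention prototype moves toward the support sample that best matches the query, whereas the mean prototype weights both samples equally. In RACP as described in the abstract, the final prediction comes from relation scores rather than a fixed distance, which this sketch does not attempt to reproduce.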
Pages: 283-294
Page count: 12