RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric

被引:8
作者
Wang, Xingmei [1 ]
Meng, Jiaxiang [1 ]
Wen, Bin [1 ]
Xue, Fuzhao [2 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
关键词
Few-shot learning; Speaker recognition; Prototypical networks; Indefinite distance metric; Attention;
D O I
10.1016/j.neucom.2021.11.092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot speaker recognition task is to identify speakers from limited support samples. We argue that query samples and support samples are both informative for classification. To help Prototypical Networks capture information from query samples, this paper proposes the relation-based indefinite distance metric attentive correction prototype network (RACP). Since the mean prototype deviates from the ideal prototype, we calculate attention scores for each query sample to customize the attention prototype. Then, to compensate for the missed query samples information, the prototype is further refined by correction data that is constructed by combining query samples with the global class attention score. Later, the indefinite distance metric of Relation Networks is introduced on Prototypical Networks, and the relation scores between the sample prototypes and the query samples are calculated for final prediction. Compare with existing methods, RACP can consider both query samples and support samples instead of ignoring the query ones. We compare RACP with strong baselines (e.g. GMM-SVM, MAML, Prototypical Networks, Res32, and VGG11). Ablation study and generalizability study of different scenarios are also conducted on different datasets. Results show that RACP achieves better performance and generalization ability. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:283 / 294
页数:12
相关论文
共 50 条
  • [21] RAPS: A Novel Few-Shot Relation Extraction Pipeline with Query-Information Guided Attention and Adaptive Prototype Fusion RAPS for Few-Shot RE
    Zhang, Yuzhe
    Cen, Min
    Wu, Tongzhou
    Zhang, Hong
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 147 - 152
  • [22] Two-Stream Prototype Learning Network for Few-Shot Face Recognition Under Occlusions
    Yang, Xingyu
    Han, Mengya
    Luo, Yong
    Hu, Han
    Wen, Yonggang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1555 - 1563
  • [23] Attention meta-transfer learning approach for few-shot iris recognition
    Lei, Songze
    Dong, Baihua
    Shan, Aokui
    Li, Yonggang
    Zhang, Wenjuan
    Xiao, Feng
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 99
  • [24] Overall positive prototype for few-shot open-set recognition
    Sun, Liang-Yu
    Chu, Wei-Ta
    PATTERN RECOGNITION, 2024, 151
  • [25] Multi-level Metric Learning for Few-Shot Image Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 243 - 254
  • [26] Multi-instance attention network for few-shot learning
    Qin, Zhili
    Wang, Han
    Mawuli, Cobbinah Bernard
    Han, Wei
    Zhang, Rui
    Yang, Qinli
    Shao, Junming
    INFORMATION SCIENCES, 2022, 611 : 464 - 475
  • [27] Selectively Augmented Attention Network for Few-Shot Image Classification
    Li, Xiaoxu
    Wang, Xiangyang
    Zhu, Rui
    Ma, Zhanyu
    Cao, Jie
    Xue, Jing-Hao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1180 - 1192
  • [28] Transductive Graph-Attention Network for Few-shot Classification
    Pan, Lili
    Liu, Weifeng
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 190 - 195
  • [29] Few-shot classification for soil images: Prototype correction and feature distance enhancement
    Zeng, Shaohua
    Xia, Yinsen
    Gu, Shoukuan
    Liu, Fugang
    Zhou, Jing
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 233
  • [30] Convolutional Self-attention Guided Graph Neural Network for Few-Shot Action Recognition
    Pan, Fei
    Guo, Jie
    Guo, Yanwen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 401 - 412