Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction

被引:4
|
作者
Li, Yanxiong [1 ]
Chen, Hao [1 ]
Cao, Wenchang [1 ]
Huang, Qisheng [1 ]
He, Qianhua [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature grouping; feature interaction; few-shot learning; prototypical network; speaker identification; RECOGNITION; VERIFICATION; ATTENTION;
D O I
10.1109/TMM.2023.3253301
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing methods for few-shot speaker identification (FSSI) obtain high accuracy, but their computational complexities and model sizes need to be reduced for lightweight applications. In this work, we propose a FSSI method using a lightweight prototypical network with the final goal to implement the FSSI on intelligent terminals with limited resources, such as smart watches and smart speakers. In the proposed prototypical network, an embedding module is designed to perform feature grouping for reducing the memory requirement and computational complexity, and feature interaction for enhancing the representational ability of the learned speaker embedding. In the proposed embedding module, audio feature of each speech sample is split into several low-dimensional feature subsets that are transformed by a recurrent convolutional block in parallel. Then, the operations of averaging, addition, concatenation, element-wise summation and statistics pooling are sequentially executed to learn a speaker embedding for each speech sample. The recurrent convolutional block consists of a block of bidirectional long short-term memory, and a block of de-redundancy convolution in which feature grouping and interaction are conducted too. Our method is compared to baseline methods on three datasets that are selected from three public speech corpora (VoxCeleb1, VoxCeleb2, and LibriSpeech). The results show that our method obtains higher accuracy under several conditions, and has advantages over all baseline methods in computational complexity and model size.
引用
收藏
页码:9241 / 9253
页数:13
相关论文
共 50 条
  • [31] Improved prototypical networks for few-Shot learninge
    Ji, Zhong
    Chai, Xingliang
    Yu, Yunlong
    Pang, Yanwei
    Zhang, Zhongfei
    PATTERN RECOGNITION LETTERS, 2020, 140 : 81 - 87
  • [32] Few-Shot Keyword Spotting With Prototypical Networks
    Parnami, Archit
    Lee, Minwoo
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 277 - 283
  • [33] Multidomain variance-learnable prototypical network for few-shot diagnosis of novel faults
    Jianyu Long
    Yibin Chen
    Huiyu Huang
    Zhe Yang
    Yunwei Huang
    Chuan Li
    Journal of Intelligent Manufacturing, 2024, 35 : 1455 - 1467
  • [34] Multi-local feature relation network for few-shot learning
    Li Ren
    Guiduo Duan
    Tianxi Huang
    Zhao Kang
    Neural Computing and Applications, 2022, 34 : 7393 - 7403
  • [35] RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric
    Wang, Xingmei
    Meng, Jiaxiang
    Wen, Bin
    Xue, Fuzhao
    NEUROCOMPUTING, 2022, 490 : 283 - 294
  • [36] DiffFSRE: Diffusion-Enhanced Prototypical Network for Few-Shot Relation Extraction
    Chen, Yang
    Shi, Bowen
    ENTROPY, 2024, 26 (05)
  • [37] Prototypical Network with Instance-Level Attention in Multi-Label Few-Shot Learning
    Luo S.
    Zhang R.
    Pan L.
    Wu Z.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (04): : 403 - 409
  • [38] Multi-local feature relation network for few-shot learning
    Ren, Li
    Duan, Guiduo
    Huang, Tianxi
    Kang, Zhao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (10) : 7393 - 7403
  • [39] ST-PN: A Spatial Transformed Prototypical Network for Few-Shot SAR Image Classification
    Cai, Jinlei
    Zhang, Yueting
    Guo, Jiayi
    Zhao, Xin
    Lv, Junwei
    Hu, Yuxin
    REMOTE SENSING, 2022, 14 (09)
  • [40] A novel dimensional variational prototypical network for industrial few-shot fault diagnosis with unseen faults☆ ☆
    Peng, Chuang
    Chen, Lei
    Hao, Kuangrong
    Chen, Shuaijie
    Cai, Xin
    Wei, Bing
    COMPUTERS IN INDUSTRY, 2024, 162