Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction

被引:4
|
作者
Li, Yanxiong [1 ]
Chen, Hao [1 ]
Cao, Wenchang [1 ]
Huang, Qisheng [1 ]
He, Qianhua [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature grouping; feature interaction; few-shot learning; prototypical network; speaker identification; RECOGNITION; VERIFICATION; ATTENTION;
D O I
10.1109/TMM.2023.3253301
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing methods for few-shot speaker identification (FSSI) obtain high accuracy, but their computational complexities and model sizes need to be reduced for lightweight applications. In this work, we propose a FSSI method using a lightweight prototypical network with the final goal to implement the FSSI on intelligent terminals with limited resources, such as smart watches and smart speakers. In the proposed prototypical network, an embedding module is designed to perform feature grouping for reducing the memory requirement and computational complexity, and feature interaction for enhancing the representational ability of the learned speaker embedding. In the proposed embedding module, audio feature of each speech sample is split into several low-dimensional feature subsets that are transformed by a recurrent convolutional block in parallel. Then, the operations of averaging, addition, concatenation, element-wise summation and statistics pooling are sequentially executed to learn a speaker embedding for each speech sample. The recurrent convolutional block consists of a block of bidirectional long short-term memory, and a block of de-redundancy convolution in which feature grouping and interaction are conducted too. Our method is compared to baseline methods on three datasets that are selected from three public speech corpora (VoxCeleb1, VoxCeleb2, and LibriSpeech). The results show that our method obtains higher accuracy under several conditions, and has advantages over all baseline methods in computational complexity and model size.
引用
收藏
页码:9241 / 9253
页数:13
相关论文
共 50 条
  • [1] Improved prototypical network for active few-shot learning
    Wu, Yaqiang
    Li, Yifei
    Zhao, Tianzhe
    Zhang, Lingling
    Wei, Bifan
    Liu, Jun
    Zheng, Qinghua
    PATTERN RECOGNITION LETTERS, 2023, 172 : 188 - 194
  • [2] TRANSDUCTIVE PROTOTYPICAL NETWORK FOR FEW-SHOT CLASSIFICATION
    Liu, Xinyue
    Liu, Pengxin
    Zong, Linlin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1671 - 1675
  • [3] Reweighted Regularized Prototypical Network for Few-Shot Fault Diagnosis
    Li, Kang
    Shang, Chao
    Ye, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 6206 - 6217
  • [4] DFPN: a dynamic fusion prototypical network for few-shot learning
    Dong, Mengping
    Li, Fei
    Li, Zhenbo
    Liu, Xue
    APPLIED INTELLIGENCE, 2025, 55 (08)
  • [5] DFPN: a dynamic fusion prototypical network for few-shot learningDFPN: a dynamic fusion prototypical network for few-shot learningM. Dong et al.
    Mengping Dong
    Fei Li
    Zhenbo Li
    Xue Liu
    Applied Intelligence, 2025, 55 (10)
  • [6] Enhanced prototypical network for few-shot relation extraction
    Wen, Wen
    Liu, Yongbin
    Ouyang, Chunping
    Lin, Qiang
    Chung, Tonglee
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [7] Few-shot learning based oral cancer diagnosis using a dual feature extractor prototypical network
    Guo, Zijun
    Ao, Sha
    Ao, Bo
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 150
  • [8] Modeling Reviews for Few-Shot Recommendation via Enhanced Prototypical Network
    Liang, Tingting
    Xia, Congying
    Xu, Haoran
    Zhao, Ziqiang
    Yin, Yuyu
    Chen, Liang
    Yu, Philip S. S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 9407 - 9420
  • [9] Few-shot re-identification of the speaker by social robots
    Foggia, Pasquale
    Greco, Antonio
    Roberto, Antonio
    Saggese, Alessia
    Vento, Mario
    AUTONOMOUS ROBOTS, 2023, 47 (02) : 181 - 192
  • [10] DCPNet: Distribution Calibration Prototypical Network for Few-Shot Image Classification
    Xu, Ranhui
    Jiang, Kaizhong
    Qi, Lulu
    Zhao, Shaojie
    Zheng, Mingming
    IEEE ACCESS, 2024, 12 : 67036 - 67045