Efficient Integrated Features Based on Pre-trained Models for Speaker Verification

被引:0
|
作者
Li, Yishuang [1 ,2 ]
Guan, Wenhao [3 ]
Huang, Hukai [3 ]
Miao, Shiyu [2 ]
Su, Qi [2 ]
Li, Lin [1 ,2 ]
Hong, Qingyang [3 ]
机构
[1] Xiamen Univ, Inst Artificial Intelligence, Xian, Peoples R China
[2] Xiamen Univ, Sch Elect Sci & Engn, Xian, Peoples R China
[3] Xiamen Univ, Sch Informat, Xian, Peoples R China
来源
INTERSPEECH 2024 | 2024年
基金
中国国家自然科学基金;
关键词
speaker verification; pre-trained models; feature integration; t-SNE; SPEECH;
D O I
10.21437/Interspeech.2024-1889
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous work has explored the application of pre-trained models (PTMs) in speaker verification(SV). Most researchers directly replaced handcrafted features with the universal representations of the PTMs, and jointly fine-tuned PTMs with the downstream SV networks, which undoubtedly discarded important spectral information contained in handcrafted features and also increased the training cost. In this paper, we proposed an efficient feature integration method that utilized a Fine-grained Fusion Module to fuse the multi-layer representations of the PTMs adaptively. Then we integrated the fused representations with handcrafted features to obtain the integrated features, which were subsequently fed into the SV network. The experimental results demonstrated that using the integrated features effectively enhanced the performance of the SV systems, and yielded decent results with no need to fine-tune the PTMs. Moreover, employing full-parameter fine-tuning led to the current optimal results.
引用
收藏
页码:2140 / 2144
页数:5
相关论文
共 50 条
  • [41] TARGET SPEECH EXTRACTION WITH PRE-TRAINED SELF-SUPERVISED LEARNING MODELS
    Peng, Junyi
    Delcroix, Marc
    Ochiai, Tsubasa
    Plchot, Oldrich
    Araki, Shoko
    Cemocky, Jan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 10421 - 10425
  • [42] Classification of Rice Leaf Diseases using CNN-based pre-trained models and transfer learning
    Mavaddat, Marjan
    Naderan, Marjan
    Alavi, Seyyed Enayatallah
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [43] EnsUNet: Enhancing Brain Tumor Segmentation Through Fusion of Pre-trained Models
    Laouamer, Ilhem
    Aiadi, Oussama
    Kherfi, Mohammed Lamine
    Cheddad, Abbas
    Amirat, Hanane
    Laouamer, Lamri
    Drid, Khaoula
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 163 - 174
  • [44] Detection of Speech Related Disorders by Pre-trained Embedding Models Extracted Biomarkers
    Jenei, Attila Zoltan
    Kiss, Gabor
    Sztaho, David
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 279 - 289
  • [45] Effective Utilization of Pre-Trained Models in Ad-Hoc Video Search
    Ueki K.
    Suzuki Y.
    Hori T.
    Takushima H.
    Okamoto H.
    Tanoue H.
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2023, 89 (12): : 926 - 933
  • [46] Comprehensive study of pre-trained language models: detecting humor in news headlines
    Shatnawi, Farah
    Abdullah, Malak
    Hammad, Mahmoud
    Al-Ayyoub, Mahmoud
    SOFT COMPUTING, 2023, 27 (05) : 2575 - 2599
  • [47] Deep Pre-trained Models for Computer Vision Applications: Traffic sign recognition
    Bouaafia, Soulef
    Messaoud, Seifeddine
    Maraoui, Amna
    Ammari, Ahmed Chiheb
    Khriji, Lazhar
    Machhout, Mohsen
    2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 23 - 28
  • [48] A Joint Framework for Predicting Disease-Gene Interactions Based on Pre-trained Models and Graph Attention Networks
    Deng, Qiwen
    Han, Yuexia
    Sun, Jianfei
    PROCEEDINGS OF 2024 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND INTELLIGENT COMPUTING, BIC 2024, 2024, : 474 - 478
  • [49] FastPTM: Fast weights loading of pre-trained models for parallel inference service provisioning
    Cai, Fenglong
    Yuan, Dong
    Yang, Zhe
    Xu, Yonghui
    He, Wei
    Guo, Wei
    Cui, Lizhen
    PARALLEL COMPUTING, 2024, 122
  • [50] Hippocampus segmentation and classification for dementia analysis using pre-trained neural network models
    Priyanka, Ahana
    Ganesan, Kavitha
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2021, 66 (06): : 581 - 592