Pre-Finetuning for Few-Shot Emotional Speech Recognition

被引:1
|
作者
Chen, Maximillian [1 ]
Yu, Zhou [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
来源
INTERSPEECH 2023 | 2023年
关键词
emotion recognition; low-resource learning; pre-finetuning; transfer learning; CORPUS;
D O I
10.21437/Interspeech.2023-136
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech models have long been known to overfit individual speakers for many classification tasks. This leads to poor generalization in settings where the speakers are out-of-domain or out-of-distribution, as is common in production environments. We view speaker adaptation as a few-shot learning problem and propose investigating transfer learning approaches inspired by recent success with pre-trained models in natural language tasks. We propose pre-finetuning speech models on difficult tasks to distill knowledge into few-shot downstream classification objectives. We pre-finetune Wav2Vec2.0 on every permutation of four multiclass emotional speech recognition corpora and evaluate our pre-finetuned models through 33,600 few-shot fine-tuning trials on the Emotional Speech Dataset.
引用
收藏
页码:3602 / 3606
页数:5
相关论文
共 50 条
  • [1] A prototypical network for few-shot recognition of speech imagery data
    Hernandez-Galvan, Alan
    Ramirez-Alonso, Graciela
    Ramirez-Quintana, Juan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [2] Few-shot dysarthric speech recognition with text-to-speech data augmentation
    Hermann, Enno
    Magimai-Doss, Mathew
    INTERSPEECH 2023, 2023, : 156 - 160
  • [3] Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning
    Utama, Prasetya Ajie
    Moosavi, Nafise Sadat
    Sanh, Victor
    Gurevych, Iryna
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9063 - 9074
  • [4] Few-shot learning for ear recognition
    Zhang, Jie
    Yu, Wen
    Yang, Xudong
    Deng, Fang
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 50 - 54
  • [5] Few-shot Logo Recognition in the Wild
    Ermakov, Mikhail
    Makarov, Ilya
    2022 IEEE 22ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS AND 8TH IEEE INTERNATIONAL CONFERENCE ON RECENT ACHIEVEMENTS IN MECHATRONICS, AUTOMATION, COMPUTER SCIENCE AND ROBOTICS (CINTI-MACRO), 2022, : 393 - 397
  • [6] Few-Shot Hyperspectral Image Classification Using Meta Learning and Regularized Finetuning
    Li, Wenmei
    Liu, Qing
    Zhang, Yu
    Wang, Yu
    Yuan, Yuan
    Jia, Yan
    He, Yuhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 14
  • [7] Improving Imbalanced Learning by Pre-finetuning with Data Augmentation
    Shi, Yiwen
    ValizadehAslani, Taha
    Wang, Jing
    Ren, Ping
    Zhang, Yi
    Hu, Meng
    Zhao, Liang
    Liang, Hualou
    FOURTH INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 183, 2022, 183 : 68 - 82
  • [8] Prototype equilibrium network with group emotional contagion for few-shot emotion recognition in conversation
    Jiang, Min
    Wang, Mengdi
    Kong, Jun
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (06) : 2229 - 2246
  • [9] Worst Case Matters for Few-Shot Recognition
    Fu, Minghao
    Cao, Yun-Hao
    Wu, Jianxin
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 99 - 115
  • [10] Few-shot nested named entity recognition
    Ming, Hong
    Yang, Jiaoyun
    Gui, Fang
    Jiang, Lili
    An, Ning
    KNOWLEDGE-BASED SYSTEMS, 2024, 293