共 83 条
- [2] Agarla M, 2023, Arxiv, DOI arXiv:2207.06767
- [4] [Anonymous], 2005, P INTERSPEECH, DOI DOI 10.21437/INTERSPEECH.2005-446
- [5] [Anonymous], 2010, Multimodal Emotion Recognition, DOI DOI 10.4018/978-1-61520-919-4
- [6] SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers [J]. PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
- [7] Baevski A., 2020, Advances in neural information processing systems
- [8] Burkhardt F, 2022, LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1917
- [10] DISTILHUBERT: SPEECH REPRESENTATION LEARNING BY LAYER-WISE DISTILLATION OF HIDDEN-UNIT BERT [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7087 - 7091