共 83 条
[2]
Agarla M, 2023, Arxiv, DOI arXiv:2207.06767
[4]
SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers
[J].
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022,
2022,
[5]
Baevski A, 2020, ADV NEUR IN, V33
[6]
Burkhardt F, 2022, LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1917
[7]
Burkhardt F., 2005, P INT, DOI DOI 10.21437/INTERSPEECH.2005-446
[9]
DISTILHUBERT: SPEECH REPRESENTATION LEARNING BY LAYER-WISE DISTILLATION OF HIDDEN-UNIT BERT
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:7087-7091
[10]
Chen LW, 2023, Arxiv, DOI arXiv:2110.06309