共 50 条
- [42] XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech INTERSPEECH 2023, 2023, : 5506 - 5510
- [43] A Feature Fusion Model with Data Augmentation for Speech Emotion Recognition APPLIED SCIENCES-BASEL, 2023, 13 (07):
- [44] Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition INTERSPEECH 2020, 2020, : 4113 - 4117
- [45] Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 30 - 42
- [46] Improving Under-Resourced Code-Switched Speech Recognition: Large Pre-trained Models or Architectural Interventions INTERSPEECH 2023, 2023, : 1439 - 1443
- [48] Improving Automatic Emotion Recognition from Speech Signals INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +