共 48 条
- [1] Alkhulaifi A(2021)Knowledge distillation in deep learning and its applications PeerJ Comput Sci 7 e474-444
- [2] Alsahli F(2005)ASR for emotional speech: clarifying the issues and enhancing performance Neural Netw 18 437-359
- [3] Ahmad I(2008)IEMOCAP: interactive emotional dyadic motion capture database Lang Resour Eval 42 335-42
- [4] Athanaselis T(2011)Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition IEEE Trans Audio Speech Lang Process 20 30-2030
- [5] Bakamidis S(2016)Domain-adversarial training of neural networks J Mach Learn Res 17 2096-2680
- [6] Dologlou I(2014)Generative adversarial nets Adv Neural Inf Process Syst 27 2672-134
- [7] Cowie R(2016)Multistage data selection-based unsupervised speaker adaptation for personalized speech emotion recognition Eng Appl Artif Intell 52 126-13
- [8] Douglas-Cowie E(2021)Accented speech recognition based on end-to-end domain adversarial training of neural networks Appl Sci 11 1-1920
- [9] Cox C(2010)A novel approach to HMM-based speech recognition systems using particle swarm optimization Math Comput Model 52 1910-1773
- [10] Busso C(2012)Using DTW neural-based MFCC warping to improve emotional speech recognition Neural Comput Appl 21 1765-14018