共 36 条
[1]
RNN-T MODELS FAIL TO GENERALIZE TO OUT-OF-DOMAIN AUDIO: CAUSES AND SOLUTIONS
[J].
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT),
2021,
:873-880
[2]
CHOLLET F, 2017, PROC CVPR IEEE, P1800, DOI [DOI 10.1109/CVPR.2017.195, 10.1109/CVPR.2017.195]
[3]
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[4]
Graves A., 2012, Sequence transduction with recurrent neural networks
[5]
Graves A, 2006, Proceedings of the 23rd International Conference on Machine Learning, ICML'06, page, P369, DOI DOI 10.1145/1143844.1143891
[6]
Hannun A., 2019, ARXIV190402619
[7]
He K, 2016, PROC CVPR IEEE, P770, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]
[8]
He YZ, 2019, INT CONF ACOUST SPEE, P6381, DOI [10.1109/ICASSP.2019.8682336, 10.1109/icassp.2019.8682336]
[9]
Hu J., 2018, 32 C NEUR INF PROC S, Vvol 31, P9401, DOI 10.5555/3327546.3327612
[10]
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]