共 42 条
[31]
Sennrich R, 2016, Arxiv, DOI [arXiv:1511.06709, DOI 10.48550/ARXIV.1511.06709]
[32]
Srivastava N, 2014, J MACH LEARN RES, V15, P1929
[33]
Vaswani A., 2017, Adv. Neural Inf. Process. Syst, V30, P1
[34]
Vaswani A, 2023, Arxiv, DOI [arXiv:1706.03762, 10.48550/arXiv.1706.03762, DOI 10.48550/ARXIV.1706.03762]
[35]
Wei J, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P6382
[36]
Wen TH, 2017, Arxiv, DOI arXiv:1604.04562
[37]
Wu CS, 2019, Arxiv, DOI arXiv:1901.04713
[39]
Zhang X, 2015, ADV NEUR IN, V28
[40]
ADAPTING GPT, GPT-2 AND BERT LANGUAGE MODELS FOR SPEECH RECOGNITION
[J].
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU),
2021,
:162-168