共 52 条
[41]
Qian J, 2022, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), P2912
[42]
Radford A., 2018, Language models are unsupervised multitask learners
[43]
Raffel C, 2020, J MACH LEARN RES, V21
[44]
Reimers N, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3982
[45]
Keskar NS, 2019, Arxiv, DOI [arXiv:1909.05858, 10.48550/arXiv.1909.05858]
[46]
Su J., 2021, arXiv, DOI DOI 10.48550/ARXIV.2103.15316
[47]
Touvron H, 2023, Arxiv, DOI [arXiv:2302.13971, 10.48550/arXiv.2302.13971.09685]
[48]
van den Oord A, 2019, Arxiv, DOI arXiv:1807.03748
[49]
Wang T., 2020, INT C MACHINE LEARNI
[50]
Wolf T, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING: SYSTEM DEMONSTRATIONS, P38