37 entries in total
[1]
Alec Radford, Karthik Narasimhan., 2018, IMPROVING LANGUAGE U
[2]
Artetxe Mikel., 2021, Efficient large scale language modeling with mixtures of experts
[3]
Athiwaratkun B, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P375
[4]
Black Sid., 2021, GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow
[5]
Brown TB, 2020, ADV NEUR IN, V33
[6]
Bucila Cristian., 2006, P 12 ACM SIGKDD INT, P535, DOI 10.1145/1150402.1150464
[7]
Chowdhery A, 2022, arXiv, arXiv:2204.02311
[8]
Coucke A., 2018, CORR
[9]
De Cao N., 2020, AUTOREGRESSIVE ENTIT
[10]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171