共 29 条
[1]
[Anonymous], 2023, Facebook Research
[2]
[Anonymous], Different development paths of llms
[3]
[Anonymous], 2023, Nvidia/megatron-lm: Ongoing research training transformer models at scale
[4]
Cobbe K, 2021, Arxiv, DOI arXiv:2110.14168
[6]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]
FairScale authors, 2021, Fairscale: A general purpose modular pytorch library for high performance and large scale training
[9]
Foster D., 2022, Generative Deep Learning
[10]
Gozalo-Brizuela R., 2023, CHATGPT IS NOT ALL Y