34 entries in total
[1]
A. I. for AI, 2020, C4: The colossal clean crawled corpus
[2]
[Anonymous], The Yelp Dataset
[3]
Artetxe M, 2022, Arxiv, DOI arXiv:2112.10684
[4]
Brown TB, 2020, ADV NEUR IN, V33
[5]
Chen C., 2022, Advances in Neural Information Processing Systems, V35, P173
[6]
Child R, 2019, Arxiv, DOI arXiv:1904.10509
[7]
Fedus W, 2022, J MACH LEARN RES, V23
[8]
Gao L, 2020, Arxiv, DOI [arXiv:2101.00027, 10.48550/arXiv.2101.00027]
[9]
He JA, 2021, Arxiv, DOI arXiv:2103.13262
[10]
FASTERMOE: Modeling and Optimizing Training of Large-Scale Dynamic Pre-Trained Models [J]. PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022: 120-134