250 entries in total
[1]
Osoba OA, 2020, Arxiv, DOI arXiv:2006.05048
[3]
Aher G, 2023, PR MACH LEARN RES, V202, P337
[4]
Akata E, 2023, Arxiv, DOI 10.48550/arXiv.2305.16867
[5]
Alluhaybi B, 2019, INT J ADV COMPUT SC, V10, P211
[6]
DeepSpeed-Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale [J]. SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, 2022
[8]
[Anonymous], 2000, Mind & Society, DOI 10.1007/BF02512229