38 entries in total
[1]
Ba JL, 2016, arXiv
[2]
Brown TB, 2020, ADV NEUR IN, V33
[3]
A Deep Look into Logarithmic Quantization of Model Parameters in Neural Networks [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY (IAIT2018), 2018
[4]
Dehghani M., 2023, arXiv
[5]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]
Dosovitskiy A, 2021, arXiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[8]
Driess D, 2023, arXiv, DOI arXiv:2303.03378
[9]
A3: Accelerating Attention Mechanisms in Neural Networks with Approximation [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2020), 2020: 328-341
[10]
Kim S., 2021, P 38 INT C MACH LEAR, P5506