共 129 条
[31]
Farmahini-Farahani A, 2015, INT S HIGH PERF COMP, P283, DOI 10.1109/HPCA.2015.7056040
[32]
TETRIS: Scalable and efficient neural network acceleration with 3D memory
[J].
1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (52)
:751-764
[33]
GenStore: A High-Performance In-Storage Processing System for Genome Sequence Analysis
[J].
ASPLOS '22: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS,
2022,
:635-654
[34]
github, pybind11
[35]
github, Xilinx OpenCL extension
[36]
github, Deepspeed
[37]
Goyal P, 2018, Arxiv, DOI arXiv:1706.02677
[38]
Biscuit: A Framework for Near-Data Processing of Big Data Workloads
[J].
2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA),
2016,
:153-165
[39]
Gu YX, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P8410
[40]
h3platform, Falcon 4109