35 entries in total
- [1] [Anonymous], P 3 INT C LEARNING R
- [2] Banner R., 2018, Post-training 4-bit quantization of convolution networks for rapid-deployment
- [4] PRIME: A Novel Processing-in-memory Architecture for Neural Network Computation in ReRAM-based Main Memory [J]. 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), 2016: 27-39
- [5] Low-bit Quantization of Neural Networks for Efficient Inference [J]. 2019 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2019: 3009-3018
- [6] Chuang P, 2019, PROC SYSML, P1
- [7] Degrijse D., 2016, arXiv:1602.04354
- [8] REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs [J]. Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA'19), 2019: 33-42
- [9] Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks [J]. 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA), 2018: 383-396
- [10] Deep Residual Learning for Image Recognition [J]. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770-778