共 36 条
[1]
Lecun Y, Bengio Y, Hinton G., Deep learning, Nature, 521, 7553, pp. 436-444, (2015)
[2]
Krizhevsky A, Et al., ImageNet classification with deep convolutional neural networks, Communications of the ACM, 60, 6, pp. 84-90, (2017)
[3]
Szegedy C, Et al., Going deeper with convolutions, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-9, (2015)
[4]
Jouppi N P, Young C, Patil N, Et al., In-datacenter performance analysis of a tensor processing unit, Proceedings of the Annual International Symposium on Computer Architecture, pp. 1-12, (2017)
[5]
Chen Y, Luo T, Liu S, Et al., DaDianNao: A machine-learning supercomputer, Proceedings of the Annual IEEE/ACM International Symposium on Microarchitecture, pp. 609-622, (2014)
[6]
Chen Y H, Yang T J, Emer J, Et al., Eyeriss v2: A flexible accelerator for emerging deep neural networks on mobile devices, IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 9, 2, pp. 292-308, (2019)
[7]
Gu Yi-Kun, Ni Feng-Lei, Liu Hong, Fault-tolerance design of Xilinx FPGA with self-hosting configuration management, Journal of Astronautics, 33, 10, pp. 1519-1527, (2012)
[8]
Wang Chao, Wang Teng, Ma Xiang, Zhou Xue-Hai, Research progress on FPGA-based machine learning hardware acceleration, Chinese Journal of Computers, 43, 6, pp. 1161-1182, (2020)
[9]
Wu Yan-Xia, Liang Kai, Liu Ying, Cui Hui-Min, The progress and trends of FPGA-based accelerators in deep learning, Chinese Journal of Computers, 42, 11, pp. 2461-2480, (2019)
[10]
Geng T, Wang T, Sanaullah A, Et al., A framework for acceleration of CNN training on deeply-pipelined FPGA clusters with work and weight load balancing, Proceedings of the International Conference on Field Programmable Logic and Applications, pp. 394-398, (2018)