25 entries in total
[1]
Ozdal M M, Yesil S, Kim T, et al., Energy efficient architecture for graph analytics accelerators, Proc of the 43rd Annual ACM/IEEE Int Symp on Computer Architecture (ISCA), pp. 166-177, (2016)
[2]
Beamer S, Asanovic K, Patterson D., Locality exists in graph processing: Workload characterization on an Ivy Bridge server, Proc of IEEE Int Symp on Workload Characterization (IISWC), pp. 56-65, (2015)
[3]
Garland M, Kirk D B., Understanding throughput-oriented architectures, Communications of the ACM, 53, 11, pp. 58-66, (2010)
[4]
O'Neil M A, Burtscher M., Microarchitectural performance characterization of irregular GPU kernels, Proc of IEEE Int Symp on Workload Characterization (IISWC), pp. 130-139, (2014)
[5]
Yao Pengcheng, Zheng Long, Liao Xiaofei, et al., An efficient graph accelerator with parallel data conflict management, Proc of the 27th Int Conf on Parallel Architectures and Compilation Techniques, (2018)
[6]
Ham T J, Wu L, Sundaram N, et al., Graphicionado: A high-performance and energy-efficient accelerator for graph analytics, Proc of the 49th Annual IEEE/ACM Int Symp on Microarchitecture (MICRO), pp. 1-13, (2016)
[7]
Dai Guohao, Huang Tianhao, Chi Yuze, et al., ForeGraph: Exploring large-scale graph processing on multi-FPGA architecture, Proc of the 2017 ACM/SIGDA Int Symp on Field-Programmable Gate Arrays, pp. 217-226, (2017)
[8]
Zhou Shijie, Chelmis C, Prasanna V K., High-throughput and energy-efficient graph processing on FPGA, Proc of the 24th Annual IEEE Int Symp on Field-Programmable Custom Computing Machines (FCCM), pp. 103-110, (2016)
[9]
Bacon D F, Rabbah R M, Shukla S., FPGA programming for the masses, Communications of the ACM, 56, 4, pp. 56-63, (2013)
[10]
Bachrach J, Vo H, Richards B, et al., Chisel: Constructing hardware in a Scala embedded language, Proc of Design Automation Conf (DAC 2012), pp. 1212-1221, (2012)