共 38 条
[1]
[Anonymous], 2015, NVIDIA CUDA C Programming Guide
[2]
Bakhoda A, 2009, INT SYM PERFORM ANAL, P163, DOI 10.1109/ISPASS.2009.4919648
[3]
Balasubramonian Rajeev, 2009, TECHNICAL REPORT
[4]
Burtscher M., 2012, 2012 IEEE International Symposium on Workload Characterization (IISWC 2012), P141, DOI 10.1109/IISWC.2012.6402918
[5]
Chang J, 2009, SYMP VLSI CIRCUITS, P152
[6]
Managing DRAM Latency Divergence in Irregular GPGPU Applications
[J].
SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS,
2014,
:128-139
[7]
Che S, 2013, I S WORKL CHAR PROC, P185, DOI 10.1109/IISWC.2013.6704684
[8]
Adaptive Cache Management for Energy-efficient GPU Computing
[J].
2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO),
2014,
:343-355
[10]
Unifying Primary Cache, Scratch, and Register File Memories in a Throughput Processor
[J].
2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45),
2012,
:96-106