共 30 条
[1]
Adriaens JT, 2012, INT S HIGH PERF COMP, P79
[2]
[Anonymous], 2018, CUD SDK COD SAMPL
[3]
Bakhoda A, 2009, INT SYM PERFORM ANAL, P163, DOI 10.1109/ISPASS.2009.4919648
[4]
Kernel concurrency opportunities based on GPU benchmarks characterization
[J].
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS,
2020, 23 (01)
:177-188
[5]
Che SA, 2009, I S WORKL CHAR PROC, P44, DOI 10.1109/IISWC.2009.5306797
[7]
Chen Q, 2017, OPER SYST REV, V51, P17, DOI 10.1145/3037697.3037700
[8]
Accelerate GPU Concurrent Kernel Execution by Mitigating Memory Pipeline Stalls
[J].
2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA),
2018,
:208-220
[9]
Gómez-Luna J, 2017, INT SYM PERFORM ANAL, P43, DOI 10.1109/ISPASS.2017.7975269
[10]
Kato S., 2011, P 2011 USENIX C USEN, P17