共 44 条
[1]
Hardware Counters' Space Reduction for Code Region Characterization
[J].
EURO-PAR 2019: PARALLEL PROCESSING,
2019, 11725
:74-86
[2]
[Anonymous], 2017, 2017 IEEE 19 INT C H
[3]
OpenTuner: An Extensible Framework for Program Autotuning
[J].
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14),
2014,
:303-315
[4]
An Adaptive Performance Modeling Tool for GPU Architectures
[J].
PPOPP 2010: PROCEEDINGS OF THE 2010 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING,
2010,
:105-114
[5]
Bakhoda A, 2009, INT SYM PERFORM ANAL, P163, DOI 10.1109/ISPASS.2009.4919648
[7]
Can search algorithms save large-scale automatic performance tuning?
[J].
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS),
2011, 4
:2136-2145
[8]
Cavazos J, 2007, INT SYM CODE GENER, P185
[9]
Automatically Selecting Profitable Thread Block Sizes for Accelerated Kernels
[J].
2017 19TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS (HPCC) / 2017 15TH IEEE INTERNATIONAL CONFERENCE ON SMART CITY (SMARTCITY) / 2017 3RD IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (DSS),
2017,
:442-449
[10]
End-to-end Deep Learning of Optimization Heuristics
[J].
2017 26TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT),
2017,
:219-232