50 entries in total
[41] Scan Stack: A Search-based Concurrent Stack for GPU [J]. Proceedings of the 2023 ACM Southeast Conference, ACMSE 2023, 2023: 10-19
[42] SEER: A Time Prediction Model for CNNs from GPU Kernel's View [J]. 30th International Conference on Parallel Architectures and Compilation Techniques (PACT 2021), 2021: 173-185
[43] Principal Kernel Analysis: A Tractable Methodology to Simulate Scaled GPU Workloads [J]. Proceedings of 54th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2021, 2021: 724-737
[44] GPU Code Optimization using Abstract Kernel Emulation and Sensitivity Analysis [J]. Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2018, 2018: 736-751