共 7 条
- [1] OpenTuner: An Extensible Framework for Program Autotuning [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 303 - 315
- [2] StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems [J]. CGO '21: PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2021, : 315 - 326
- [3] AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs [J]. CGO'20: PROCEEDINGS OF THE18TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2020, : 199 - 211
- [4] On Optimizing Complex Stencils on GPUs [J]. 2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 641 - 652
- [6] StencilMART: Predicting Optimization Selection for Stencil Computations across GPUs [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 875 - 885
- [7] csTuner: Scalable Auto-tuning Framework for Complex Stencil Computation on GPUs [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2021), 2021, : 192 - 203