共 14 条
- [1] Efficient Stencil Computation with Temporal Blocking by Halide DSL 2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 870 - 877
- [2] AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs CGO'20: PROCEEDINGS OF THE18TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2020, : 199 - 211
- [3] Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL PROCEEDINGS OF THE 2018 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'18), 2018, : 153 - 162
- [5] A Performance Study for Iterative Stencil Loops on GPUs with Ghost Zone Optimizations International Journal of Parallel Programming, 2011, 39 : 115 - 142
- [7] Revisiting split tiling for stencil computations in polyhedral compilation JOURNAL OF SUPERCOMPUTING, 2022, 78 (01): : 440 - 470
- [8] Revisiting split tiling for stencil computations in polyhedral compilation The Journal of Supercomputing, 2022, 78 : 440 - 470
- [9] A Highly Efficient I/O-based Out-of-Core Stencil Algorithm with Globally Optimized Temporal Blocking 2017 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2017,
- [10] Practical applicability of optimizations and performance models to complex stencil-based loop kernels in CFD INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (04): : 602 - 618