共 28 条
- [1] Adapting combined tiling to stencil optimizations on sunway processor CCF Transactions on High Performance Computing, 2023, 5 : 322 - 333
- [2] 26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight 2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 535 - 544
- [3] TOAST: Automatic tiling for iterative stencil computations on GPUs CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (08):
- [4] Toward Accelerated Stencil Computation by Adapting Tensor Core Unit on GPU PROCEEDINGS OF THE 36TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2022, 2022,
- [5] An Approach of Processor Core Customization for Stencil Computation PROCEEDINGS OF THE 2014 IEEE 25TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2014), 2014, : 182 - +
- [6] Extreme-scale Realistic Stencil Computations on Sunway TaihuLight with Ten Million Cores 2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 566 - 571
- [8] Towards optimized tensor code generation for deep learning on sunway many-core processor Frontiers of Computer Science, 2024, 18
- [10] Evaluating optimizations that reduce global memory accesses of stencil computations in GPGPUs CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (18):