11 items in total
- [2] pocl: A Performance-Portable OpenCL Implementation. International Journal of Parallel Programming, 2015, 43: 752-785
- [3] Developing Performance-Portable Molecular Dynamics Kernels in OpenCL. 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), 2012: 386-395
- [4] Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks. Proceedings of 2016 2nd Workshop on Machine Learning in HPC Environments (MLHPC), 2016: 9-18
- [6] IDEFIX: A versatile performance-portable Godunov code for astrophysical flows. Astronomy and Astrophysics, 2023, 677
- [7] Developing High-Performance, Portable OpenCL Code via Multi-Dimensional Homomorphisms. Proceedings of the International Workshop on OpenCL (IWOCL'19), 2019
- [8] Generating Performance Portable Code using Rewrite Rules: From High-Level Functional Expressions to High-Performance OpenCL Code. Proceedings of the 20th ACM SIGPLAN International Conference on Functional Programming (ICFP'15), 2015: 205-217
- [9] FusionCL: a machine-learning based approach for OpenCL kernel fusion to increase system performance. Computing, 2021, 103: 2171-2202