共 32 条
- [1] Mendonça G(2017)DAWNCC: automatic annotation for data parallelism and offloading ACM Trans Archit Code Optim (TACO) 14 13-26
- [2] Guimarães B(2015)Bones: an automatic skeleton-based c-to-cuda compiler for gpus ACM Trans Arch Code Optim (TACO) 11 35-139
- [3] Alves P(2019)Transparent acceleration for heterogeneous platforms with compilation to opencl ACM Trans Arch Code Optim (TACO) 16 1-3841
- [4] Pereira M(2019)Data-flow analysis and optimization for data coherence in heterogeneous architectures J Parallel Distrib Comput 130 126-507
- [5] Araújo G(2013)Polyhedral parallel code generation for cuda ACM Trans Arch Code Optim (TACO) 9 54-1192
- [6] Pereira FMQ(2019)Not: a high-level no-threading parallel programming method for heterogeneous systems J Supercomput 75 3810-undefined
- [7] Nugteren C(2015)Dwarfcode: a performance prediction tool for parallel applications IEEE Trans Comput 65 495-undefined
- [8] Corporaal H(2017)Predicting hpc parallel program performance based on llvm compiler Cluster Comput 20 1179-undefined
- [9] Riebler H(undefined)undefined undefined undefined undefined-undefined
- [10] Vaz G(undefined)undefined undefined undefined undefined-undefined