共 66 条
[41]
Nvidia, 2020, CUDA C programming guide
[42]
Nvidia, 2018, CUB Documentation
[43]
Nvidia, 2013, cuBLAS
[45]
Paszke A, 2019, ADV NEUR IN, V32
[46]
Polychronopoulos C. D., 1987, Proceedings of the 1987 International Conference on Parallel Processing, P235
[47]
Polyhedral Optimization of TensorFlow Computation Graphs
[J].
PROGRAMMING AND PERFORMANCE VISUALIZATION TOOLS,
2019, 11027
:74-89
[49]
Reduction Drawing: Language Constructs and Polyhedral Compilation for Reductions on GPUs
[J].
2016 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION TECHNIQUES (PACT),
2016,
:87-97
[50]
Simonyan K, 2015, Arxiv, DOI [arXiv:1409.1556, DOI 10.48550/ARXIV.1409.1556]