共 32 条
[1]
Arunkumar Akhil, 2017, SIGARCH Comput. Archit. News, V45, P2
[2]
Memory Access Patterns: The Missing Piece of the Multi-GPU Puzzle
[J].
PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS,
2015,
[3]
Compiling Affine Loop Nests for Distributed-Memory Parallel Architectures
[J].
2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC),
2013,
[4]
A Practical Automatic Polyhedral Parallelizer and Locality Optimizer
[J].
PLDI'08: PROCEEDINGS OF THE 2008 SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN & IMPLEMENTATION,
2008,
:101-+
[5]
Catanzaro B. C., 2006, Tech. Rep. UCB/EECS-2006-183
[6]
Catanzaro B, 2011, ACM SIGPLAN NOTICES, V46, P47, DOI 10.1145/1941553.1941562
[7]
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[8]
Doerfert J, 2017, INT SYM CODE GENER, P292, DOI 10.1109/CGO.2017.7863748
[9]
Doerfert Johannes, 2017, 2017 US LLVM DEV M
[10]
Duato Jose, 2010, P 2010 INT C HIGH PE