共 40 条
[1]
MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-Based Systems
[J].
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS),
2012,
:647-654
[2]
[Anonymous], 2007, NVIDIA CUDA BASIC LI
[3]
[Anonymous], 1999, PAGERANK CITATION RA
[5]
Augonnet Cedric., 2012, European MPI Users' Group Meeting, P298, DOI DOI 10.1007/978-3-642-33518-1_40
[6]
Bauer M, 2012, INT CONF HIGH PERFOR
[7]
Beguelin A., 1991, ORNLTM11826 U TENN
[8]
Memory Access Patterns: The Missing Piece of the Multi-GPU Puzzle
[J].
PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS,
2015,
[9]
Beri A. Tarun, 2015, P INT C PAR DISTR PR, P48
[10]
ProSteal: A proactive work stealer for bulk synchronous tasks distributed on a cluster of heterogeneous machines with multiple accelerators
[J].
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS,
2015,
:17-26