共 19 条
- [1] Abadi M., 2016, TENSORFLOW LARGE SCA
- [2] [Anonymous], MICR COGN TOOLK
- [3] [Anonymous], OPT PRIM COLL MULTIG
- [4] [Anonymous], CAFF FAST OP FRAM DE
- [5] Awan AA, 2017, ACM SIGPLAN NOTICES, V52, P193, DOI [10.1145/3155284.3018769, 10.1145/3018743.3018769]
- [6] Banerjee DS, 2016, INT CONF CLOUD COMP, P144, DOI [10.1109/CloudCom.2016.33, 10.1109/CloudCom.2016.0036]
- [7] Bureddy Devendar, 2012, P EUR MPI US GROUP M, P110, DOI [DOI 10.1007/978-3-642-33518-116, 10.1007/978-3-642-33518-116]
- [8] Chu CH, 2016, PROCEEDINGS OF FIRST WORKSHOP ON OPTIMIZATION OF COMMUNICATION IN HPC RUNTIME SYSTEMS (COM-HPC 2016), P29, DOI [10.1109/COMHPC.2016.009, 10.1109/COM-HPC.2016.9]
- [9] Designing High Performance Heterogeneous Broadcast for Streaming Applications on GPU Clusters [J]. PROCEEDINGS OF 28TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, (SBAC-PAD 2016), 2016, : 59 - 66
- [10] Hoefler T., 2007, PROC INT WORKSHOP CO, P232