50 entries in total
[22] An Optimization of FMM under CPU plus GPU Heterogeneous Architecture [J]. PROCEEDINGS OF THE 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMERCE AND ENTERPRISE COMPUTING (CEC 2012), 2012: 147-150
[25] An Approach Towards Distributed DNN Training on FPGA Clusters [J]. ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2024, 2024, 14842: 18-32
[27] Accelerating Iterative Protein Sequence Alignment on a Heterogeneous GPU-CPU Platform [J]. 2016 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2016), 2016: 403-410
[28] Distributed Hybrid CPU and GPU training for Graph Neural Networks on Billion-Scale Heterogeneous Graphs [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022: 4582-4591
[29] AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters [J]. 2022 IEEE 29TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC, 2022: 32-41
[30] Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters [J]. SC23: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2023,