50 results in total
[31] Characterizing Convolutional Neural Network Workloads on a Detailed GPU Simulator [J]. PROCEEDINGS INTERNATIONAL SOC DESIGN CONFERENCE 2017 (ISOCC 2017), 2017: 84-85
[32] Evaluating On-Node GPU Interconnects for Deep Learning Workloads [J]. HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION (PMBS 2017), 2018, 10724: 3-21
[33] Reliability of Large Scale GPU Clusters for Deep Learning Workloads [J]. WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021: 179-181
[34] Accelerating Broadcast Communication with GPU Compression for Deep Learning Workloads [J]. 2022 IEEE 29TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC, 2022: 22-31
[35] Whippletree: Task-based Scheduling of Dynamic Workloads on the GPU [J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06)
[37] GPU Memory Reallocation Techniques in Fully Homomorphic Encryption Workloads [J]. 39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024: 1525-1532
[38] Analyzing Machine Learning Workloads Using a Detailed GPU Simulator [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019: 151-152