43 entries in total
[21]
Performance Comparision of TPU, GPU, CPU on Google Colaboratory over Distributed Deep Learning
[J].
2021 IEEE 14TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2021),
2021,
:312-319
[22]
Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL?
[J].
EUROMPI 2018: PROCEEDINGS OF THE 25TH EUROPEAN MPI USERS' GROUP MEETING,
2018,
[23]
Distributed Deep Learning Framework based on Shared Memory for Fast Deep Neural Network Training
[J].
2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC),
2018,
:1239-1242
[24]
Falcon: Towards Computation-Parallel Deep Learning in Heterogeneous Parameter Server
[J].
2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019),
2019,
:196-206
[25]
Distributed Deep Learning for Multi-Label Chest Radiography Classification
[J].
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4,
2022,
:949-956
[26]
Multi-Switch Cooperative In-Network Aggregation for Distributed Deep Learning
[J].
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM,
2023,
:4767-4772
[27]
Early Experiences of Noise-Sensitivity Performance Analysis of a Distributed Deep Learning Framework
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022),
2022,
:516-522
[28]
A Dynamic Sliding Window Based Tensor Communication Scheduling Framework for Distributed Deep Learning
[J].
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING,
2025, 12 (02)
:1080-1095
[29]
Scheduling Deep Learning Jobs in Multi-Tenant GPU Clusters via Wise Resource Sharing
[J].
2024 IEEE/ACM 32ND INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE, IWQOS,
2024,