共 50 条
- [32] Comparative Study of Distributed Deep Learning Tools on Supercomputers ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 122 - 137
- [33] ALADDIN: Asymmetric Centralized Training for Distributed Deep Learning PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 863 - 872
- [34] Towards a Scalable and Distributed Infrastructure for Deep Learning Applications PROCEEDINGS OF 2020 IEEE/ACM 5TH WORKSHOP ON DEEP LEARNING ON SUPERCOMPUTERS (DLS 2020), 2020, : 20 - 30
- [35] Understanding Distributed Deep Learning Performance by Correlating HPC and Machine Learning Measurements HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2022, 2022, 13289 : 275 - 292
- [37] SHAT: A Novel Asynchronous Training Algorithm That Provides Fast Model Convergence in Distributed Deep Learning APPLIED SCIENCES-BASEL, 2022, 12 (01):
- [38] Dynamic layer-wise sparsification for distributed deep learning FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 147 : 1 - 15
- [39] Dynamic Stale Synchronous Parallel Distributed Training for Deep Learning 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 1507 - 1517
- [40] Elastic Bulk Synchronous Parallel Model for Distributed Deep Learning 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1504 - 1509