共 27 条
[1]
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
[J].
2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021),
2021,
:802-814
[2]
Adnan M, 2024, Arxiv, DOI arXiv:2204.05436
[3]
High-Performance Recommender System Training using Co-Clustering on CPU/GPU Clusters
[J].
2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP),
2017,
:372-381
[4]
Cheng H.-T., 2016, P 1 WORKSH DEEP LEAR, P7, DOI [DOI 10.1145/2988450.2988454, 10.1145/2988450.2988454]
[5]
Estimating GPU Memory Consumption of Deep Learning Models
[J].
PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20),
2020,
:1342-1352
[6]
RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance
[J].
PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021,
2021,
:870-884
[7]
DeepRecSys: A System for Optimizing End-To-End At-Scale Neural Recommendation Inference
[J].
2020 ACM/IEEE 47TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2020),
2020,
:982-995
[8]
Deep Position-wise Interaction Network for CTR Prediction
[J].
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL,
2021,
:1885-1889
[10]
Optimizing Deep Learning Recommender Systems Training on CPU Cluster Architectures
[J].
PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20),
2020,