Automatic Pipeline Parallelism: A Parallel Inference Framework for Deep Learning Applications in 6G Mobile Communication Systems

被引:4
作者
Shi, Hongjian [1 ]
Zheng, Weichu [1 ]
Liu, Zifei [1 ]
Ma, Ruhui [1 ]
Guan, Haibing [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
关键词
Parallel processing; Task analysis; Schedules; Data models; Training; Pipelines; Computational modeling; Distributed learning; system heterogeneity; parallel inference; hardware profiling; task scheduling; LOW-LATENCY COMMUNICATIONS; RESOURCE-ALLOCATION; 5G; OPTIMIZATION; CHALLENGES; NETWORKS;
D O I
10.1109/JSAC.2023.3280970
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the rapid development of wireless communication, achieving the neXt generation Ultra-Reliable and Low-Latency Communications (xURLLC) in 6G mobile communication systems has become a critical problem. Among many applications in xURLLC, deep learning model inference requires improvement over its efficiency. Due to the heterogeneous hardware environment in 6G, parallel schedules from distributed machine learning and edge computing has been borrowed to tackle the efficiency problem. However, traditional parallel schedules suffer from high latency, low throughput, and low device utility. In this paper, we propose Automatic Pipeline Parallelism (AP(2)), a parallel inference framework for deep learning applications in 6G mobile communication systems, to improve the model inference efficiency while maintaining reliability. (AP(2)) contains three sub-modules. A task-device affinity predictor predicts a task's expected execution time on a given device. The parallel inference arrangement optimizer finds the most suitable device for each task. The parallel inference scheduler converts the arrangement to a schedule that can be directly executed in the system. The experimental results show that (AP(2)) can achieve better latency, throughput, reliability, and device utility than other parallel schedules. Also, the priority of the sub-module designs has been approved through the experiments.
引用
收藏
页码:2041 / 2056
页数:16
相关论文
共 82 条
  • [1] Latency Minimization for Intelligent Reflecting Surface Aided Mobile Edge Computing
    Bai, Tong
    Pan, Cunhua
    Deng, Yansha
    Elkashlan, Maged
    Nallanathan, Arumugam
    Hanzo, Lajos
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (11) : 2666 - 2682
  • [2] Baldini Ioana, 2014, 2014 IEEE 26th International Symposium on Computer Architecture and High-Performance Computing (SBAC-PAD), P254, DOI 10.1109/SBAC-PAD.2014.30
  • [3] Convergence conditions of genetic algorithms
    Barrios, D
    Malumbres, L
    Rios, J
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1998, 68 (3-4) : 231 - 241
  • [4] Superior Fitting of Arterial Resistance and Compliance Parameters With Genetic Algorithms in Models of Dynamic Cerebral Autoregulation
    Bello Robles, Felipe-Andres
    Panerai, Ronney B.
    Katsogridakis, Emmanuel
    Chacon, Max
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (01) : 503 - 512
  • [5] An Efficient Deep Learning Approach To IoT Intrusion Detection
    Cao, Jin
    Lin, Liwei
    Ma, Ruhui
    Guan, Haibing
    Tian, Mengke
    Wang, Yong
    [J]. COMPUTER JOURNAL, 2022, 65 (11) : 2870 - 2879
  • [6] A Chaotic Ant Colony Optimized Link Prediction Algorithm
    Cao, Zhiwei
    Zhang, Yichao
    Guan, Jihong
    Zhou, Shuigeng
    Wen, Guanghui
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (09): : 5274 - 5288
  • [7] Evolutionary Multitasking for Feature Selection in High-Dimensional Classification via Particle Swarm Optimization
    Chen, Ke
    Xue, Bing
    Zhang, Mengjie
    Zhou, Fengyu
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (03) : 446 - 460
  • [8] Correlated Input-Dependent Label Noise in Large-Scale Image Classification
    Collier, Mark
    Mustafa, Basil
    Kokiopoulou, Efi
    Jenatton, Rodolphe
    Berent, Jesse
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1551 - 1560
  • [9] A Survey on Non-Orthogonal Multiple Access for 5G Networks: Research Challenges and Future Trends
    Ding, Zhiguo
    Lei, Xianfu
    Karagiannidis, George K.
    Schober, Robert
    Yuan, Jinhong
    Bhargava, Vijay K.
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (10) : 2181 - 2195
  • [10] Sharp Bounds for Genetic Drift in Estimation of Distribution Algorithms
    Doerr, Benjamin
    Zheng, Weijie
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2020, 24 (06) : 1140 - 1149