TAS: A Temperature-Aware Scheduling for Heterogeneous Computing

被引:2
|
作者
Gao, Xiang [1 ]
机构
[1] Res Inst China Telecom Corp Ltd, Guangzhou 510660, Peoples R China
关键词
Graphics processing units; Task analysis; Neural networks; Image edge detection; Schedules; Parallel processing; Computational modeling; Heterogeneous computing; schedule; temperature-aware;
D O I
10.1109/ACCESS.2023.3281839
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of AI technology, the parameters and calculation overhead of advanced models have increased exponentially, resulting in the existing low-end GPU(Graphic Processing Unit) being unable to meet the computing power required for model operation. In order to speed up the inference speed in edge scenarios, various manufacturers have launched NPU(Neural Processor Unit), a special chip for neural networks, which can improve the overall inference efficiency and reduce energy consumption through a certain loss of precision. However, in the current common edge-side solutions, the problem of CPU+GPU+NPU co-processing is not well considered. At the same time, edge-side devices are more easily affected by the ambient temperature. In this paper, CPU+GPU+NPU is used to jointly process edge-side inference tasks, and we first established a heterogeneous device temperature perception model based on the ambient temperature of the edge device, then proposed a TAS(temperature-aware schedule) algorithm to control the running speed of the heterogeneous device, and then proposed a task scheduling algorithm for the heterogeneous device, namely TASTS(TAS-based task schedule). At the same time, we also use a hungarian matching algorithm to optimize the final result. This paper finally verified several models in real edge environment, found that it can improve the performance by 20-50% compared with conventional methods under temperature constraints.
引用
收藏
页码:54773 / 54781
页数:9
相关论文
共 50 条
  • [31] Temperature-aware scheduling and assignment for hard real-time applications on MPSoCs
    Chantem, Thidapat
    Dick, Robert P.
    Hu, X. Sharon
    2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 246 - +
  • [32] Temperature and Energy Aware Scheduling of Heterogeneous Processors
    Kabir, Rashadul
    Izadi, Baback
    2016 NINTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2016, : 48 - 54
  • [33] TMDS: Temperature-aware Makespan Minimizing DAG Scheduler for Heterogeneous Distributed Systems
    Senapati, Debabrata
    Rajesh, Kousik
    Karfa, Chandan
    Sarkar, Arnab
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2023, 28 (06)
  • [34] Temperature-Aware Scheduling and Assignment for Hard Real-Time Applications on MPSoCs
    Chantem, Thidapat
    Hu, X. Sharon
    Dick, Robert P.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (10) : 1884 - 1897
  • [35] Battery-Aware Workflow Scheduling for Portable Heterogeneous Computing
    Jiang, Fu
    Xia, Yaoxin
    Yan, Lisen
    Liu, Weirong
    Zhang, Xiaoyong
    Li, Heng
    Peng, Jun
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, 9 (04): : 677 - 694
  • [36] Energy-aware task scheduling in heterogeneous computing environments
    Jing Mei
    Kenli Li
    Keqin Li
    Cluster Computing, 2014, 17 : 537 - 550
  • [37] Energy-aware task scheduling in heterogeneous computing environments
    Mei, Jing
    Li, Kenli
    Li, Keqin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2014, 17 (02): : 537 - 550
  • [38] Introduction to the Special Section on Adaptive Power Management for Energy and Temperature-Aware Computing Systems
    Coskun, Ayse Kivilcim
    Lu, Yung-Hsiang
    Qiu, Qinru
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2013, 18 (01)
  • [39] Temperature-aware on-chip networks
    Shang, L
    Peh, LS
    Kumar, A
    Jha, NK
    IEEE MICRO, 2006, 26 (01) : 130 - 139
  • [40] TREAFET: Temperature-Aware Real-Time Task Scheduling for FinFET based Multicores
    Chakraborty, Shounak
    Sharma, Yanshul
    Moulik, Sanjay
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (04)