Lightweight asynchronous scheduling in heterogeneous reconfigurable systems

被引:3
|
作者
Rodriguez, Andres [1 ]
Navarro, Angeles [1 ]
Nikov, Kris [2 ]
Nunez-Yanez, Jose [2 ]
Gran, Ruben [3 ]
Gracia, Dario Suarez [3 ]
Asenjo, Rafael [1 ]
机构
[1] Univ Malaga, Dept Comp Architecture, Malaga, Spain
[2] Univ Bristol, Dept Elect & Elect Engn, Bristol, Avon, England
[3] Univ Zaragoza, Comp Architecture Grp, Zaragoza, Spain
基金
英国工程与自然科学研究理事会;
关键词
Heterogeneous architecture; FPGA; Heterogeneous scheduling; Throughput model; Energy efficiency;
D O I
10.1016/j.sysarc.2022.102398
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The trend for heterogeneous embedded systems is the integration of accelerators and general-purpose CPU cores on the same die. In these integrated architectures, like the Zynq UltraScale+ board (CPU+FPGA) that we target in this work, hardware support for shared memory and low-overhead synchronization between the accelerator and the CPU cores make the case for exploring strategies that exploit a tight collaboration between the CPUs and the accelerator. In this paper we propose a novel lightweight scheduling strategy, FastFit, targeted to FPGA accelerators, and a new scheduler based on it, named MultiFastFit, which asynchronously tackles heterogeneous systems comprised of a variety of CPU cores and FPGA IPs. Our strategy significantly reduces the overhead to automatically compute the near-optimal chunksizes when compared to a previous state-of-the-art auto-tuned approach, which makes our approach more suitable for fine-grained applications. Additionally, our scheduler MultiFastFit has been designed to enable the efficient co-execution of work among compute devices in such a way that all the devices are busy while minimizing the load unbalance.Our approaches have been evaluated using four benchmarks carefully tuned for the low-power UltraScale+ platform. Our experiments demonstrate that the FastFit strategy always finds the near-optimal FPGA chunksize for any device configuration at a reasonable cost, even for fine-grained and irregular applications, and that heterogeneous CPU+FPGA co-executions that exploit all the compute devices are usually faster and more energy efficient than the CPU-only and FPGA-only executions. We have also compared MultiFastFit with other state-of-the-art scheduling strategies, finding that it outperforms other auto-tuned approach up to 2x and it achieves similar results to manually-tuned schedulers without requiring an offline search of the ideal CPU-FPGA partition or FPGA chunk granularity.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Decentralised control of heterogeneous interconnected systems with asynchronous sampling
    Zhang, Shiqiang
    Liu, Zidong
    Zhao, Dongya
    International Journal of Industrial and Systems Engineering, 2024, 48 (02) : 180 - 197
  • [42] Information Matrix Fusion for Nonlinear, Asynchronous and Heterogeneous Systems
    Yang, Kaipei
    Bar-Shalom, Yaakov
    Chang, Kuo-Chu
    2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [43] Discharge scheduling for voltage balancing in reconfigurable battery systems
    Kim, Dongwan
    Lee, Jinkyu
    ELECTRONICS LETTERS, 2017, 53 (07) : 496 - 498
  • [44] Scheduling Mixed-criticality Systems on Reconfigurable Platforms
    Sehhatbakhsh, Sadegh
    Sedaghat, Yasser
    2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 431 - 436
  • [45] Modelling the problem of production scheduling for reconfigurable manufacturing systems
    Azab, Ahmed
    Naderi, Bahman
    9TH CIRP CONFERENCE ON INTELLIGENT COMPUTATION IN MANUFACTURING ENGINEERING - CIRP ICME '14, 2015, 33 : 76 - 80
  • [46] Efficient Resource Scheduling for Runtime Reconfigurable Systems on FPGAs
    Alismail, Shaden
    Koch, Dirk
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 123 - 129
  • [47] Parameterized module scheduling algorithm for reconfigurable computing systems
    Kota, Solomon Raju
    Shekhar, Chandra
    Kokkula, Archana
    Toshniwal, Durga
    Kartikeyan, M. V.
    Joshi, R. C.
    ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 473 - +
  • [48] Clustering scheduling for hardware tasks in reconfigurable computing systems
    Chen, Zhi
    Qiu, Meikang
    Ming, Zhong
    Yang, Laurence T.
    Zhu, Yongxin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2013, 59 (10) : 1424 - 1432
  • [49] Mapping Adaptive Particle Filters to Heterogeneous Reconfigurable Systems
    Chau, Thomas C. P.
    Niu, Xinyu
    Eele, Alison
    Maciejowski, Jan
    Cheung, Peter Y. K.
    Luk, Wayne
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2015, 7 (04)
  • [50] Application of a heterogeneous reconfigurable architecture to OFDM wireless systems
    Niktash, Afshin
    Parizi, Hooman T.
    Bagherzadeh, Nader
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 2586 - 2589