MapZero: Mapping for Coarse-grained Reconfigurable Architectures with Reinforcement Learning and Monte-Carlo Tree Search

被引:6
|
作者
Kong, Xiangyu [1 ]
Huang, Yi [1 ]
Zhu, Jianfeng [1 ]
Man, Xingchen [1 ]
Liu, Yang [2 ]
Feng, Chunyang [2 ]
Gou, Pengfei [3 ]
Tang, Minggui [3 ]
Wei, Shaojun [1 ]
Liu, Leibo [1 ]
机构
[1] Tsinghua Univ, BNRist, Sch Integrated Circuits, Beijing, Peoples R China
[2] GBA, Innovat Inst High Performance Server, Guangzhou, Guangdong, Peoples R China
[3] HEXIN Technol Co Ltd, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Coarse-Grained Reconfigurable Architecture; Compiler; Graph Neural Network; Reinforcement Learning; DATA-FLOW GRAPH; CGRA; ALGORITHM; FRAMEWORK; SHOGI; CHESS; GO;
D O I
10.1145/3579371.3589081
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Coarse-grained reconfigurable architecture (CGRA) has become a promising candidate for data-intensive computing due to its flexibility and high energy efficiency. CGRA compilers map data flow graphs (DFGs) extracted from applications onto CGRAs, playing a fundamental role in fully exploiting hardware resources for acceleration. Yet the existing compilers are time-demanding and cannot guarantee optimal results due to the traversal search of enormous search spaces brought about by the spatio-temporal flexibility of CGRA structures and the complexity of DFGs. Inspired by the amazing progress in reinforcement learning (RL) and Monte-Carlo tree search (MCTS) for real-world problems, we consider constructing a compiler that can learn from past experiences and comprehensively understand the target DFG and CGRA. In this paper, we propose an architecture-aware compiler for CGRAs based on RL and MCTS, called MapZero - a framework to automatically extract the characteristics of DFG and CGRA hardware and map operations onto varied CGRA fabrics. We apply Graph Attention Network to generate an adaptive embedding for DFGs and also model the functionality and interconnection status of the CGRA, aiming at training an RL agent to perform placement and routing intelligently. Experimental results show that MapZero can generate superior-quality mappings and reduce compilation time hundreds of times compared to state-of-the-art methods. MapZero can find high-quality mappings very quickly when the feasible solution space is rather small and all other compilers fail. We also demonstrate the scalability and broad applicability of our framework.
引用
收藏
页码:646 / 659
页数:14
相关论文
共 50 条
  • [41] Tuning coarse-grained reconfigurable architectures towards an application domain
    Oliveira, Julio
    Schweizer, Thomas
    Oppold, Tobias
    Kuhn, Tommy
    Rosenstiel, Wolfgang
    ReConFig 2006: Proceedings of the 2006 IEEE International Conference on Reconfigurable Computing and FPGA's, 2006, : 71 - 77
  • [42] Memory Access Optimization in Compilation for Coarse-Grained Reconfigurable Architectures
    Kim, Yongjoo
    Lee, Jongeun
    Shrivastava, Aviral
    Paek, Yunheung
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2011, 16 (04)
  • [43] A Survey on Coarse-Grained Reconfigurable Architectures From a Performance Perspective
    Podobas, Artur
    Sano, Kentaro
    Matsuoka, Satoshi
    IEEE ACCESS, 2020, 8 : 146719 - 146743
  • [44] Still Image Processing on Coarse-Grained Reconfigurable Array Architectures
    Matthias Hartmann
    Vasileios (Vassilis) Pantazis
    Tom Vander Aa
    Mladen Berekovic
    Christian Hochberger
    Journal of Signal Processing Systems, 2010, 60 : 225 - 237
  • [45] Fault Recovery Time Analysis for Coarse-Grained Reconfigurable Architectures
    Lee, Ganghee
    Cetin, Ediz
    Diessel, Oliver
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (02)
  • [46] High-level Programming of Coarse-Grained Reconfigurable Architectures
    Zain-ul-Abdin
    FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 713 - 714
  • [47] A Configuration Data Multicasting Method for Coarse-Grained Reconfigurable Architectures
    Kojima, Takuya
    Amano, Hideharu
    2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, : 239 - 242
  • [48] Evolution in architectures and programming methodologies of coarse-grained reconfigurable computing
    Zain-ul-Abdin
    Svensson, Bertil
    MICROPROCESSORS AND MICROSYSTEMS, 2009, 33 (03) : 161 - 178
  • [49] A Mapping Algorithm for Embedded Coarse-grained Reconfigurable Processor
    Yu, Sudong
    Liu, Leibo
    Yin, Shouyi
    Wei, Shaojun
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEM, 2008, : 1230 - 1234
  • [50] Mapping Tasks to a Dynamically Reconfigurable Coarse-Grained Array
    Moghaddam, Mansureh S.
    Paul, Kolin
    Balakrishnan, M.
    2014 IEEE 22ND ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2014), 2014, : 33 - 33