Iris: Automatic Generation of Efficient Data Layouts for High Bandwidth Utilization

被引:2
作者
Soldavini, Stephanie [1 ]
Sciuto, Donatella [1 ]
Pilato, Christian [1 ]
机构
[1] Politecn Milan, Milan, Italy
来源
2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC | 2023年
基金
欧盟地平线“2020”;
关键词
D O I
10.1145/3566097.3567892
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimizing data movements is becoming one of the biggest challenges in heterogeneous computing to cope with data deluge and, consequently, big data applications. When creating specialized accelerators, modern high-level synthesis (HLS) tools are increasingly efficient in optimizing the computational aspects, but data transfers have not been adequately improved. To combat this, novel architectures such as High-Bandwidth Memory with wider data busses have been developed so that more data can be transferred in parallel. Designers must tailor their hardware/software interfaces to fully exploit the available bandwidth. HLS tools can automate this process, but the designer must follow strict coding-style rules. If the bus width is not evenly divisible by the data width (e.g., when using custom-precision data types) or if the arrays are not power-of-two length, the HLS-generated accelerator will likely not fully utilize the available bandwidth, demanding even more manual effort from the designer. We propose a methodology to automatically find and implement a data layout that, when streamed between memory and an accelerator, uses a higher percentage of the available bandwidth than a naive or HLS-optimized design. We borrow concepts from multiprocessor scheduling to achieve such high efficiency.
引用
收藏
页码:172 / 177
页数:6
相关论文
共 50 条
[41]   AN ALL DIGITAL RECEIVER ARCHITECTURE FOR BANDWIDTH EFFICIENT TRANSMISSION AT HIGH DATA RATES [J].
ASCHEID, G ;
OERDER, M ;
STAHL, J ;
MEYR, H .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1989, 37 (08) :804-813
[42]   Two Tier Cluster Based Data Aggregation (TTCDA) for Efficient Bandwidth Utilization in Wireless Sensor Network [J].
Dnyaneshwar Mantri ;
Neeli Rashmi Prasad ;
Ramjee Prasad .
Wireless Personal Communications, 2014, 75 :2589-2606
[43]   Two Tier Cluster Based Data Aggregation (TTCDA) for Efficient Bandwidth Utilization in Wireless Sensor Network [J].
Mantri, Dnyaneshwar ;
Prasad, Neeli Rashmi ;
Prasad, Ramjee .
WIRELESS PERSONAL COMMUNICATIONS, 2014, 75 (04) :2589-2606
[44]   GENERATION AND UTILIZATION OF SOCIAL DATA [J].
BRADBURN, NM .
ETHICS, 1973, 84 (01) :22-37
[45]   On the automatic generation of efficient computational models [J].
Nabhan, Tarek M., 1600, Taylor & Francis Ltd, Basingstoke, United Kingdom (26)
[46]   Optimization of Bandwidth Utilization in Data Center Network with SDN [J].
Li, Yaofang ;
Wu, Bin ;
Xiao, Jie ;
Dai, Chunxia .
PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING (AMCCE 2017), 2017, 118 :559-567
[47]   Bandwidth Partition and Allocation for Efficient Spectrum Utilization in Cognitive Communications [J].
Huang, Song ;
Yuan, Di ;
Ephremides, Anthony .
JOURNAL OF COMMUNICATIONS AND NETWORKS, 2019, 21 (04) :353-364
[48]   A SPARSE REPRESENTATION FOR EFFICIENT BANDWIDTH UTILIZATION IN WIRELESS SURVEILLANCE NETWORKS [J].
Abdel-Hakim, Alaa E. .
2013 9TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO 2013): TODAY INFORMATION SOCIETY WHAT'S NEXT?, 2014, :103-108
[49]   SOC test architecture design for efficient utilization of test bandwidth [J].
Goel, SK ;
Marinissen, EJ .
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2003, 8 (04) :399-429
[50]   Increasing interposer utilization: A scalable, energy efficient and high bandwidth multicore-multichip integration solution [J].
Rochester Institute of Technology, Rochester ;
NY, United States .
Int. Green Sustain. Comput. Conf., IGSC, 1600, (1-6)