Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions

被引:23
|
作者
Fan, Dong-Rui [1 ]
Yuan, Nan [1 ]
Zhang, Jun-Chao [1 ]
Zhou, Yong-Bin [1 ]
Lin, Wei [1 ]
Song, Feng-Long [1 ]
Ye, Xiao-Chun [1 ]
Huang, He [1 ]
Yu, Lei [1 ]
Long, Guo-Ping [1 ]
Zhang, Hao [1 ]
Liu, Lei [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Comp Syst & Architecture, Beijing 100190, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
many-core; parallel computing; multithread; data communication; thread synchronization; runtime system;
D O I
10.1007/s11390-009-9295-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Moore's law will grant computer architects ever more transistors for the foreseeable future, and the challenge is how to use them to deliver efficient performance and flexible programmability. We propose a many-core architecture, Godson-T, to attack this challenge. On the one hand, Godson-T features a region-based cache coherence protocol, asynchronous data transfer agents and hardware-supported synchronization mechanisms, to provide full potential for the high efficiency of the on-chip resource utilization. On the other hand, Godson-T features a highly efficient runtime system, a Pthreads-like programming model, and versatile parallel libraries, which make this many-core design flexibly programmable. This hardware/software cooperating design methodology bridges the high-end computing with mass programmers. Experimental evaluations are conducted on a cycle-accurate simulator of Godson-T. The results show that the proposed architecture has good scalability, fast synchronization, high computational efficiency, and flexible programmability.
引用
收藏
页码:1061 / 1073
页数:13
相关论文
共 50 条
  • [21] Reducing the burden of parallel loop schedulers for many-core processors
    Arif, Mahwish
    Vandierendonck, Hans
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (13):
  • [22] Study on Fine-grained Synchronization in Many-Core Architecture
    Yu, Lei
    Liu, Zhiyong
    Fan, Dongrui
    Song, Fenglong
    Zhang, Junchao
    Yuan, Nan
    SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 524 - 529
  • [23] SAM: A Haskell Parallel Programming Model for Many-Core Systems
    Kim, Yeoneo
    Cheon, Junseok
    Liu, Xiao
    Byun, Sugwoo
    Woo, Gyun
    PROCEEDINGS OF 4TH IEEE INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION 2018 ( IEEE ICASI 2018 ), 2018, : 822 - 825
  • [24] Scaling Graph Community Detection on the Tilera Many-core Architecture
    Chavarria-Miranda, Daniel
    Halappanavar, Mahantesh
    Kalyanaraman, Ananth
    2014 21ST INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2014,
  • [25] Analysis of the Parallel Programming Models in Haskell for Many-Core Systems
    Liu, Xiao
    Kim, Yeoneo
    Cheon, Junseok
    Byun, Sugwoo
    Woo, Gyun
    PROCEEDINGS OF 4TH IEEE INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION 2018 ( IEEE ICASI 2018 ), 2018, : 838 - 841
  • [26] Circuit Modeling for Practical Many-core Architecture Design Exploration
    Truong, Dean N.
    Baas, Bevan M.
    PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, : 627 - 628
  • [27] Efficient DVFS to Prevent Hard Faults for Many-Core Architectures
    Lai, Zhiquan
    Zhao, Baokang
    Su, Jinshu
    INFORMATION AND COMMUNICATION TECHNOLOGY, 2014, 8407 : 674 - 679
  • [28] SunwayImg: A Parallel Image Processing Library for the Sunway Many-Core Processor
    Liu, Rui
    Liu, Yi
    Zhao, Meiting
    Song, Kaida
    Qian, Depei
    IEEE ACCESS, 2019, 7 : 128555 - 128569
  • [29] A Parallel Bloom Filter String Searching Algorithm on a Many-core Processor
    Ong, WenMei
    Baskaran, Vishnu Monn
    Chong, Poh Kit
    Ettikan, K. K.
    Ong, Keh Kok Y.
    2013 IEEE CONFERENCE ON OPEN SYSTEMS (ICOS), 2013, : 1 - +
  • [30] DAG Scheduling Algorithm for a Cluster-Based Many-Core Architecture
    Kitagawa, Yuto
    Ishigooka, Tasuku
    Azumi, Takuya
    2018 IEEE 16TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2018), 2018, : 150 - 157