SmarCo: An Efficient Many-Core Processor for High-Throughput Applications in Datacenters

被引:11
|
作者
Fan, Dongrui [1 ,2 ]
Li, Wenming [1 ]
Ye, Xiaochun [1 ]
Wang, Da [1 ,3 ]
Zhang, Hao [1 ,3 ]
Tang, Zhimin [1 ]
Sun, Ninghui [1 ]
机构
[1] Chinese Acad Sci, ICT, SKL Comp Architecture, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing, Peoples R China
[3] SmarCo Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/HPCA.2018.00057
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Fast-growing high-throughput applications, such as web services, are characterized by high-concurrency processing, hard real-time response, and high-bandwidth memory access. The newly-born applications bring severe challenges to processors in datacenters, both in concurrent processing performance and energy efficiency. To offer a satisfactory quality of services, it is of critical importance to meet these newly emerging demands of high-throughput applications in the future datacenters in a more efficient way. In this paper, we propose a novel architecture, called SmarCo, which allows high-throughput applications to be processed more efficiently in datacenters. Based on the dominant characteristics of high-throughput applications, we implement large-scale many-core architecture with in-pair threads to support high-concurrency processing; we also introduce a hierarchical ring topology and laxity-aware task scheduler to guarantee hard real-time response; furthermore, we propose high-throughput datapath to improve memory access efficiency. We verify the efficiency of SmarCo by using simulators, large-scale FPGA and prototype with TSMC 40-nm technology node. The experimental results show that, compared to Intel Xeon E7-8890V4, SmarCo achieves 10.11X performance improvement and 6.95X energy-efficiency improvement with higher throughput and a better guarantee of real-time response.
引用
收藏
页码:596 / 607
页数:12
相关论文
共 50 条
  • [1] Toward High-Throughput Algorithms on Many-Core Architectures
    Orozco, Daniel
    Garcia, Elkin
    Khan, Rishi
    Livingston, Kelly
    Gao, Guang R.
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2012, 8 (04)
  • [2] Energy Efficient NoC Router for High Throughput Applications in Many-core GPUs
    Bansal, Shrestha
    Mondal, Hemanta Kumar
    Gade, Sri Harsha
    Deb, Sujay
    2017 3RD IEEE INTERNATIONAL SYMPOSIUM ON NANOELECTRONIC AND INFORMATION SYSTEMS (INIS), 2017, : 50 - 55
  • [3] Parallel simulation of many-core processor and many-core clusters
    Lü, Huiwei
    Cheng, Yuan
    Bai, Lu
    Chen, Mingyu
    Fan, Dongrui
    Sun, Ninghui
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (05): : 1110 - 1117
  • [4] Design of A Scalable Many-Core Processor for Embedded Applications
    Chien, Hsiao-Wei
    Lai, Jyun-Long
    Wu, Chao-Chieh
    Huang, Chih-Tsun
    Hsu, Ting-Shuo
    Liou, Jing-Jia
    2015 20TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2015, : 24 - 25
  • [5] A Many-core Parallelizing Processor
    Porada, Katarzyna
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 875 - 877
  • [6] Efficient Workload Balance Technology on Many-core Crypto Processor
    Dai Zibin
    Yin Anqi
    Qu Tongzhou
    Nan Longmei
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (02) : 369 - 376
  • [7] Design and Analysis of a Many-Core Processor Architecture for Multimedia Applications
    Lai, Jyu-Yuan
    Chen, Po-Yu
    Hsu, Ting-Shuo
    Huang, Chih-Tsun
    Liou, Jing-Jia
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [8] High-throughput configurable motion estimation processor core for video applications
    Lai, Yeon-Kang
    Chen, Lien-Fei
    JAPANESE JOURNAL OF APPLIED PHYSICS PART 1-REGULAR PAPERS BRIEF COMMUNICATIONS & REVIEW PAPERS, 2006, 45 (4B): : 3330 - 3335
  • [9] High-throughput configurable motion estimation processor core for video applications
    Lai, Yeong-Kang
    Chen, Lien-Fei
    Japanese Journal of Applied Physics, Part 1: Regular Papers and Short Notes and Review Papers, 2006, 45 (4 B): : 3330 - 3335
  • [10] A Many-core Architecture for an Ensemble Ternary Neural Network Toward High-Throughput Inference
    Kayanoma, Ryota
    Jinguji, Akira
    Nakahara, Hiroki
    2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 446 - 453