Machine Learning Computers With Fractal von Neumann Architecture

被引:10
|
作者
Zhao, Yongwei [1 ,2 ,3 ]
Fan, Zhe [1 ,2 ,3 ]
Du, Zidong [1 ,3 ]
Zhi, Tian [1 ,3 ]
Li, Ling [4 ]
Guo, Qi [1 ]
Liu, Shaoli [1 ,3 ]
Xu, Zhiwei [1 ,2 ]
Chen, Tianshi [1 ,3 ]
Chen, Yunji [1 ,2 ,5 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Cambricon Technol, Beijing, Peoples R China
[4] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
[5] CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai Res Ctr Brian Sci & Brain Inspired Intel, Inst Brain Intelligence Technol, Zhangjiang Lab BIT,ZfLab, Beijing, Peoples R China
基金
北京市自然科学基金;
关键词
Machine learning; Computers; Fractals; Programming; Computer architecture; Graphics processing units; Matrix decomposition; architecture; neural networks; programming efficiency;
D O I
10.1109/TC.2020.2982159
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning techniques are pervasive tools for emerging commercial applications and many dedicated machine learning computers on different scales have been deployed in embedded devices, servers, and data centers. Currently, most machine learning computer architectures still focus on optimizing performance and energy efficiency instead of programming productivity. However, with the fast development in silicon technology, programming productivity, including programming itself and software stack development, becomes the vital reason instead of performance and power efficiency that hinders the application of machine learning computers. In this article, we propose Cambricon-F, which is a series of homogeneous, sequential, multi-layer, layer-similar, and machine learning computers with same ISA. A Cambricon-F machine has a fractal von Neumann architecture to iteratively manage its components: it is with von Neumann architecture and its processing components (sub-nodes) are still Cambricon-F machines with von Neumann architecture and the same ISA. Since different Cambricon-F instances with different scales can share the same software stack on their common ISA, Cambricon-Fs can significantly improve the programming productivity. Moreover, we address four major challenges in Cambricon-F architecture design, which allow Cambricon-F to achieve a high efficiency. We implement two Cambricon-F instances at different scales, i.e., Cambricon-F100 and Cambricon-F1. Compared to GPU based machines (DGX-1 and 1080Ti), Cambricon-F instances achieve 2.82x, 5.14x better performance, 8.37x, 11.39x better efficiency on average, with 74.5, 93.8 percent smaller area costs, respectively. We further propose Cambricon-FR, which enhances the Cambricon-F machine learning computers to flexibly and efficiently support all the fractal operations with a reconfigurable fractal instruction set architecture. Compared to the Cambricon-F instances, Cambricon-FR machines achieve 1.96x, 2.49x better performance on average. Most importantly, Cambricon-FR computers are able to save the code length with a factor of 5.83, thus significantly improving the programming productivity.
引用
收藏
页码:998 / 1014
页数:17
相关论文
共 50 条
  • [21] Towards Energy Efficient non-von Neumann Architectures for Deep Learning
    Ganguly, Antara
    Muralidhar, Rajeev
    Singh, Virendra
    PROCEEDINGS OF THE 2019 20TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2019, : 335 - 342
  • [22] Distributed Double Machine Learning with a Serverless Architecture
    Kurz, Malte S.
    COMPANION OF THE ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, ICPE 2021, 2021, : 27 - 33
  • [23] Enabling Machine Learning in Software Architecture Frameworks
    Moin, Armin
    Badii, Atta
    Guennemann, Stephan
    Challenger, Moharram
    2023 IEEE/ACM 2ND INTERNATIONAL CONFERENCE ON AI ENGINEERING - SOFTWARE ENGINEERING FOR AI, CAIN, 2023, : 92 - 93
  • [24] A Survey of Machine Learning for Computer Architecture and Systems
    Wu, Nan
    Xie, Yuan
    ACM COMPUTING SURVEYS, 2023, 55 (03)
  • [25] Machine Learning for Systems
    Litz, Heiner
    Hashemi, Milad
    IEEE MICRO, 2020, 40 (05) : 6 - 7
  • [26] Integrating eye gaze into machine learning using fractal curves
    Newport, Robert Ahadizad
    Liu, Sidong
    Di Ieva, Antonio
    GAZE MEETS MACHINE LEARNING WORKSHOP, VOL 210, 2022, 210 : 113 - 126
  • [27] Memristors Enabled Computing Correlation Parameter In-Memory System: A Potential Alternative to Von Neumann Architecture
    Kundu, Souvik
    Ganganaik, Priyanka B.
    Louis, Jeffry
    Chalamalasetty, Hemanth
    Rao, B. V. V. S. N. Prabhakar
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (06) : 755 - 768
  • [28] A Machine Learning-Driven Threat Hunting Architecture for Protecting Critical Infrastructures
    Lozano, Mario Aragones
    Llopis, Israel Perez
    Alarcon, Alfonso Climente
    Domingo, Manuel Esteve
    2023 19TH INTERNATIONAL CONFERENCE ON THE DESIGN OF RELIABLE COMMUNICATION NETWORKS, DRCN, 2023,
  • [29] Enabling One-Size-Fits-All Compilation Optimization for Inference Across Machine Learning Computers
    Wen, Yuanbo
    Qi Guo
    Du, Zidong
    Xu, Jianxing
    Zhang, Zhenxing
    Xing Hu
    Wei Li
    Rui Zhang
    Chao Wang
    Zhou Xuehai
    Chen, Tianshi
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (09) : 2313 - 2326
  • [30] Threat Hunting Architecture Using a Machine Learning Approach for Critical Infrastructures Protection
    Lozano, Mario Aragones
    Llopis, Israel Perez
    Domingo, Manuel Esteve
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (02)