Machine Learning Computers With Fractal von Neumann Architecture

Cited by: 10
Authors
Zhao, Yongwei [1, 2, 3]
Fan, Zhe [1, 2, 3]
Du, Zidong [1, 3]
Zhi, Tian [1, 3]
Li, Ling [4]
Guo, Qi [1]
Liu, Shaoli [1, 3]
Xu, Zhiwei [1, 2]
Chen, Tianshi [1, 3]
Chen, Yunji [1, 2, 5]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Cambricon Technol, Beijing, Peoples R China
[4] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
[5] CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai Res Ctr Brain Sci & Brain Inspired Intel, Inst Brain Intelligence Technol, Zhangjiang Lab BIT,ZfLab, Beijing, Peoples R China
Funding
Beijing Natural Science Foundation;
Keywords
Machine learning; Computers; Fractals; Programming; Computer architecture; Graphics processing units; Matrix decomposition; architecture; neural networks; programming efficiency;
DOI
10.1109/TC.2020.2982159
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Machine learning techniques are pervasive tools for emerging commercial applications, and many dedicated machine learning computers at different scales have been deployed in embedded devices, servers, and data centers. Currently, most machine learning computer architectures still focus on optimizing performance and energy efficiency rather than programming productivity. However, with the fast development of silicon technology, programming productivity, including both programming itself and software stack development, has replaced performance and power efficiency as the main factor hindering the application of machine learning computers. In this article, we propose Cambricon-F, a series of homogeneous, sequential, multi-layer, layer-similar machine learning computers with the same ISA. A Cambricon-F machine has a fractal von Neumann architecture to iteratively manage its components: it has a von Neumann architecture, and its processing components (sub-nodes) are themselves Cambricon-F machines with a von Neumann architecture and the same ISA. Since Cambricon-F instances at different scales can share the same software stack on their common ISA, Cambricon-F can significantly improve programming productivity. Moreover, we address four major challenges in Cambricon-F architecture design, which allows Cambricon-F to achieve high efficiency. We implement two Cambricon-F instances at different scales, i.e., Cambricon-F100 and Cambricon-F1. Compared to GPU-based machines (DGX-1 and 1080Ti), the Cambricon-F instances achieve 2.82x and 5.14x better performance and 8.37x and 11.39x better efficiency on average, with 74.5 and 93.8 percent smaller area costs, respectively. We further propose Cambricon-FR, which enhances the Cambricon-F machine learning computers to flexibly and efficiently support all the fractal operations with a reconfigurable fractal instruction set architecture. Compared to the Cambricon-F instances, Cambricon-FR machines achieve 1.96x and 2.49x better performance on average. Most importantly, Cambricon-FR computers reduce code length by a factor of 5.83, thus significantly improving programming productivity.
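The fractal idea in the abstract (a machine whose sub-nodes are smaller machines accepting the same instructions) can be illustrated with a toy sketch. This is not the Cambricon-F ISA or its decomposition scheme; the class `FractalNode`, the `vector_add` "instruction", and the `build` helper are all hypothetical names chosen for illustration, and the even chunking is an assumption.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FractalNode:
    """Hypothetical node: a controller plus sub-nodes of the same type."""
    name: str
    children: List["FractalNode"] = field(default_factory=list)

    def vector_add(self, a: List[float], b: List[float]) -> List[float]:
        """The same illustrative 'instruction' is accepted at every level."""
        if len(a) != len(b):
            raise ValueError("operands must have equal length")
        # Leaf node (or trivially small input): compute locally.
        if not self.children or len(a) <= 1:
            return [x + y for x, y in zip(a, b)]
        # Inner node: split operands into contiguous chunks, issue the
        # same instruction to each child, then concatenate the results.
        n = len(self.children)
        chunk = -(-len(a) // n)  # ceiling division
        out: List[float] = []
        for i, child in enumerate(self.children):
            lo, hi = i * chunk, min((i + 1) * chunk, len(a))
            if lo >= hi:
                break  # fewer chunks than children; remaining children idle
            out.extend(child.vector_add(a[lo:hi], b[lo:hi]))
        return out


def build(levels: int, fanout: int, name: str = "root") -> FractalNode:
    """Build a machine with `levels` layers; each layer has the same shape."""
    if levels <= 1:
        return FractalNode(name)
    kids = [build(levels - 1, fanout, f"{name}.{i}") for i in range(fanout)]
    return FractalNode(name, kids)
```

Under these assumptions, a one-level machine `build(1, 4)` and a three-level machine `build(3, 4)` run the same caller code and return the same result, which is the sense in which instances at different scales could share one software stack.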
Pages: 998 - 1014
Number of pages: 17