A Many-core Architecture for In-Memory Data Processing

被引:19
|
作者
Agrawal, Sandeep R. [1 ]
Idicula, Sam [1 ]
Raghavan, Arun [1 ]
Vlachos, Evangelos [1 ]
Govindaraju, Venkatraman [1 ]
Varadarajan, Venkatanathan [1 ]
Balkesen, Cagri [1 ]
Giannikis, Georgios [1 ]
Roth, Charlie [1 ]
Agarwal, Nipun [1 ]
Sedlar, Eric [1 ]
机构
[1] Oracle Labs, Burlington, MA 01803 USA
来源
50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO) | 2017年
关键词
Accelerator; Big data; Microarchitecture; Databases; DPU; Low power; Analytics Processor; In-Memory Data Processing; Data Movement System; SUPPORT; VISION;
D O I
10.1145/3123939.3123985
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For many years, the highest energy cost in processing has been data movement rather than computation, and energy is the limiting factor in processor design [21]. As the data needed for a single application grows to exabytes [56], there is clearly an opportunity to design a bandwidth-optimized architecture for big data computation by specializing hardware for data movement. We present the Data Processing Unit or DPU, a shared memory many-core that is specifically designed for high bandwidth analytics workloads. The DPU contains a unique Data Movement System (DMS), which provides hardware acceleration for data movement and partitioning operations at the memory controller that is sufficient to keep up with DDR bandwidth. The DPU also provides acceleration for core to core communication via a unique hardware RPC mechanism called the Atomic Transaction Engine. Comparison of a DPU chip fabricated in 40nm with a Xeon processor on a variety of data processing applications shows a 3x-15x performance per watt advantage.
引用
收藏
页码:245 / 258
页数:14
相关论文
共 50 条
  • [1] FROM GPGPU TO MANY-CORE: NVIDIA FERMI AND INTEL MANY INTEGRATED CORE ARCHITECTURE
    Heinecke, Alexander
    Klemm, Michael
    Bungartz, Hans-Joachim
    COMPUTING IN SCIENCE & ENGINEERING, 2012, 14 (02) : 78 - 83
  • [2] RAPID: In-Memory Analytical Query Processing Engine with Extreme Performance per Watt
    Balkesen, Cagri
    Kunal, Nitin
    Giannikis, Georgios
    Fender, Pit
    Sundara, Seema
    Schmidt, Felix
    Wen, Jarod
    Agrawal, Sandeep
    Raghavan, Arun
    Varadarajan, Venkatanathan
    Viswanathan, Anand
    Chandrasekaran, Balakrishnan
    Idicula, Sam
    Agarwal, Nipun
    Sedlar, Eric
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1407 - 1419
  • [3] Stitch: Fusible Heterogeneous Accelerators Enmeshed with Many-Core Architecture for Wearables
    Tan, Cheng
    Karunaratne, Manupa
    Mitra, Tulika
    Peh, Li-Shiuan
    2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 575 - 587
  • [4] SPECTRUM: A Software-defined Predictable Many-core Architecture for LTE/5G Baseband Processing
    Venkataramani, Vanchinathan
    Kulkarni, Aditi
    Mitra, Tulika
    Peh, Li-Shiuan
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2020, 19 (05)
  • [5] Architecture and Evaluation of Low Power Many-Core SoC with Two 32-Core Clusters
    Miyamori, Takashi
    Xu, Hui
    Usui, Hiroyuki
    Hosoda, Soichiro
    Sano, Toru
    Yamamoto, Kazumasa
    Kodaka, Takeshi
    Nonogaki, Nobuhiro
    Ozaki, Nau
    Tanabe, Jun
    IEICE TRANSACTIONS ON ELECTRONICS, 2014, E97C (04): : 360 - 368
  • [6] Data Processing and Information Classification-An In-Memory Approach
    Andrighetti, Milena
    Turvani, Giovanna
    Santoro, Giulia
    Vacca, Marco
    Marchesin, Andrea
    Ottati, Fabrizio
    Roch, Massimo Ruo
    Graziano, Mariagrazia
    Zamboni, Maurizio
    SENSORS, 2020, 20 (06)
  • [7] Deca: A Garbage Collection Optimizer for In-Memory Data Processing
    Shi, Xuanhua
    Ke, Zhixiang
    Zhou, Yongluan
    Jin, Hai
    Lu, Lu
    Zhang, Xiong
    He, Ligang
    Hu, Zhenyu
    Wang, Fei
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2019, 36 (01):
  • [8] An Energy Efficient In-Memory Computing Architecture Using Reconfigurable Magnetic Logic Circuits for Big Data Processing
    Gargari, Milad Ashtari
    Eslami, Nima
    Moaiyeri, Mohammad Hossein
    IEEE TRANSACTIONS ON MAGNETICS, 2023, 59 (12) : 1 - 10
  • [9] Enhancing in-memory efficiency for MapReduce-based data processing
    Veiga, Jorge
    Exposito, Roberto R.
    Taboada, Guillermo L.
    Tourino, Juan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 323 - 338
  • [10] Representing a Model for the Anonymization of Big Data Stream Using In-Memory Processing
    Shamsinejad E.
    Banirostam T.
    Pedram M.M.
    Rahmani A.M.
    Annals of Data Science, 2025, 12 (1) : 223 - 252