P4GPU: Accelerate Packet Processing of a P4 Program with a CPU-GPU Heterogeneous Architecture

Cited by: 15
Authors
Li, Peilong [1]
Luo, Yan [1]
Affiliation
[1] Univ Massachusetts, Dept Elect & Comp Engn, Lowell, MA 01854 USA
Source
PROCEEDINGS OF THE 2016 SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'16) | 2016
Funding
National Science Foundation (USA);
Keywords
GPU; Heterogeneous; Packet Processing; P4;
DOI
10.1145/2881025.2889480
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The P4 language is an emerging domain-specific language for describing the data plane processing at a network device. P4 has been mapped to a wide range of forwarding devices including NPUs, programmable NICs and FPGAs, except for General Purpose Graphics Processing Unit (GPGPU) which is a salient parallel architecture for processing network flows. In this work, we design a heterogeneous architecture with both CPU and GPU as a P4 programming target, and present a toolset to map a P4 program onto the proposed architecture. Our evaluation reveals that a P4 program can render promising performance on such architecture by parallelizing its "match+action" engine with the GPGPU accelerator. The experiment results show that the auto-configured GPU kernels achieve scalable lookup and classification speeds: the prototype system can reach up to 580 Gbps for IP lookups (64-byte packets) and 60 million classifications per second for 4k firewall rules, respectively.
Pages: 125-126
Page count: 2
Related papers
50 in total
  • [1] P4GPU: Acceleration of Programmable Data Plane Using a CPU-GPU Heterogeneous Architecture
    Li, Peilong
    Luo, Yan
    2016 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (HPSR), 2016, : 168 - 175
  • [2] Latency-Aware Packet Processing on CPU-GPU Heterogeneous Systems
    Maghazeh, Arian
    Bordoloi, Unmesh D.
    Dastgeer, Usman
    Andrei, Alexandru
    Eles, Petru
    Peng, Zebo
    PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [3] Heterogeneous Cache Hierarchy Management for Integrated CPU-GPU Architecture
    Wen, Hao
    Zhang, Wei
    2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2019,
  • [4] A Scalable and Portable Approach to Accelerate Hybrid HPL on Heterogeneous CPU-GPU Clusters
    Shi, Rong
    Potluri, Sreeram
    Hamidouche, Khald
    Lu, Xiaoyi
    Tomko, Karen
    Panda, Dhabaleswar K.
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [5] Asynchronous Processing for Latent Fingerprint Identification on Heterogeneous CPU-GPU Systems
    Sanchez-Fernandez, Andres J.
    Romero, Luis F.
    Peralta, Daniel
    Medina-Perez, Miguel Angel
    Saeys, Yvan
    Herrera, Francisco
    Tabik, Siham
    IEEE ACCESS, 2020, 8 (08): : 124236 - 124253
  • [6] Heterogeneous blocked CPU-GPU accelerate scheme for large scale extreme learning machine
    Li, Shijie
    Niu, Xin
    Dou, Yong
    Lv, Qi
    Wang, Yueqing
    NEUROCOMPUTING, 2017, 261 : 153 - 163
  • [7] Reducing Inter-Application Interferences in Integrated CPU-GPU Heterogeneous Architecture
    Wen, Hao
    Zhang, Wei
    2018 IEEE 36TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2018, : 278 - 281
  • [8] Parallel String Similarity Join Approach Based on CPU-GPU Heterogeneous Architecture
    Xu K.
    Nie T.
    Shen D.
    Kou Y.
    Yu G.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (03): : 598 - 608
  • [9] Implementation and Analysis of GNSS Software Receiver on Embedded CPU-GPU Heterogeneous Architecture
    Park, Kwi Woo
    Jang, Woo Jin
    Park, Chansik
    Kim, Sunwoo
    Lee, Min Jun
    PROCEEDINGS OF THE 29TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2016), 2016, : 70 - 76
  • [10] Design Space Exploration for Partitioning Dataflow Program on CPU-GPU Heterogeneous System
    Bloch, Aurelien
    Casale-Brunet, Simone
    Mattavelli, Marco
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95 (10): : 1219 - 1229