P4GPU: Accelerate Packet Processing of a P4 Program with a CPU-GPU Heterogeneous Architecture

Cited by: 15

Authors:

Li, Peilong [1]

Luo, Yan [1]

Affiliations:

[1] Univ Massachusetts, Dept Elect & Comp Engn, Lowell, MA 01854 USA

Source:

PROCEEDINGS OF THE 2016 SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'16) | 2016

Funding:

U.S. National Science Foundation (NSF);

Keywords:

GPU; Heterogeneous; Packet Processing; P4;

DOI:

10.1145/2881025.2889480

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

The P4 language is an emerging domain-specific language for describing data plane processing at a network device. P4 has been mapped to a wide range of forwarding devices, including NPUs, programmable NICs, and FPGAs, but not to the General-Purpose Graphics Processing Unit (GPGPU), a salient parallel architecture for processing network flows. In this work, we design a heterogeneous architecture with both CPU and GPU as a P4 programming target, and present a toolset to map a P4 program onto the proposed architecture. Our evaluation shows that a P4 program can deliver promising performance on such an architecture by parallelizing its "match+action" engine with the GPGPU accelerator. The experimental results show that the auto-configured GPU kernels achieve scalable lookup and classification speeds: the prototype system can reach up to 580 Gbps for IP lookups (64-byte packets) and 60 million classifications per second for 4k firewall rules.

Pages: 125-126

Page count: 2

50 items in total

[1] P4GPU: Acceleration of Programmable Data Plane Using a CPU-GPU Heterogeneous Architecture
Li, Peilong
Luo, Yan
2016 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (HPSR), 2016: 168-175
[2] Latency-Aware Packet Processing on CPU-GPU Heterogeneous Systems
Maghazeh, Arian
Bordoloi, Unmesh D.
Dastgeer, Usman
Andrei, Alexandru
Eles, Petru
Peng, Zebo
PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017
[3] Heterogeneous Cache Hierarchy Management for Integrated CPU-GPU Architecture
Wen, Hao
Zhang, Wei
2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2019
[4] A Scalable and Portable Approach to Accelerate Hybrid HPL on Heterogeneous CPU-GPU Clusters
Shi, Rong
Potluri, Sreeram
Hamidouche, Khald
Lu, Xiaoyi
Tomko, Karen
Panda, Dhabaleswar K.
2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013
[5] Asynchronous Processing for Latent Fingerprint Identification on Heterogeneous CPU-GPU Systems
Sanchez-Fernandez, Andres J.
Romero, Luis F.
Peralta, Daniel
Medina-Perez, Miguel Angel
Saeys, Yvan
Herrera, Francisco
Tabik, Siham
IEEE ACCESS, 2020, 8(08): 124236-124253
[6] Heterogeneous blocked CPU-GPU accelerate scheme for large scale extreme learning machine
Li, Shijie
Niu, Xin
Dou, Yong
Lv, Qi
Wang, Yueqing
NEUROCOMPUTING, 2017, 261: 153-163
[7] Reducing Inter-Application Interferences in Integrated CPU-GPU Heterogeneous Architecture
Wen, Hao
Zhang, Wei
2018 IEEE 36TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2018: 278-281
[8] Parallel String Similarity Join Approach Based on CPU-GPU Heterogeneous Architecture
Xu K.
Nie T.
Shen D.
Kou Y.
Yu G.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58(03): 598-608
[9] Implementation and Analysis of GNSS Software Receiver on Embedded CPU-GPU Heterogeneous Architecture
Park, Kwi Woo
Jang, Woo Jin
Park, Chansik
Kim, Sunwoo
Lee, Min Jun
PROCEEDINGS OF THE 29TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2016), 2016: 70-76
[10] Design Space Exploration for Partitioning Dataflow Program on CPU-GPU Heterogeneous System
Bloch, Aurelien
Casale-Brunet, Simone
Mattavelli, Marco
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95(10): 1219-1229