IIsy: Hybrid In-Network Classification Using Programmable Switches

被引:16
作者
Zheng, Changgang [1 ]
Xiong, Zhaoqi [2 ]
Bui, Thanh T. [2 ]
Kaupmees, Siim [2 ]
Bensoussane, Riyad [1 ]
Bernabeu, Antoine [3 ]
Vargaftik, Shay [4 ]
Ben-Itzhak, Yaniv [4 ]
Zilberman, Noa [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England
[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge CB3 0FD, England
[3] Nantes Univ, Ecole Cent Nantes, CNRS, LS2N,UMR 6004, F-44400 Nantes, France
[4] VMware Res Grp Broadcom, IL-6997801 Tel Aviv, Israel
关键词
Load modeling; Feature extraction; Switches; Throughput; Performance evaluation; Computational modeling; Data models; In-network computing; machine learning; P4; programmable data planes; software defined networks;
D O I
10.1109/TNET.2024.3364757
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The soaring use of machine learning leads to increasing processing demands. As data volume keeps growing, providing classification services with good machine learning performance, high throughput, low latency, and minimal equipment overheads becomes a challenge. Offloading machine learning tasks to network switches can be a scalable solution to this problem, providing high throughput and low latency. However, network devices are resource constrained, and lack support for machine learning functionality. In this paper, we introduce IIsy - a novel mapping tool of machine learning classification models to off-the-shelf switches. Using an efficient encoding algorithm, IIsy enables fitting a range of classification models on switches, co-existing with standard switch functionality. To overcome resource constraints, IIsy adopts a hybrid approach for ensemble models, running a small model on a switch and a large model on the backend. The evaluation shows that IIsy achieves near-optimal classification results, within minimum resource overheads, and while reducing the load on the backend by 70% for data-intensive use cases.
引用
收藏
页码:2555 / 2570
页数:16
相关论文
共 79 条
[1]  
Alrashdi I, 2019, 2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P305, DOI 10.1109/CCWC.2019.8666450
[2]  
[Anonymous], 2020, InsidePacket Extends SONiC Use Cases, Enabling New Edge-Cloud Network Services
[3]  
[Anonymous], 2000, Darpa information survivability conference and exposition
[4]  
Awan Ammar Ahmad, 2017, P MACH LEARN HPC ENV, P1
[5]  
AWS, 2019, New-TLS Termination for Network Load Balancers
[6]   Risk and Return in High-Frequency Trading [J].
Baron, Matthew ;
Brogaard, Jonathan ;
Hagstromer, Bjorn ;
Kirilenko, Andrei .
JOURNAL OF FINANCIAL AND QUANTITATIVE ANALYSIS, 2019, 54 (03) :993-1024
[7]   FlowLens: Enabling Efficient Flow Classification for ML-based Network Security Applications [J].
Barradas, Diogo ;
Santos, Nuno ;
Rodrigues, Luis ;
Signorello, Salvatore ;
Ramos, Fernando M., V ;
Madeira, Andre .
28TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2021), 2021,
[8]   FINN-R: An End-to-End Deep-Learning Framework for Fast Exploration of Quantized Neural Networks [J].
Blott, Michaela ;
Preusser, Thomas B. ;
Fraser, Nicholas J. ;
Gambardella, Giulio ;
O'Brien, Kenneth ;
Umuroglu, Yaman ;
Leeser, Miriam ;
Vissers, Kees .
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2018, 11 (03)
[9]  
Boroumand A, 2018, ACM SIGPLAN NOTICES, V53, P316, DOI [10.1145/3296957.3173177, 10.1145/3173162.3173177]
[10]   Programming Protocol-Independent Packet Processors [J].
Bosshart, Pat ;
Daly, Dan ;
Gibb, Glen ;
Izzard, Martin ;
McKeown, Nick ;
Rexford, Jennifer ;
Schlesinger, Cole ;
Talayco, Dan ;
Vahdat, Amin ;
Varghese, George ;
Walker, David .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2014, 44 (03) :87-95