Automated SmartNIC Offloading Insights for Network Functions

被引:21
作者
Qiu, Yiming [1 ]
Xing, Jiarong [1 ]
Hsu, Kuo-Feng [1 ]
Kang, Qiao [1 ]
Liu, Ming [2 ]
Narayana, Srinivas [3 ]
Chen, Ang [1 ]
机构
[1] Rice Univ, Houston, TX 77005 USA
[2] UW Madison, VMware, Madison, WI USA
[3] Rutgers State Univ, New Brunswick, NJ USA
来源
PROCEEDINGS OF THE 28TH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2021 | 2021年
关键词
Network function; SmartNIC; Machine learning; MODEL;
D O I
10.1145/3477132.3483583
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The gap between CPU and networking speeds has motivated the development of SmartNICs for NF (network functions) offloading. However, offloading performance is predicated upon intricate knowledge about SmartNIC hardware and careful hand-tuning of the ported programs. Today, developers cannot easily reason about the offloading performance or the effectiveness of different porting strategies without resorting to a trial-and-error approach. Clara is an automated tool that improves the productivity of this workflow by generating offloading insights. Our tool can a) analyze a legacy NF in its unported form, predicting its performance characteristics on a SmartNIC (e.g., compute vs. memory intensity); and b) explore and suggest porting strategies for the given NF to achieve higher performance. To achieve these goals, Clara uses program analysis techniques to extract NF features, and combines them with machine learning techniques to handle opaque SmartNIC details. Our evaluation using Click NF programs on a Netronome Smart-NIC shows that Clara achieves high accuracy in its analysis, and that its suggested porting strategies lead to significant performance improvements.
引用
收藏
页码:772 / 787
页数:16
相关论文
共 71 条
  • [1] [Anonymous], 2005, P 2005 ACM IEEE C SU
  • [2] [Anonymous], MELLANOX INNOVA 2 FL
  • [3] Cross-Architecture Performance Prediction (XAPP) Using CPU Code to Predict GPU Performance
    Ardalani, Newsha
    Lestourgeon, Clint
    Sankaralingam, Karthikeyan
    Zhu, Xiaojin
    [J]. PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 725 - 737
  • [4] autokeras, Autokeras supported tasks
  • [5] LogCA: A Performance Model for Hardware Accelerators
    Bin Altaf, Muhammad Shoaib
    Wood, David A.
    [J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2015, 14 (02) : 132 - 135
  • [6] Bysiek M., 2016, P 6 WORKSH PYTH HIGH
  • [7] CERE: LLVM-Based Code let Extractor and REplayer for Piecewise Benchmarking and Optimization
    Castro, Pablo De Oliveira
    Akel, Chadi
    Petit, Eric
    Popov, Mihail
    Jalby, William
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2015, 12 (01)
  • [8] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [9] Chen TQ, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P579
  • [10] λ-NIC: Interactive Serverless Compute on SmartNICs
    Choi, Sean
    Shahbaz, Muhammad
    Prabhakar, Balaji
    Rosenblum, Mendel
    [J]. PROCEEDINGS OF THE 2019 ACM SIGCOMM CONFERENCE POSTERS AND DEMOS (SIGCOMM '19), 2019, : 151 - 152