Henna: Hierarchical Machine Learning Inference in Programmable Switches

被引:8
|
作者
Tanyi-Jong Akem, Aristide [1 ,2 ]
Butun, Beyza [1 ,2 ]
Gucciardo, Michele [1 ]
Fiore, Marco [1 ]
机构
[1] IMDEA Networks Inst, Madrid, Spain
[2] Univ Carlos III Madrid, Madrid, Spain
来源
PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON NATIVE NETWORK INTELLIGENCE, NATIVENI 2022 | 2022年
关键词
Programmable switch; machine learning; in-switch inference; P4;
D O I
10.1145/3565009.3569520
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent proliferation of programmable network equipment has opened up new possibilities for embedding intelligence into the data plane. Deploying models directly in the data plane promises to achieve high throughput and low latency inference capabilities that cannot be attained with traditional closed loops involving control-plane operations. Recent efforts have paved the way for the integration of trained machine learning models in resource-constrained programmable switches, yet current solutions have significant limitations that translate into performance barriers when coping with complex inference tasks. In this paper, we present Henna, a first in-switch implementation of a hierarchical classification system. The concept underpinning our solution is that of splitting a difficult classification task into easier cascaded decisions, which can then be addressed with separated and resource-efficient tree-based classifiers. We propose a design of Henna that aligns with the internal organization of the Protocol Independent Switch Architecture (PISA), and integrates state-of-the-art strategies for mapping decision trees to switch hardware. We then implement Henna into a real testbed with off-the-shelf Intel Tofino programmable switches using the P4 language. Experiments with a complex 21-category classification task based on measurement data demonstrate how Henna improves the F1 score of an advanced single-stage model by 21%, while keeping usage of switch resources at 8% on average.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 50 条
  • [31] Disease Inference on Medical Datasets Using Machine Learning and Deep Learning Algorithms
    Chinnaswamy, Arunkumar
    Srinivasan, Ramakrishnan
    Gaurang, Desai Prutha
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 902 - 908
  • [32] Planter: Rapid Prototyping of In-Network Machine Learning Inference
    Zheng, Changgang
    Zang, Mingyuan
    Hong, Xinpeng
    Perreault, Liam
    Bensoussane, Riyad
    Vargaftik, Shay
    Ben-Itzhak, Yaniv
    Zilberman, Noa
    ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2024, 54 (01) : 2 - 20
  • [33] Planter: Rapid Prototyping of In-Network Machine Learning Inference
    Zheng, Changgang
    Zang, Mingyuan
    Hong, Xinpeng
    Perreault, Liam
    Bensoussane, Riyad
    Vargaftik, Shay
    Ben-Itzhak, Yaniv
    Zilberman, Noa
    Computer Communication Review, 2024, 54 (01): : 2 - 20
  • [34] Methods for correcting inference based on outcomes predicted by machine learning
    Wang, Siruo
    McCormick, Tyler H.
    Leek, Jeffrey T.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (48) : 30266 - 30275
  • [35] Real-World Evidence, Causal Inference, and Machine Learning
    Crown, William H.
    VALUE IN HEALTH, 2019, 22 (05) : 587 - 592
  • [36] Resource Sharing and Security Implications on Machine Learning Inference Accelerators
    Silveira, Plinio
    De Rose, Cesar Augusto
    Zorzo, Francisco Avelino
    Xavier, Miguel Gomes
    Milojicic, Dejan
    Chalamalasetti, Sai Rahul
    Serebryakov, Sergey
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 59 - 67
  • [37] Regularization, Bayesian Inference, and Machine Learning Methods for Inverse Problems
    Mohammad-Djafari, Ali
    ENTROPY, 2021, 23 (12)
  • [38] BrainNET: Inference of Brain Network Topology Using Machine Learning
    Murugesan, Gowtham Krishnan
    Ganesh, Chandan
    Nalawade, Sahil
    Davenport, Elizabeth M.
    Wagner, Ben
    Kim, Won Hwa
    Maldjian, Joseph A.
    BRAIN CONNECTIVITY, 2020, 10 (08) : 422 - 435
  • [39] Inference of Personal Attributes from Tweets Using Machine Learning
    Yo, Take
    Sasahara, Kazutoshi
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3168 - 3174
  • [40] Machine Learning Inference on Serverless Platforms Using Model Decomposition
    Gallego, Adrien
    Odyurt, Uraz
    Cheng, Yi
    Wang, Yuandou
    Zhao, Zhiming
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,