A Tiny Transformer-Based Anomaly Detection Framework for IoT Solutions

Cited by: 9
Authors
Barbieri, Luca [1]
Brambilla, Mattia [1]
Stefanutti, Mario [2]
Romano, Ciro [2]
De Carlo, Niccolo [2]
Roveri, Manuel [1]
Affiliations
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
[2] Sensoworks, I-00192 Rome, Italy
Source
IEEE OPEN JOURNAL OF SIGNAL PROCESSING | 2023 / Vol. 4
Keywords
Anomaly detection; machine learning; self-attention; knowledge distillation; Internet of Things; transformer; compression; NETWORKS; INTERNET; THINGS
DOI
10.1109/OJSP.2023.3333756
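Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract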
The widespread proliferation of Internet of Things (IoT) devices has driven the development of novel transformer-based Anomaly Detection (AD) tools for accurate monitoring of functionalities in industrial systems. Despite their outstanding performance, transformer models often rely on large Neural Networks (NNs) that are difficult to execute on IoT devices because of their energy and computing constraints. This paper introduces tiny transformer-based AD tools that make on-device AD viable. Starting from the state-of-the-art Anomaly Transformer (AT) model, which has been shown to provide accurate AD but has high computational and memory demands, we propose a tiny AD framework that finds an optimized configuration of the AT model and uses it to devise a compressed version compatible with resource-constrained IoT systems. A knowledge distillation tool is developed to obtain a highly compressed AT model without degrading the AD performance. The proposed framework is first analyzed on four widely adopted AD datasets and then assessed using data extracted from a real-world monitoring facility. The results show that the tiny AD tool provides a compressed AT model with a 99.93% reduction in the number of trainable parameters compared to the original implementation (from 4.8 million to 3300 or 1400, depending on the input dataset), without significantly compromising AD accuracy. Moreover, the compressed model substantially outperforms both a popular Recurrent Neural Network (RNN)-based AD tool with a similar number of trainable weights and a conventional One-Class Support Vector Machine (OCSVM) algorithm.
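The compression figure quoted in the abstract can be checked with simple arithmetic. The sketch below is illustrative only and is not taken from the paper: the function name `reduction_percent` is hypothetical, and the original parameter count is assumed to be exactly 4,800,000, although the abstract gives only the rounded value "4.8 million". Under that assumption, the 3300-parameter model matches the stated 99.93% reduction, and the 1400-parameter model comes out slightly higher.

```python
def reduction_percent(original: int, compressed: int) -> float:
    """Percentage of trainable parameters removed by compression."""
    return 100.0 * (1.0 - compressed / original)


# Assumption: "4.8 million" in the abstract is treated as exactly 4,800,000.
ORIGINAL_PARAMS = 4_800_000

# The two compressed model sizes reported in the abstract.
for compressed in (3300, 1400):
    pct = reduction_percent(ORIGINAL_PARAMS, compressed)
    print(f"{compressed} parameters: {pct:.2f}% reduction")

# Prints:
# 3300 parameters: 99.93% reduction
# 1400 parameters: 99.97% reduction
```

Because the original count is rounded, the second decimal place of these percentages should be read as approximate.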
Pages: 462-478
Number of pages: 17