Patterns of time-interval based patterns for improved multivariate time series data classification

被引:0
作者
Shenderovitz, Gil
Sheetrit, Eitam
Nissim, Nir [1 ]
机构
[1] Ben Gurion Univ Negev, Cyber Secur Res Ctr, Malware Lab, Beer Sheva, Israel
关键词
Time-interval mining; Temporal patterns; Multivariate time series data; Classification; Machine learning; Explainability; KNOWLEDGE; SYSTEM; INFORMATION; SEPSIS;
D O I
10.1016/j.engappai.2024.108171
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, computational advancements have contributed to a significant increase in the volume and availability of multivariate time series data (MTSD). The ability to make use of MTSD is challenging, as it often suffers from missing values, random noise, varied sampling rates, and temporal granularities. In the modern era of machine learning, learning frameworks must accurately capture the data's temporal structure and have high generalization capabilities, as well as provide interpretability and explainability regarding its decisions. The use of temporal abstractions (TAs) and interval-based temporal patterns (TPs) has been proposed for learning from MTSD, overcoming existing challenges, and achieving state-of-the-art results on various tasks, however, improvements are needed in terms of the generalization capabilities and decision explainability. In this paper, we propose a novel interval-based method, Inter-Rel, which provides an additional layer of learning that exploits the temporal relations among the discovered interval-based TPs, enhances the feature space, and improves explanatory capabilities by shedding light on the relations between the TPs over time. To evaluate the effectiveness and additional benefits provided by Inter-Rel, we evaluate our method on various learning tasks and domains such as early sepsis detection; electroencephalogram-based (EEG-based) subject classification; and the categorization of malicious portable executables based on dynamic analysis. The experimental results demonstrate that Inter-Rel's use enhances classification performance by up to 16% in terms of area under the curve (AUC) or accuracy (depending on the task) while providing additional explanatory capabilities that can be leveraged by a domain expert.
引用
收藏
页数:25
相关论文
共 96 条
[1]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[2]   MAINTAINING KNOWLEDGE ABOUT TEMPORAL INTERVALS [J].
ALLEN, JF .
COMMUNICATIONS OF THE ACM, 1983, 26 (11) :832-843
[3]  
[Anonymous], Worldwide Windows operating system market share
[4]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[5]  
Batal I., 2009, Proceedings of the 22nd International Florida Artificial Intelligence Research Society Conference, FLAIRS-22, V22, P344
[6]  
Batal Iyad, 2012, KDD, V2012, P280
[7]   Learning a symbolic representation for multivariate time series classification [J].
Baydogan, Mustafa Gokce ;
Runger, George .
DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (02) :400-422
[8]  
Berndt DonaldJ., 1994, KDD WORKSHOP, P359, DOI DOI 10.5555/3000850.3000887
[9]   The BCI competition 2003:: Progress and perspectives in detection and discrimination of EEG single trials [J].
Blankertz, B ;
Müller, KR ;
Curio, G ;
Vaughan, TM ;
Schalk, G ;
Wolpaw, JR ;
Schlögl, A ;
Neuper, C ;
Pfurtscheller, G ;
Hinterberger, T ;
Schröder, M ;
Birbaumer, N .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2004, 51 (06) :1044-1051
[10]  
Brockwell P.J., 2009, Time Series: Theory and Methods