MMDL-Based Data Augmentation with Domain Knowledge for Time Series Classification

被引:0
作者
Li, Xiaosheng [1 ]
Wu, Yifan [2 ]
Jiang, Wei [1 ]
Li, Ying [2 ]
Li, Jianguo [1 ]
机构
[1] Ant Grp, Hangzhou, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT III, ECML PKDD 2024 | 2024年 / 14943卷
关键词
Time series classification; Data augmentation; Minimum description length; FOREST; MODEL;
D O I
10.1007/978-3-031-70352-2_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Plenty of time series classification methods have been proposed in the past. Most methods utilize the labeled time series instances to build classifiers, ignoring the explicit domain knowledge. However, in real-world applications, practitioners may identify domain characteristics of the time series, and build the heuristic relationship between the class labels of the time series and these domain characteristics. In this paper, we investigate the possibility of incorporating the domain knowledge into time series classification for possible performance improvement. To this end, we propose a Modified Minimum Description Length (MMDL)-based data augmentation method to inject domain knowledge into time series classification. Based on the type of domain knowledge, the proposed method applies the MMDL shapes or residuals to augment the training data. Experimental results demonstrate that the proposed method consistently improves the classification accuracy across all tested datasets and achieves better results than other time series data augmentation methods.
引用
收藏
页码:403 / 420
页数:18
相关论文
共 50 条
[21]   Domain Knowledge Graph Question Answering Based on Semantic Analysis and Data Augmentation [J].
Hu, Shulin ;
Zhang, Huajun ;
Zhang, Wanying .
APPLIED SCIENCES-BASEL, 2023, 13 (15)
[22]   Data augmentation in economic time series: Behavior and improvements in predictions [J].
de Rojas, Ana Lazcano .
AIMS MATHEMATICS, 2023, 8 (10) :24528-24544
[23]   Data augmentation for univariate time series forecasting with neural networks [J].
Semenoglou, Artemios-Anargyros ;
Spiliotis, Evangelos ;
Assimakopoulos, Vassilios .
PATTERN RECOGNITION, 2022, 134
[24]   Research on data augmentation algorithm for time series based on deep learning [J].
Liu, Shiyu ;
Qiao, Hongyan ;
Yuan, Lianhong ;
Yuan, Yuan ;
Liu, Jun .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (06) :1530-1544
[25]   GPR Data Augmentation Methods by Incorporating Domain Knowledge [J].
Yue, Guanghua ;
Liu, Chenglong ;
Li, Yishun ;
Du, Yuchuan ;
Guo, Shili .
APPLIED SCIENCES-BASEL, 2022, 12 (21)
[26]   Motif Alignment for Time Series Data Augmentation [J].
Bahri, Omar ;
Li, Peiyu ;
Boubrahimi, Soukaina Filali ;
Hamdi, Shah Muhammad .
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2023, 2023, 14148 :42-48
[27]   DDTM: A Distance-Based Data Transformation Method for Time Series Classification [J].
Xu, Huarong ;
Wang, Ke ;
Sun, Wu ;
Chen, Mei ;
Li, Hui ;
Zhao, Heng .
ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 :94-111
[28]   Decomposition-based Data Augmentation for Time-series Building Load Data [J].
Deng, Yang ;
Liang, Rui ;
Wang, Dan ;
Li, Ao ;
Xiao, Fu .
PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, :51-60
[29]   Novel statistical time series data augmentation and machine learning based classification of unobtrusive respiration data for respiration Digital Twin model [J].
Khan, Sagheer ;
Alzaabi, Aaesha ;
Ratnarajah, Tharmalingam ;
Arslan, Tughrul .
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
[30]   Time Series Data Classification using the Knowledge-Augmented Transformer Model [J].
Chung, Hyun-seok ;
Hyun, Sun-young ;
Lee, Chang-eun ;
Ha, Young-guk .
2025 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2025, :419-420