Simulation Study on How Input Data Affects Time-Series Classification Model Results

被引:0
作者
Sadowska, Maria [1 ]
Gajowniczek, Krzysztof [1 ]
机构
[1] Warsaw Univ Life Sci SGGW, Inst Informat Technol, PL-02787 Warsaw, Poland
关键词
time series; classification; synthetic data;
D O I
10.3390/e27060624
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
This paper discusses the results of a study investigating how input data characteristics affect the performance of time-series classification models. In this experiment, we used 82 synthetically generated time-series datasets, created based on predefined functions with added noise. These datasets varied in structure, including differences in the number of classes and noise levels, while maintaining a consistent length and total number of observations. This design allowed us to systematically assess the influence of dataset characteristics on classification outcomes. Seven classification models were evaluated and their performance was compared using accuracy metrics, training time and memory requirements. According to the evaluation, the CNN Classifier achieved the best results, demonstrating the highest robustness to an increasing number of classes and noise. In contrast, the least effective model was the Catch22 Classifier. Overall, the performed research leads to the conclusion that as the number of classes and the level of noise in the data increase, all classification models become less effective, achieving lower accuracy metrics.
引用
收藏
页数:20
相关论文
共 36 条
[31]  
Wachman G, 2009, LECT NOTES ARTIF INT, V5782, P489, DOI 10.1007/978-3-642-04174-7_32
[32]   Artificial intelligence in healthcare: a primer for medical education in radiomics [J].
Waldman, Carly E. ;
Hermel, Melody ;
Hermel, Jonathan A. ;
Allinson, Francis ;
Pintea, Mark N. ;
Bransky, Natalie ;
Udoh, Emem ;
Nicholson, Laura ;
Robinson, Austin ;
Gonzalez, Jorge ;
Suhar, Christopher ;
Nayak, Keshav ;
Wesbey, George ;
Bhavnani, Sanjeev P. .
PERSONALIZED MEDICINE, 2022, 19 (05) :445-456
[33]  
Wang Z., 2023, Advances in Neural Information Processing Systems
[34]   Boundary-enhanced time series data imputation with long-term dependency diffusion models [J].
Xiao, Chunjing ;
Jiang, Xue ;
Du, Xianghe ;
Yang, Wei ;
Lu, Wei ;
Wang, Xiaomin ;
Chetty, Kevin .
KNOWLEDGE-BASED SYSTEMS, 2025, 310
[35]   Imputation-based Time-Series Anomaly Detection with Conditional Weight-Incremental Diffusion Models [J].
Xiao, Chunjing ;
Gou, Zehua ;
Tai, Wenxin ;
Zhang, Kunpeng ;
Zhou, Fan .
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, :2742-2751
[36]   Convolutional neural networks for time series classification [J].
Zhao, Bendong ;
Lu, Huanzhang ;
Chen, Shangfeng ;
Liu, Junliang ;
Wu, Dongya .
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2017, 28 (01) :162-169