Early classification of time series: Cost-based optimization criterion and algorithms

Cited by: 9
Authors
Achenchabe, Youssef [1, 2]
Bondu, Alexis [2]
Cornuejols, Antoine [1]
Dachraoui, Asma [1]
Affiliations
[1] Univ Paris Saclay, INRAe, AgroParisTech, UMR MIA Paris, F-75005 Paris, France
[2] Orange Labs, 44 Ave Republ, Chatillon, France
Keywords
Early classification of time series; Cost estimation; Sequential decision making
DOI
10.1007/s10994-021-05974-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An increasing number of applications require recognizing the class of an incoming time series as quickly as possible without unduly compromising the accuracy of the prediction. In this paper, we put forward a new optimization criterion which takes into account both the cost of misclassification and the cost of delaying the decision. Based on this optimization criterion, we derived a family of non-myopic algorithms which try to anticipate the expected future gain in information and balance it against the cost of waiting. In one class of algorithms, unsupervised-based, the expectations use the clustering of time series, while in a second class, supervised-based, time series are grouped according to the confidence level of the classifier used to label them. Extensive experiments carried out on real datasets using a large range of delay cost functions show that the presented algorithms are able to solve the earliness vs. accuracy trade-off, with the supervised partition-based approaches faring better than the unsupervised partition-based ones. In addition, all these methods perform better in a wide variety of conditions than a state-of-the-art method based on a myopic strategy which is recognized as being very competitive. Furthermore, our experiments show that the non-myopic feature of the proposed approaches explains in large part the obtained performances.
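The trade-off described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simplified setting with a single scalar misclassification cost, a partition of the training series into K groups, and a pre-estimated table of per-group error rates at each time step. All names (`expected_costs`, `should_decide_now`, `cluster_probs`, `error_rates`, `delay_cost`) and the numbers are hypothetical.

```python
import numpy as np

def expected_costs(cluster_probs, error_rates, misclassification_cost, delay_cost, t):
    """Expected total cost of deciding at each time step t, t+1, ..., T-1.

    cluster_probs: shape (K,), assumed membership probabilities of the
        incoming series given what has been observed up to time t.
    error_rates: shape (K, T), assumed estimate of the probability of a
        wrong label for a series of group k when deciding at step j.
    delay_cost: callable mapping a time step to the cost of waiting until it.
    """
    n_steps = error_rates.shape[1]
    horizons = np.arange(t, n_steps)
    # Expected error at each candidate decision time, averaged over groups.
    expected_error = cluster_probs @ error_rates[:, t:]
    delay = np.array([delay_cost(j) for j in horizons])
    return expected_error * misclassification_cost + delay

def should_decide_now(cluster_probs, error_rates, misclassification_cost, delay_cost, t):
    """Non-myopic rule: decide at t only if no future step has lower expected cost."""
    costs = expected_costs(cluster_probs, error_rates, misclassification_cost, delay_cost, t)
    return bool(np.argmin(costs) == 0)

# Hypothetical example: two groups, four time steps, linear delay cost.
cluster_probs = np.array([0.8, 0.2])
error_rates = np.array([[0.50, 0.30, 0.10, 0.05],
                        [0.40, 0.35, 0.30, 0.30]])
linear_delay = lambda j: 0.05 * j

decide_at_start = should_decide_now(cluster_probs, error_rates, 1.0, linear_delay, t=0)
decide_at_step_2 = should_decide_now(cluster_probs, error_rates, 1.0, linear_delay, t=2)
```

In a full procedure the membership probabilities would be re-estimated each time a new measurement arrives, so the estimated best decision time can shift as the series unfolds.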
Pages: 1481-1504
Number of pages: 24