FLOps: On Learning Important Time Series Features for Real-Valued Prediction

被引:8
作者
Patel, Dhaval [1 ]
Shah, Syed Yousaf [1 ]
Zhou, Nianjun [1 ]
Shrivastava, Shrey [1 ]
Iyengar, Arun [1 ]
Bhamidipaty, Anuradha [1 ]
Kalagnanam, Jayant [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
来源
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2020年
关键词
D O I
10.1109/BigData50022.2020.9378499
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series value forecasting using machine learning models utilizing time series features has recently got good attention of Time series analytics community. This paper proposes an automated feature learning mechanisms to filter out most useful features from hundreds of available features for time series prediction problems. The paper further proposes a novel mechanism to dynamically filter features that are most suitable for the given input time series data. With such mechanisms we create pipeline consisting of most useful features for given input data and increases the performance of the prediction model. Our proposed mechanism first, groups well known features for time series analysis, generates and assigns the features importance score using multiple scoring configurations. Once scores are assigned, features are filtered using a threshold that is derived using reference feature score and Critical Difference diagram. The filtered features are subsequently analyzed based on the characteristics of the input dataset. We show using experimental results that our approach of input data based dynamic feature selection improves the overall performance of machine learning models compared to the case where dynamic feature extraction is not applied prior to modeling.
引用
收藏
页码:1624 / 1633
页数:10
相关论文
共 31 条
[1]  
[Anonymous], 2016, ARXIV161007717
[2]  
[Anonymous], 2020, PMDARIMA AUTOARIMA P
[3]  
[Anonymous], 2020, AUTOTS MODEL SELECTI
[4]  
[Anonymous], 2020, PYAF AUTOMATIC TIME
[5]   The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances [J].
Bagnall, Anthony ;
Lines, Jason ;
Bostrom, Aaron ;
Large, James ;
Keogh, Eamonn .
DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) :606-660
[6]   TSFEL: Time Series Feature Extraction Library [J].
Barandas, Marilia ;
Folgado, Duarte ;
Fernandes, Leticia ;
Santos, Sara ;
Abreu, Mariana ;
Bota, Patricia ;
Liu, Hui ;
Schultz, Tanja ;
Gamboa, Hugo .
SOFTWAREX, 2020, 11
[7]  
Buitinck L., 2013, ECML PKDD WORKSHOP L
[8]   Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python']Python package) [J].
Christ, Maximilian ;
Braun, Nils ;
Neuffer, Julius ;
Kempa-Liehr, Andreas W. .
NEUROCOMPUTING, 2018, 307 :72-77
[9]  
Dempster A., 2019, ROCKET EXCEPTIONALLY
[10]  
Demsar J, 2006, J MACH LEARN RES, V7, P1