Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python']Python package)

被引:720
作者
Christ, Maximilian [1 ]
Braun, Nils [2 ]
Neuffer, Julius [1 ]
Kempa-Liehr, Andreas W. [3 ,4 ]
机构
[1] Blue Yonder GmbH, Karlsruhe, Germany
[2] Karlsruhe Inst Technol, Inst Expt Particle Phys, Karlsruhe, Germany
[3] Univ Auckland, Dept Engn Sci, Auckland, New Zealand
[4] Univ Freiburg, Freiburg Mat Res Ctr, Freiburg, Germany
关键词
Feature engineering; Time series; Feature extraction; Feature selection; Machine learning;
D O I
10.1016/j.neucom.2018.03.067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series feature engineering is a time-consuming process because scientists and engineers have to consider the multifarious algorithms of signal processing and time series analysis for identifying and extracting meaningful features from time series. The Python package tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests) accelerates this process by combining 63 time series characterization methods, which by default compute a total of 794 time series features, with feature selection on basis automatically configured hypothesis tests. By identifying statistically significant time series characteristics in an early stage of the data science process, tsfresh closes feedback loops with domain experts and fosters the development of domain specific features early on. The package implements standard APIs of time series and machine learning libraries (e.g. pandas and scikit-learn) and is designed for both exploratory analyses as well as straightforward integration into operational data science applications. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:72 / 77
页数:6
相关论文
共 25 条
[1]  
Abadi M., 2016, TENSORFLOW LARGESCAL
[2]  
[Anonymous], 2001, SciPy: Open source scientific tools for Python
[3]  
Bishop C.M., 2006, PATTERN RECOGN, V4, P738, DOI DOI 10.1117/1.2819119
[4]  
Box G. E. P., 1970, Time series analysis, forecasting and control
[5]   A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection [J].
Buczak, Anna L. ;
Guven, Erhan .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (02) :1153-1176
[6]  
Christ M., 2016, WORKSH LEARN BIG DAT
[7]  
Christ M., 2016, P WORKSH EXTR VAL TI, DOI [10.13140/RG.2.1.3130.7922, DOI 10.13140/RG.2.1.3130.7922]
[8]  
Christoph M, 2016, 2016 AES INTERNATIONAL CONFERENCE ON SOUND FIELD CONTROL
[9]   A New Initiative on Precision Medicine [J].
Collins, Francis S. ;
Varmus, Harold .
NEW ENGLAND JOURNAL OF MEDICINE, 2015, 372 (09) :793-795
[10]  
Fulcher B. D., 2017, ARXIV170908055V2