Embedding and learning with signatures

被引:23
作者
Fermanian, Adeline [1 ]
机构
[1] Sorbonne Univ, CNRS, Lab Probabilites Stat & Modelisat, 4 Pl Jussieu, F-75005 Paris, France
关键词
Sequential data; Time series classification; Functional data; Signature;
D O I
10.1016/j.csda.2020.107148
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Sequential and temporal data arise in many fields of research, such as quantitative finance, medicine, or computer vision. A novel approach for sequential learning, called the signature method and rooted in rough path theory, is considered. Its basic principle is to represent multidimensional paths by a graded feature set of their iterated integrals, called the signature. This approach relies critically on an embedding principle, which consists in representing discretely sampled data as paths, i.e., functions from [0, 1] to R-d. After a survey of machine learning methodologies for signatures, the influence of embeddings on prediction accuracy is investigated with an in-depth study of three recent and challenging datasets. It is shown that a specific embedding, called lead-lag, is systematically the strongest performer across all datasets and algorithms considered. Moreover, an empirical study reveals that computing signatures over the whole path domain does not lead to a loss of local information. It is concluded that, with a good embedding, combining signatures with other simple algorithms achieves results competitive with state-of-the-art, domain-specific approaches. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:23
相关论文
共 50 条
[31]   PS-LSTM: Capturing Essential Sequential Online Information with Path Signature and LSTM for Writer Identification [J].
Liu, Manfei ;
Tin, Lianwen ;
Xie, Zecheng .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :664-669
[32]  
Lutkepohl H., 2006, New Introduction to Multiple Time Series Analysis
[33]  
Lyons T., 2014, P 2014 INT C BIG DAT, DOI [DOI 10.1145/2640087.2644157, 10.1145/2640087.2644157]
[34]  
Lyons T. J, 2007, LECT NOTES MATH, V1908
[35]   Inverting the signature of a path [J].
Lyons, Terry J. ;
Xu, Weijun .
JOURNAL OF THE EUROPEAN MATHEMATICAL SOCIETY, 2018, 20 (07) :1655-1687
[36]   Hyperbolic development and inversion of signature [J].
Lyons, Terry J. ;
Xu, Weijun .
JOURNAL OF FUNCTIONAL ANALYSIS, 2017, 272 (07) :2933-2955
[37]   Differential equations driven by rough signals [J].
Lyons, TJ .
REVISTA MATEMATICA IBEROAMERICANA, 1998, 14 (02) :215-310
[38]  
Malekzadeh M., 2018, P 1 WORKSHOP PRIVACY
[39]   Mobile Sensor Data Anonymization [J].
Malekzadeh, Mohammad ;
Clegg, Richard G. ;
Cavallaro, Andrea ;
Haddadi, Hamed .
PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION (IOTDI '19), 2019, :49-58
[40]   Longitudinal functional data analysis [J].
Park, So Young ;
Staicu, Ana-Maria .
STAT, 2015, 4 (01) :212-226