Leveraging temporal autocorrelation of historical data for improving accuracy in network regression

Cited by: 22
Authors
Loglisci, Corrado [1, 2]
Malerba, Donato [1, 2]
Affiliations
[1] Univ Bari Aldo Moro, Dipartimento Informat, Bari, Italy
[2] CINI, Bari, Italy
Keywords
ensemble learning; historical data; network regression; temporal autocorrelation; TREES
DOI
10.1002/sam.11336
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Temporal data describe processes and phenomena that evolve over time. In many real-world applications, temporal data are characterized by temporal autocorrelation, which expresses the dependence of time-stamped data over a certain time lag. Often such processes and phenomena are characterized by evolving complex entities, which we can represent with evolving networks of data. In this scenario, a task that deserves attention is regression inference in temporal network data. In this paper, we investigate how to improve predictive inference on network data by accommodating the temporal autocorrelation of historical data in the learning process of the prediction models. Historical data are a type of temporal data in which most of the elements have already been stored. In practice, we study how to explicitly consider the influence of data of a network observed in the past to enhance prediction on the same network observed at present. The proposed approach relies on a model ensemble built with individual predictors learned on historical network data. The predictors are trained on summary networks, which synthesize the effect of autocorrelation in distinct sequences of network observations. Summary networks are identified with a sliding window model. Finally, the model ensemble combines the predictors with a weighting scheme that reflects the degree of influence of each predictor with respect to the network observed at present. We thus aim to accommodate temporal autocorrelation both in the data and in the prediction model. Empirical evaluation demonstrates that the proposed approach can boost regression performance on real-world network data.
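The pipeline outlined in the abstract (sliding windows over past network observations, one summary network per window, one predictor per summary, and a weighted ensemble) can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the authors' method: the abstract does not say how summary networks are built, which base learner is used, or how the weights are computed. Here each snapshot is assumed to be a dict mapping a node to a (feature, target) pair, the summary is a per-node mean, the predictor is a one-variable least-squares line, and the weights decay geometrically with the age of the window. All function names and the `decay` parameter are hypothetical.

```python
from statistics import mean


def sliding_windows(snapshots, width):
    """Return every run of `width` consecutive snapshots, oldest first."""
    return [snapshots[i:i + width] for i in range(len(snapshots) - width + 1)]


def summarize(window):
    """Assumed summary network: per-node mean of (feature, target) over the window."""
    nodes = window[0].keys()
    return {
        n: (mean(s[n][0] for s in window), mean(s[n][1] for s in window))
        for n in nodes
    }


def fit_line(summary):
    """Assumed base predictor: least-squares fit of target = slope * feature + intercept."""
    xs = [x for x, _ in summary.values()]
    ys = [y for _, y in summary.values()]
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    return slope, y_bar - slope * x_bar


def recency_weights(n_models, decay=0.5):
    """Assumed weighting: newer windows get geometrically larger, normalized weights."""
    raw = [decay ** (n_models - 1 - i) for i in range(n_models)]
    total = sum(raw)
    return [w / total for w in raw]


def ensemble_predict(models, weights, feature):
    """Weighted average of the individual predictors' outputs for one feature value."""
    return sum(w * (a * feature + b) for w, (a, b) in zip(weights, models))


if __name__ == "__main__":
    # Four past snapshots of a three-node network whose targets drift upward over time.
    history = [
        {"a": (1.0, 2.0 + t), "b": (2.0, 4.0 + t), "c": (3.0, 6.0 + t)}
        for t in range(4)
    ]
    models = [fit_line(summarize(w)) for w in sliding_windows(history, width=2)]
    weights = recency_weights(len(models))
    print(ensemble_predict(models, weights, feature=2.0))
```

With recency-based weights, the predictor trained on the most recent window dominates the combined prediction. A weighting driven by each predictor's error on the present network would be an equally plausible reading of the abstract.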
Pages: 40-53
Number of pages: 14