Predicting multidimensional data via tensor learning

被引:3
|
作者
Brandi, Giuseppe [1 ]
Di Matteo, T. [1 ,2 ,3 ]
机构
[1] Kings Coll London, Dept Math, London WC2R 2LS, England
[2] Complex Sci Hub Vienna, Josefstaedter Str 39, A-1080 Vienna, Austria
[3] Ctr Ric Enrico Fermi, Via Panisperna 89 A, I-00184 Rome, Italy
关键词
Tensor regression; Multiway data; ALS; Multilinear regression; REGRESSION; DECOMPOSITIONS;
D O I
10.1016/j.jocs.2021.101372
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The analysis of multidimensional data is becoming a more and more relevant topic in statistical and machine learning research. Given their complexity, such data objects are usually reshaped into matrices or vectors and then analysed. However, this methodology presents several drawbacks. First of all, it destroys the intrinsic interconnections among datapoints in the multidimensional space and, secondly, the number of parameters to be estimated in a model increases exponentially. We develop a model that overcomes such drawbacks. In particular, in this paper, we propose a parsimonious tensor regression model that retains the intrinsic multidimensional structure of the dataset. Tucker structure is employed to achieve parsimony and a shrinkage penalization is introduced to deal with over-fitting and collinearity. To estimate the model parameters, an Alternating Least Squares algorithm is developed. In order to validate the model performance and robustness, a simulation exercise is produced. Moreover, we perform an empirical analysis that highlight the forecasting power of the model with respect to benchmark models. This is achieved by implementing an autoregressive specification on the Foursquares spatio-temporal dataset together with a macroeconomic panel dataset. Overall, the proposed model is able to outperform benchmark models present in the forecasting literature.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
    Dinh, An
    Miertschin, Stacey
    Young, Amber
    Mohanty, Somya D.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [42] FEDERATED LEARNING OF TENSOR GENERALIZED LINEAR MODELS WITH LOW SEPARATION RANK
    Hoyos Sanchez, Jose
    Taki, Batoul
    Bajwa, Waheed U.
    Sarwate, Anand D.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2136 - 2140
  • [43] Theories, algorithms and applications in tensor learning
    Deng, Xiaowu
    Shi, Yuanquan
    Yao, Dunhong
    APPLIED INTELLIGENCE, 2023, 53 (17) : 20514 - 20534
  • [44] Mode-R Subspace Projection of a Tensor for Multidimensional Harmonic Parameter Estimations
    Li, Yang
    Zhang, Jian Qiu
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (11) : 3002 - 3014
  • [45] A tensor framework for learning in structured domains
    Castellana, Daniele
    Bacciu, Davide
    NEUROCOMPUTING, 2022, 470 : 405 - 426
  • [46] Multidimensional denoising of rolling element bearings with compound fault based on tensor factorization
    Hu, Chaofan
    Wang, Yanxue
    PROCEEDINGS OF THE 2017 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTER (MACMC 2017), 2017, 150 : 350 - 355
  • [47] TensorLy: Tensor Learning in Python']Python
    Kossaifi, Jean
    Panagakis, Yannis
    Anandkumar, Anima
    Pantic, Maja
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [48] Visualizing Multidimensional Data with Order Statistics
    Raj, M.
    Whitaker, R. T.
    COMPUTER GRAPHICS FORUM, 2018, 37 (03) : 277 - 287
  • [49] Learning mixtures of polynomials of multidimensional probability densities from data using B-spline interpolation
    Lopez-Cruz, Pedro L.
    Bielza, Concha
    Larranaga, Pedro
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2014, 55 (04) : 989 - 1010
  • [50] Scalable tensor factorizations for incomplete data
    Acar, Evrim
    Dunlavy, Daniel M.
    Kolda, Tamara G.
    Morup, Morten
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 106 (01) : 41 - 56