Predicting multidimensional data via tensor learning

Cited by: 3
Authors
Brandi, Giuseppe [1]
Di Matteo, T. [1, 2, 3]
Affiliations
[1] Kings Coll London, Dept Math, London WC2R 2LS, England
[2] Complex Sci Hub Vienna, Josefstaedter Str 39, A-1080 Vienna, Austria
[3] Ctr Ric Enrico Fermi, Via Panisperna 89 A, I-00184 Rome, Italy
Keywords
Tensor regression; Multiway data; ALS; Multilinear regression; REGRESSION; DECOMPOSITIONS;
DOI
10.1016/j.jocs.2021.101372
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Subject classification code
081203 ; 0835 ;
Abstract
The analysis of multidimensional data is becoming increasingly relevant in statistical and machine learning research. Given their complexity, such data objects are usually reshaped into matrices or vectors before being analysed. However, this approach has several drawbacks. First, it destroys the intrinsic interconnections among datapoints in the multidimensional space; second, the number of parameters to be estimated in a model increases exponentially. In this paper, we propose a parsimonious tensor regression model that overcomes these drawbacks by retaining the intrinsic multidimensional structure of the dataset. A Tucker structure is employed to achieve parsimony, and a shrinkage penalization is introduced to deal with over-fitting and collinearity. To estimate the model parameters, an Alternating Least Squares algorithm is developed. To validate the model's performance and robustness, a simulation exercise is carried out. Moreover, we perform an empirical analysis that highlights the forecasting power of the model with respect to benchmark models. This is achieved by implementing an autoregressive specification on the Foursquares spatio-temporal dataset together with a macroeconomic panel dataset. Overall, the proposed model outperforms benchmark models from the forecasting literature.
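The estimation idea mentioned in the abstract (alternating least squares with a shrinkage penalty) can be illustrated on a much simpler model. The sketch below is not the paper's Tucker-structured tensor regression; it assumes a bilinear matrix regression Y_t = A X_t B^T + E_t with a ridge penalty on A and B, and alternates closed-form ridge updates for each factor. The function name `ridge_als_bilinear`, the penalty weight `lam`, and the simulated data are all hypothetical choices made for illustration.

```python
import numpy as np

def ridge_als_bilinear(X, Y, lam=1e-2, n_iter=50, seed=0):
    """Ridge-penalised ALS for Y_t ~ A @ X_t @ B.T (illustrative stand-in model).

    X has shape (T, m, n), Y has shape (T, p, q).
    Returns A (p x m), B (q x n) and the penalised loss after each sweep.
    """
    rng = np.random.default_rng(seed)
    T, m, n = X.shape
    _, p, q = Y.shape
    A = rng.standard_normal((p, m))
    B = rng.standard_normal((q, n))
    history = []
    for _ in range(n_iter):
        # Update A with B fixed: Y_t ~ A Z_t, where Z_t = X_t B^T (m x q).
        Z = np.einsum('tmn,qn->tmq', X, B)
        S = np.einsum('tpq,tmq->pm', Y, Z)                 # sum_t Y_t Z_t^T
        G = np.einsum('tmq,tkq->mk', Z, Z) + lam * np.eye(m)
        A = np.linalg.solve(G, S.T).T                      # A = S G^{-1}, G symmetric

        # Update B with A fixed: Y_t^T ~ B W_t^T, where W_t = A X_t (p x n).
        W = np.einsum('pm,tmn->tpn', A, X)
        S = np.einsum('tpq,tpn->qn', Y, W)                 # sum_t Y_t^T W_t
        G = np.einsum('tpn,tpk->nk', W, W) + lam * np.eye(n)
        B = np.linalg.solve(G, S.T).T

        # Penalised objective after the full sweep.
        fit = np.einsum('tpn,qn->tpq', W, B)               # A X_t B^T
        loss = np.sum((Y - fit) ** 2) + lam * (np.sum(A ** 2) + np.sum(B ** 2))
        history.append(float(loss))
    return A, B, history

# Simulated data (assumed sizes, chosen only for the demonstration).
rng = np.random.default_rng(1)
T, m, n, p, q = 200, 4, 5, 3, 2
A_true = rng.standard_normal((p, m))
B_true = rng.standard_normal((q, n))
X = rng.standard_normal((T, m, n))
Y = np.einsum('pm,tmn,qn->tpq', A_true, X, B_true)
Y = Y + 0.1 * rng.standard_normal(Y.shape)

A_hat, B_hat, history = ridge_als_bilinear(X, Y)
print("first and last penalised loss:", history[0], history[-1])
```

Because each factor update is the exact minimiser of the penalised objective with the other factor held fixed, the recorded loss should not increase from one sweep to the next. The bilinear form needs only p·m + q·n coefficients, whereas regressing the vectorised response on the vectorised predictor would need p·q·m·n, which is the kind of parsimony the abstract refers to.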
Pages: 10