Tri-Partition Alphabet-Based State Prediction for Multivariate Time-Series

被引:1
作者
Wen, Zuo-Cheng [1 ]
Zhang, Zhi-Heng [1 ]
Zhou, Xiang-Bing [1 ,2 ]
Gu, Jian-Gang [1 ,3 ]
Shen, Shao-Peng [1 ,3 ]
Chen, Gong-Suo [1 ]
Deng, Wu [1 ,4 ]
机构
[1] Sichuan Tourism Univ, Sch Informat & Engn, Chengdu 610100, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China
[3] Chengdu Univ Informat Technol, Sch Software Engn, Chengdu 610225, Peoples R China
[4] Civil Aviat Univ China, Sch Elect Informat & Automat, Tianjin 300300, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 23期
基金
中国国家自然科学基金;
关键词
multivariate time-series; k matrix nearest neighbor; tri-partition alphabet; state prediction; 3-WAY DECISION; SYMBOLIC REPRESENTATION; MODEL; SYSTEM;
D O I
10.3390/app112311294
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Recently, predicting multivariate time-series (MTS) has attracted much attention to obtain richer semantics with similar or better performances. In this paper, we propose a tri-partition alphabet-based state (tri-state) prediction method for symbolic MTSs. First, for each variable, the set of all symbols, i.e., alphabets, is divided into strong, medium, and weak using two user-specified thresholds. With the tri-partitioned alphabet, the tri-state takes the form of a matrix. One order contains the whole variables. The other is a feature vector that includes the most likely occurring strong, medium, and weak symbols. Second, a tri-partition strategy based on the deviation degree is proposed. We introduce the piecewise and symbolic aggregate approximation techniques to polymerize and discretize the original MTS. This way, the symbol is stronger and has a bigger deviation. Moreover, most popular numerical or symbolic similarity or distance metrics can be combined. Third, we propose an along-across similarity model to obtain the k-nearest matrix neighbors. This model considers the associations among the time stamps and variables simultaneously. Fourth, we design two post-filling strategies to obtain a completed tri-state. The experimental results from the four-domain datasets show that (1) the tri-state has greater recall but lower precision; (2) the two post-filling strategies can slightly improve the recall; and (3) the along-across similarity model composed by the Triangle and Jaccard metrics are first recommended for new datasets.
引用
收藏
页数:23
相关论文
共 56 条
  • [1] Dynamic and Internal Longest Common Substring
    Amir, Amihood
    Charalampopoulos, Panagiotis
    Pissis, Solon P.
    Radoszewski, Jakub
    [J]. ALGORITHMICA, 2020, 82 (12) : 3707 - 3743
  • [2] A class of hybrid morphological perceptrons with application in time series forecasting
    Araujo, Ricardo de A.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2011, 24 (04) : 513 - 529
  • [3] Multivariate times series classification through an interpretable representation
    Baldan, Francisco J.
    Benitez, Jose M.
    [J]. INFORMATION SCIENCES, 2021, 569 : 596 - 614
  • [4] Learning a symbolic representation for multivariate time series classification
    Baydogan, Mustafa Gokce
    Runger, George
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (02) : 400 - 422
  • [5] A novel approach for the structural comparison of origin-destination matrices: Levenshtein distance
    Behara, Krishna N. S.
    Bhaskar, Ashish
    Chung, Edward
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 111 : 513 - 530
  • [6] A weighted LS-SVM based learning system for time series forecasting
    Chen, Thao-Tsen
    Lee, Shie-Jue
    [J]. INFORMATION SCIENCES, 2015, 299 : 99 - 116
  • [7] Chen X., 2020, ARXIV200610436
  • [8] Initialization by a Novel Clustering for Wavelet Neural Network as Time Series Predictor
    Cheng, Rong
    Hu, Hongping
    Tan, Xiuhui
    Bai, Yanping
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2015, 2015
  • [9] Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data
    Chung, Neo Christopher
    Miasojedow, Blazej
    Startek, Michal
    Gambin, Anna
    [J]. BMC BIOINFORMATICS, 2019, 20 (Suppl 15)
  • [10] Decision-theoretic three-way approximations of fuzzy sets
    Deng, Xiaofei
    Yao, Yiyu
    [J]. INFORMATION SCIENCES, 2014, 279 : 702 - 715