A Hypothesis Discovery Method for Predicting Change in Multidimensional Time-series Data

被引:0
|
作者
Kumoi, Gendo [1 ]
Goto, Masayuki [1 ]
机构
[1] Waseda Univ, Sch Creat Sci & Engn, Tokyo, Japan
来源
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2020年
关键词
hypothesis discovery; customer analysis; change prediction; betweeness centrality; random forest;
D O I
10.1109/smc42975.2020.9282955
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of IoT technology, it has become possible to accumulate and regularly measure multidimensional time-series data. In this study, we focus on the usage of multidimensional time-series data from printer products' log data and propose a method for its analysis. In addition to the number of sheets printed by each customer, the log data includes various time-series information such as the amount of remaining toner, the number of stoppages that occur, and the activation times. To utilize these data for business purposes, it is desirable to construct a model for predicting future changes in use characteristics for each customer. In this study, we apply the random forest algorithm to predict such changes. However, if all measurable features of the problem are included, the model becomes complex and cannot be interpreted. Although the accuracy is relatively high if an appropriate learning algorithm is applied, the complex model tends to overfit the training data. In this paper, we propose a method to select the modeling features that can be interpreted by graph mining while maintaining accuracy. This would enable us to interpret the data at the field level and discover the hypotheses that are necessary for planned marketing policies. Finally, the proposed method is applied to real data and its efficacy is demonstrated.
引用
收藏
页码:854 / 859
页数:6
相关论文
共 50 条
  • [1] Predicting Student Performance in Online Learning: A Multidimensional Time-Series Data Analysis Approach
    Shou, Zhaoyu
    Xie, Mingquan
    Mo, Jianwen
    Zhang, Huibing
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [2] Predicting the Success of Online Petitions Leveraging Multidimensional Time-Series
    Proskurnia, Julia
    Grabowicz, Przemyslaw A.
    Kobayashi, Ryota
    Castillo, Carlos
    Cudre-Mauroux, Philippe
    Aberer, Karl
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 755 - 764
  • [3] Correlation Set Discovery on Time-Series Data
    Amagata, Daichi
    Hara, Takahiro
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, 2019, 11707 : 275 - 290
  • [4] Event discovery in medical time-series data
    Tsien, CL
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 858 - 862
  • [5] A METHOD FOR PREDICTING CHAOTIC TIME-SERIES WITH OUTLIERS
    ITOH, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1995, 78 (05): : 44 - 53
  • [6] Trend analysis of time-series data: A novel method for untargeted metabolite discovery
    Peters, Sonja
    Janssen, Hans-Gerd
    Vivo-Truyols, Gabriel
    ANALYTICA CHIMICA ACTA, 2010, 663 (01) : 98 - 104
  • [7] Indexing multidimensional time-series
    Vlachos, M
    Hadjieleftheriou, M
    Gunopulos, D
    Keogh, E
    VLDB JOURNAL, 2006, 15 (01): : 1 - 20
  • [8] Indexing Multidimensional Time-Series
    Michail Vlachos
    Marios Hadjieleftheriou
    Dimitrios Gunopulos
    Eamonn Keogh
    The VLDB Journal, 2006, 15 : 1 - 20
  • [9] TIME-SERIES ANALYSES OF GLOBAL CHANGE DATA
    LANE, LJ
    NICHOLS, MH
    OSBORN, HB
    ENVIRONMENTAL POLLUTION, 1994, 83 (1-2) : 63 - 68
  • [10] MOOC Dropout Prediction Based on Multidimensional Time-Series Data
    Shou, Zhaoyu
    Chen, Pan
    Wen, Hui
    Liu, Jinghua
    Zhang, Huibing
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022