Time series clustering based on sparse subspace clustering algorithm and its application to daily box-office data analysis

被引:0
作者
Yan Wang
Yunian Ru
Jianping Chai
机构
[1] Communication University of China,School of Data Science and Intelligent Media
[2] Communication University of China,Department of Information Engineering College
来源
Neural Computing and Applications | 2019年 / 31卷
关键词
Movie box office; Sparse subspace clustering; Time series; Data mining;
D O I
暂无
中图分类号
学科分类号
摘要
Movie box-office research is an important work for the rapid development of the film industry, and it is also a challenging task. Our study focuses on finding the regular box-office revenue patterns. Clustering algorithm is unsupervised machine learning algorithm which classifies the data in the absence of early knowledge of the classes. Unlike static data, the time series data vary with time. The work focused on time series clustering analysis is relatively less than those focused on static data. In this paper, the sparse subspace clustering (SSC) algorithm is introduced to analyze the time series data. The SSC algorithm has a better performance both on the artificial data set and the daily box-office data than recently developed well-known clustering algorithm such as K-means and spectral clustering algorithm. On the artificial data set, SSC is more suitable for time series, whether from the angle of clustering error or visualization. On the actual data, movies are divided into five clusters by SSC algorithm, and each cluster represents a distinct type of distribution pattern. And these patterns can be used in movie recommendation, film evaluation and can guide theater exhibitors and distributors. In addition, this is the first time to apply SSC to deal with time series clustering problem and get a pleasant effect.
引用
收藏
页码:4809 / 4818
页数:9
相关论文
共 38 条
  • [1] Barry RL(1989)Predicting financial success of motion pictures: the ‘80 s experience J Media Econ 2 35-50
  • [2] Kohl LS(1994)Predicting the performance of motion pictures J Media Econ 7 1-20
  • [3] Sochay S(2015)Using the hierarchical linear model to forecast movie box-office performance. the effect of online word of mouth Asia Pac J Inf Syst 25 563-578
  • [4] Park J(2010)Impact of star and movie buzz on motion picture distribution and box-office revenue Int J Res Mark 28 62-74
  • [5] Chung Y(2009)The differential effects of online word-of-mouth and critics’ reviews on pre-release movie evaluation Soc Sci Electron Publ 24 185-197
  • [6] Cho Y(2010)Predicting the future with social media Proc of Wiiat 7 492-499
  • [7] Karniouchina EV(2013)Quantifying anecdotes: google search data and political developments in Egypt Ps Political Sci Politics 46 291-298
  • [8] Chakravarty A(1998)Clustering at the Movies Market Lett 9 393-405
  • [9] Liu Y(2010)Chinese movies’ life cycle model and empirical analysis Syst Eng Theory Prac 30 1790-1797
  • [10] Mazumdar T(2016)Network analysis: understanding consumers’ choice in the film industry and predicting pre-released weekly box-office revenue Appl Stoch Models Bus Ind 32 409-422