A text mining-based framework to discover the important factors in text reviews for predicting the views of live streaming

被引:26
作者
Chen, Wen-Kuo [1 ]
Chen, Long-Sheng [2 ]
Pan, Yi-Ting [2 ]
机构
[1] Chaoyang Univ Technol, Dept Mkt & Logist Management, Taichung 413310, Taiwan
[2] Chaoyang Univ Technol, Dept Informat Management, Taichung 413310, Taiwan
关键词
Live streaming; Text mining; Prediction; Feature selection; Games; Latent semantic analysis; LATENT SEMANTIC ANALYSIS; SOCIAL MEDIA; FEATURE-SELECTION; ONLINE REVIEWS; CLASSIFICATION; REGRESSION; MOTIVATIONS; ENGAGEMENT; SENTIMENT; FEATURES;
D O I
10.1016/j.asoc.2021.107704
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Live streaming has become one of the leisure activities of most people due to the rich and various contents. For young generation, to watch other people playing games on the live streaming platform is becoming very popular. Related researches mainly focused on predicting the number of viewers, finding popular streamer, studying the gift giving behaviors, and so on. Relatively few studies focused on how viewers' comments affect users' viewing behaviors, since the power of text comments in social media have been confirmed. In addition, published studies usually employed questionnaire survey methods which are prone to experimental effects. And online text comments will be more objective and less sampling bias than data collected by questionnaires. Consequently, this study focuses live streaming of games and uses viewers' text comments for experimental analysis. A text mining-based framework which includes Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine-Recursive Feature Elimination (SVM-RFE), Chi-square test will be proposed to determine the important keywords of predicting the number of views in live streaming. Support Vector Machine (SVM) will be utilized to evaluate the performances of candidate feature subsets. Then, K-means and Latent Semantic Analysis (LSA) using Singular Value Decomposition (SVD) have been used to organize the selected keywords into understandable concepts. Real cases of game live streaming cases will be collected from Twitch.tv for our experiments. Results can be used as a reference for live streaming platforms and live channels, and help them to increase the number of viewers for further income enhancement. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 87 条
[1]   You Tweet What You Eat: Studying Food Consumption Through Twitter [J].
Abbar, Sofiane ;
Mejova, Yelena ;
Weber, Ingmar .
CHI 2015: PROCEEDINGS OF THE 33RD ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2015, :3197-3206
[2]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199
[3]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[4]   Feature selection using an improved Chi-square for Arabic text classification [J].
Bahassine, Said ;
Madani, Abdellah ;
Al-Sarem, Mohammed ;
Kissi, Mohamed .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (02) :225-231
[5]   Chilean Wine Classification Using Volatile Organic Compounds Data Obtained With a Fast GC Analyzer [J].
Beltran, Nicolas H. ;
Duarte-Mermoud, Manuel A. ;
Vicencio, Victor A. Soto ;
Salah, Sebastian A. ;
Bustos, Matias A. .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2008, 57 (11) :2421-2436
[6]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[7]  
Burling, 2015, PUBL WKLY
[8]   Selecting Features Subsets Based on Support Vector Machine Recursive Features Elimination and One Dimensional-Naive Bayes Classifier using Support Vector Machines for Classification of Prostate and Breast Cancer [J].
Bustamam, Alhadi ;
Bachtiar, Anas ;
Sarwinda, Devvi .
4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 :450-458
[9]   Sentiment Classification of Consumer-Generated Online Reviews Using Topic Modeling [J].
Calheiros, Ana Catarina ;
Moro, Sergio ;
Rita, Paulo .
JOURNAL OF HOSPITALITY MARKETING & MANAGEMENT, 2017, 26 (07) :675-693
[10]  
Calinski T., 1974, Communications in Statistics-theory and Methods, V3, P1, DOI [DOI 10.1080/03610927408827101, 10.1080/03610927408827101]