Using text mining and sentiment analysis for online forums hotspot detection and forecast

被引:292
|
作者
Li, Nan [2 ]
Wu, Desheng Dash [1 ,3 ]
机构
[1] Univ Toronto, RiskLab, Toronto, ON M5S 1A1, Canada
[2] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[3] Reykjavik Univ, Reykjavik, Iceland
关键词
Text mining; Sentiment analysis; Cluster analysis; Online sports forums; Dynamic interacting network analysis; Hotspot detection; Machine learning; Support vector machine; SEQUENCE MOTIFS; CLASSIFICATION;
D O I
10.1016/j.dss.2009.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text sentiment analysis, also referred to as emotional polarity computation, has become a flourishing frontier in the text mining community. This paper studies online forums hotspot detection and forecast using sentiment analysis and text mining approaches. First, we create an algorithm to automatically analyze the emotional polarity of a text and to obtain a value for each piece of text. Second, this algorithm is combined with K-means clustering and support vector machine (SVM) to develop unsupervised text mining approach. We use the proposed text mining approach to group the forums into various clusters. with the center of each representing a hotspot forum within the current time span. The data sets used in our empirical studies are acquired and formatted from Sina sports forums, which spans a range of 31 different topic forums and 220,053 posts. Experimental results demonstrate that SVM forecasting achieves highly consistent results with K-means clustering. The top 10 hotspot forums listed by SVM forecasting resembles 80% of K-means clustering results. Both SVM and K-means achieve the same results for the top 4 hotspot forums of the year. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:354 / 368
页数:15
相关论文
共 50 条
  • [31] Sentiment analysis with text mining in contexts of big data
    Andrade C.S.
    Santos M.Y.
    1600, IGI Global (13): : 47 - 67
  • [32] Using Text Mining and Sentiment Analysis To Explore Tourists Consumer Focus From Online Reviews - Taking Mausoleum Of The First Qin
    Tseng, Hsiao-Ting
    Xue, Lirong
    Chen, Ming-Hsien
    PROCEEDINGS OF NINETEENTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, 2020, : 53 - 60
  • [33] Text Mining Based Approach for Customer Sentiment and Product Competitiveness Using Composite Online Review Data
    Wen, Zhanming
    Chen, Yanjun
    Liu, Hongwei
    Liang, Zhouyang
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2024, 19 (03): : 1776 - 1792
  • [34] Multimodal Sentiment Analysis of Online Product Information Based on Text Mining Under the Influence of Social Media
    Zeng, Xiao
    Zhong, Ziqi
    JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2022, 34 (08)
  • [35] Sentiment Analysis and Text Summarization of Online Reviews: A Survey
    Gupta, Pankaj
    Tiwari, Ritu
    Robert, Nirmal
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 241 - 245
  • [36] Semisupervised sentiment analysis method for online text reviews
    Lee, Gyeong Taek
    Kim, Chang Ouk
    Song, Min
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (03) : 387 - 403
  • [37] Social Media Analysis of User's Responses to Terrorism Using Sentiment Analysis and Text Mining
    Mansour, Samah
    CYBER PHYSICAL SYSTEMS AND DEEP LEARNING, 2018, 140 : 95 - 103
  • [38] Sentiment Detection from Bangla Text using Contextual Valency Analysis
    Hasan, K. M. Azharul
    Rahman, Mosiur
    Badiuzzaman
    2014 17TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2014, : 292 - 295
  • [39] A text-mining based cyber-risk assessment and mitigation framework for critical analysis of online hacker forums
    Biswas, Baidyanath
    Mukhopadhyay, Arunabha
    Bhattacharjee, Sudip
    Kumar, Ajay
    Delen, Dursun
    DECISION SUPPORT SYSTEMS, 2020, 152
  • [40] Opinion Mining and Sentiment Analysis on Online Customer Review
    Kumar, Santhosh K. L.
    Desai, Jayanti
    Majumdar, Jharna
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH, 2016, : 552 - 555