Using text mining and sentiment analysis for online forums hotspot detection and forecast

被引:292
|
作者
Li, Nan [2 ]
Wu, Desheng Dash [1 ,3 ]
机构
[1] Univ Toronto, RiskLab, Toronto, ON M5S 1A1, Canada
[2] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[3] Reykjavik Univ, Reykjavik, Iceland
关键词
Text mining; Sentiment analysis; Cluster analysis; Online sports forums; Dynamic interacting network analysis; Hotspot detection; Machine learning; Support vector machine; SEQUENCE MOTIFS; CLASSIFICATION;
D O I
10.1016/j.dss.2009.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text sentiment analysis, also referred to as emotional polarity computation, has become a flourishing frontier in the text mining community. This paper studies online forums hotspot detection and forecast using sentiment analysis and text mining approaches. First, we create an algorithm to automatically analyze the emotional polarity of a text and to obtain a value for each piece of text. Second, this algorithm is combined with K-means clustering and support vector machine (SVM) to develop unsupervised text mining approach. We use the proposed text mining approach to group the forums into various clusters. with the center of each representing a hotspot forum within the current time span. The data sets used in our empirical studies are acquired and formatted from Sina sports forums, which spans a range of 31 different topic forums and 220,053 posts. Experimental results demonstrate that SVM forecasting achieves highly consistent results with K-means clustering. The top 10 hotspot forums listed by SVM forecasting resembles 80% of K-means clustering results. Both SVM and K-means achieve the same results for the top 4 hotspot forums of the year. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:354 / 368
页数:15
相关论文
共 50 条
  • [21] Topic Discovery and Hotspot Analysis of Sentiment Analysis of Chinese Text Using Information-Theoretic Method
    Zhang, Changlu
    Fan, Haojie
    Zhang, Jian
    Yang, Qiong
    Tang, Liqian
    ENTROPY, 2023, 25 (06)
  • [22] Anti-Islamic Arabic Text Categorization using Text Mining and Sentiment Analysis Techniques
    Alraddadi, Rawan Abdullah
    Ghembaza, Moulay Ibrahim El-Khalil
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 776 - 785
  • [23] Multimodal Sentiment Analysis using Audio and Text for Crime Detection
    Boukabous, Mohammed
    Azizi, Mostafa
    2022 2ND INTERNATIONAL CONFERENCE ON INNOVATIVE RESEARCH IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY (IRASET'2022), 2022, : 803 - 807
  • [24] Using social network and semantic analysis to analyze online travel forums and forecast tourism demand
    Colladon, Andrea Fronzetti
    Guardabascio, Barbara
    Innarella, Rosy
    DECISION SUPPORT SYSTEMS, 2019, 123
  • [25] Sentiment analysis on massive open online course evaluations: A text mining and deep learning approach
    Onan, Aytug
    COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2021, 29 (03) : 572 - 589
  • [26] Detection and Analysis of Medical Misbehavior in Online Forums
    Bigeard, Elise
    Grabar, Natalia
    2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 7 - 12
  • [27] COMPETITIVE ANALYSIS OF ONLINE REVIEWS USING EXPLORATORY TEXT MINING
    Amadio, William J.
    Procaccino, J. Drew
    TOURISM AND HOSPITALITY MANAGEMENT-CROATIA, 2016, 22 (02): : 193 - 210
  • [28] Text mining based sentiment analysis using a novel deep learning approach
    Abdullaha, Enas Fadhil
    Alasadib, Suad A.
    Al-Jodac, Alyaa Abdulhussein
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2021, 12 : 595 - 604
  • [29] Detecting Suspicious Discussion on Online Forums Using Data Mining
    Rasheed, Haroon Ur
    Khan, Farhan Hassan
    Bashir, Saba
    Fatima, Irsa
    INTELLIGENT TECHNOLOGIES AND APPLICATIONS, INTAP 2018, 2019, 932 : 262 - 273
  • [30] Opinion Mining and Sentiment Analysis Need Text Understanding
    Delmonte, Rodolfo
    Pallotta, Vincenzo
    ADVANCES IN DISTRIBUTED AGENT-BASED RETRIEVAL TOOLS, 2011, 361 : 81 - +