Opinion mining on large scale data using sentiment analysis and k-means clustering

被引:46
作者
Riaz, Sumbal [1 ]
Fatima, Mehvish [1 ]
Kamran, M. [1 ]
Nisar, M. Wasif [1 ]
机构
[1] COMSATS Inst Informat Technol, Dept Comp Sci, Wah Cantt, Pakistan
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2019年 / 22卷 / Suppl 3期
关键词
Heterogeneous data processing; Imbalanced learning; Intelligent computing; CLASSIFICATION; ALGORITHMS; LEXICON; WORDS;
D O I
10.1007/s10586-017-1077-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid growth of web technology and easy access of internet, online shopping has been increased. Now people express their opinions and share their experiences that greatly influence new buyers for purchasing products, thereby generating large data sets. This large data is very helpful for analyzing customer preference, needs and its behavior toward a product. Companies face the challenge of analyzing this sheer amount of data to extract customer opinion. To address this challenge, in this paper, we performed sentiment analysis on the customer review real-world data at phrase level to find out customer preference by analyzing subjective expressions. Then we calculated the strength of sentiment word to find out the intensity of each expression and applied clustering for placing the words in various clusters based on their intensity. We also compared the results of our technique with star-ranking given on the same dataset and found the drastic change in our results. We also provide a visual representation of our results to provide a clear insight of customer preference and behavior to help decision makers for better decision making.
引用
收藏
页码:S7149 / S7164
页数:16
相关论文
共 50 条
  • [41] An Analysis of Burnout among Female Nurse Educators in Saudi Arabia Using K-Means Clustering
    Baghdadi, Nadiah. A. A.
    Alsayed, Shatha Khalid
    Malki, Ghalia Amer
    Balaha, Hossam Magdy
    Abdelaliem, Sally Mohammed Farghaly
    EUROPEAN JOURNAL OF INVESTIGATION IN HEALTH PSYCHOLOGY AND EDUCATION, 2023, 13 (01) : 33 - 53
  • [42] Opinion Mining System for Twitter Sentiment Analysis
    Aquino, Pamella A.
    Lopez, Vivian F.
    Moreno, Maria N.
    Munoz, Maria D.
    Rodriguez, Sara
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 465 - 476
  • [43] Intelligent Opinion Mining and Sentiment Analysis Using Artificial Neural Networks
    Stuart, Keith Douglas
    Majewski, Maciej
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 103 - 110
  • [44] Spherical k-means clustering is good for interpreting multivariate species occurrence data
    Hill, Mark O.
    Harrower, Colin A.
    Preston, Christopher D.
    METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (06): : 542 - 551
  • [45] Stable clustering of offshore downhole data using a combined k-means and Gaussian mixture modelling approach
    Singh, Amrita
    Ojha, Maheswar
    MARINE GEOPHYSICAL RESEARCH, 2022, 43 (03)
  • [46] K-Means Clustering Versus Validation Measures: A Data-Distribution Perspective
    Xiong, Hui
    Wu, Junjie
    Chen, Jian
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2009, 39 (02): : 318 - 331
  • [47] Sem-AI: A Unique Framework for Sentiment Analysis and Opinion Mining Using Social Network Data
    J. Maruthupandi
    S. Sivakumar
    V. Senthil Kumar
    P. Balaji Srikaanth
    SN Computer Science, 6 (2)
  • [48] A modified K-means clustering for mining of multimedia databases based on dimensionality reduction and similarity measures
    Jiang, Xiaoping
    Li, Chenghua
    Sun, Jing
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (01): : 797 - 804
  • [49] Email Sentiment Analysis Through k-Means Labeling and Support Vector Machine Classification
    Liu, Sisi
    Lee, Ickjai
    CYBERNETICS AND SYSTEMS, 2018, 49 (03) : 181 - 199
  • [50] Semi-supervised Text Categorization Using Recursive K-means Clustering
    Gowda, Harsha S.
    Suhil, Mahamad
    Guru, D. S.
    Raju, Lavanya Narayana
    RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 217 - 227