Semi-supervised topic representation through sentiment analysis and semantic networks

Cited by: 2
Authors
Ortu, Marco [1]
Romano, Maurizio [1]
Carta, Andrea [1]
Affiliations
[1] Univ Cagliari, Dept Business & Econ Sci, Viale Fra Ignazio 17, Cagliari, Italy
Keywords
Semi-supervised clustering; Topic modeling; Natural language processing; Threshold-based naïve Bayes classifier; Community detection
DOI
10.1016/j.bdr.2024.100474
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a novel approach to topic detection aimed at improving the semi-supervised clustering of customer reviews in the context of customers' services. The proposed methodology, named SeMi-supervised clustering for Assessment of Reviews using Topic and Sentiment (SMARTS) for Topic-Community Representation with Semantic Networks, combines semantic and sentiment analysis of words to derive topics related to positive and negative reviews of specific services. To achieve this, a semantic network of words is constructed based on word embedding semantic similarity to identify relationships between words used in the reviews. The resulting network is then used to derive the topics present in users' reviews, which are grouped by positive and negative sentiment based on words related to specific services. Clusters of words, obtained from the network's communities, are used to extract topics related to particular services and to improve the interpretation of users' assessments of those services. The proposed methodology is applied to tourism review data from Booking.com, and the results demonstrate the efficacy of the approach in enhancing the interpretability of the topics obtained by semi-supervised clustering. The methodology has the potential to provide valuable insights into the sentiment of customers toward tourism services, which could be utilized by service providers and decision-makers to enhance the quality of their services.
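The network step described in the abstract (link words by embedding similarity, then read topics off the resulting word groups) can be illustrated with a minimal sketch. It is not the authors' implementation: the three-dimensional vectors, the `0.9` threshold, and the function names are hypothetical, and connected components stand in for the community detection that the paper applies to the network.

```python
from itertools import combinations
from math import sqrt

# Hypothetical toy embeddings; real vectors would come from a trained
# word-embedding model fitted on the review corpus.
EMBEDDINGS = {
    "clean": (0.9, 0.1, 0.0),
    "tidy": (0.8, 0.2, 0.1),
    "spotless": (0.85, 0.15, 0.05),
    "breakfast": (0.1, 0.9, 0.1),
    "coffee": (0.2, 0.8, 0.0),
    "buffet": (0.15, 0.85, 0.2),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def build_network(embeddings, threshold):
    """Connect two words when their cosine similarity reaches the threshold."""
    graph = {word: set() for word in embeddings}
    for a, b in combinations(embeddings, 2):
        if cosine(embeddings[a], embeddings[b]) >= threshold:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def word_clusters(graph):
    """Group words by connected component (a simplified stand-in for
    community detection, assumed here only for illustration)."""
    seen, clusters = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(graph[node] - component)
        seen |= component
        clusters.append(sorted(component))
    return clusters


# The threshold is an assumption; the abstract does not state one.
network = build_network(EMBEDDINGS, threshold=0.9)
clusters = word_clusters(network)
print(clusters)
```

On this toy vocabulary the words split into a cleanliness group and a breakfast group. In the setting the abstract describes, such groups would be built separately for positive and negative reviews, so that each cluster can be read as a topic with a sentiment attached.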
Pages: 13