SyntaPulse: An unsupervised framework for sentiment annotation and semantic topic extraction

被引:0
作者
Bashiri, Hadis [1 ]
Naderi, Hassan [1 ]
机构
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
关键词
Sentiment classification; Machine learning; Natural language processing; Lexicon-based method; Semantic topics;
D O I
10.1016/j.patcog.2025.111593
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is a critical area within natural language processing, with applications in various domains like marketing, social media analytics, and politics. However, current methods encounter challenges in handling contextual ambiguities, accurately detecting sarcasm and irony, and effectively processing domain-specific vocabulary without extensive labeled datasets. Addressing these issues is essential, as the nuanced nature of language can lead to diverse interpretations across contexts, complicating reliable sentiment analysis. Furthermore, sarcasm and irony remain difficult to identify precisely, while reliance on labeled data and limitations in handling domain-specific vocabulary restrict adaptability across different fields. This paper presents SyntaPulse, a novel framework for sentiment classification in social networks, developed to overcome these challenges. The framework combines an innovative dictionary-based approach with Probabilistic Syntactic Latent Semantic Analysis (PSLSA) for semantic topic extraction. This integration enables it to handle homographs effectively, thereby enhancing sarcasm detection, facilitating the interpretation of domain-specific vocabulary, and reducing dependency on labeled data. Evaluated on 12 datasets, our framework demonstrates adaptability across various domains and achieves high Macro-F1 scores, ranging from 72.89 % to 96.22 %. SyntaPulse has also obtained improvements on seven datasets, with the lowest improvement rate being 0.21 % and the highest reaching 2.97 %.
引用
收藏
页数:14
相关论文
共 40 条
  • [1] FC-Kmeans: Fixed-centered K-means algorithm
    Ay, Merhad
    Ozbakir, Lale
    Kulluk, Sinem
    Gulmez, Burak
    Ozturk, Guney
    Ozer, Sertay
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
  • [2] Comprehensive review and comparative analysis of transformer models in sentiment analysis
    Bashiri, Hadis
    Naderi, Hassan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (12) : 7305 - 7361
  • [3] Probabilistic temporal semantic graph: a holistic framework for event detection in twitter
    Bashiri, Hadis
    Naderi, Hassan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (12) : 7581 - 7607
  • [4] LexiSNTAGMM: an unsupervised framework for sentiment classification in data from distinct domains, synergistically integrating dictionary-based and machine learning approaches
    Bashiri, Hadis
    Naderi, Hassan
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [5] ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis
    Basiri, Mohammad Ehsan
    Nemati, Shahla
    Abdar, Moloud
    Cambria, Erik
    Acharya, U. Rajendra
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 : 279 - 294
  • [6] Using VADER sentiment and SVM for predicting customer response sentiment
    Borg, Anton
    Boldt, Martin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
  • [7] Joint multimodal sentiment analysis based on information relevance
    Chen, Danlei
    Su, Wang
    Wu, Peng
    Hua, Bolin
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
  • [8] Chen YC, 2016, CONF TECHNOL APPL, P78, DOI 10.1109/TAAI.2016.7880184
  • [9] Cheng W., 2006, Int. J. Corpus Linguist., V11, P411, DOI DOI 10.1075/IJCL.11.4.04CHE
  • [10] Emerging Trends Word2Vec
    Church, Kenneth Ward
    [J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) : 155 - 162