SyntaPulse: An unsupervised framework for sentiment annotation and semantic topic extraction

被引：0

作者：

Bashiri, Hadis ^{[1
]}

Naderi, Hassan ^{[1
]}

机构：

[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran

来源：

PATTERN RECOGNITION | 2025年 / 164卷

关键词：

Sentiment classification; Machine learning; Natural language processing; Lexicon-based method; Semantic topics;

D O I：

10.1016/j.patcog.2025.111593

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Sentiment analysis is a critical area within natural language processing, with applications in various domains like marketing, social media analytics, and politics. However, current methods encounter challenges in handling contextual ambiguities, accurately detecting sarcasm and irony, and effectively processing domain-specific vocabulary without extensive labeled datasets. Addressing these issues is essential, as the nuanced nature of language can lead to diverse interpretations across contexts, complicating reliable sentiment analysis. Furthermore, sarcasm and irony remain difficult to identify precisely, while reliance on labeled data and limitations in handling domain-specific vocabulary restrict adaptability across different fields. This paper presents SyntaPulse, a novel framework for sentiment classification in social networks, developed to overcome these challenges. The framework combines an innovative dictionary-based approach with Probabilistic Syntactic Latent Semantic Analysis (PSLSA) for semantic topic extraction. This integration enables it to handle homographs effectively, thereby enhancing sarcasm detection, facilitating the interpretation of domain-specific vocabulary, and reducing dependency on labeled data. Evaluated on 12 datasets, our framework demonstrates adaptability across various domains and achieves high Macro-F1 scores, ranging from 72.89 % to 96.22 %. SyntaPulse has also obtained improvements on seven datasets, with the lowest improvement rate being 0.21 % and the highest reaching 2.97 %.

引用

页数：14

共 40 条

[1] FC-Kmeans: Fixed-centered K-means algorithm
Ay, Merhad
Ozbakir, Lale
Kulluk, Sinem
Gulmez, Burak
Ozturk, Guney
Ozer, Sertay
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
[2] Comprehensive review and comparative analysis of transformer models in sentiment analysis
Bashiri, Hadis
Naderi, Hassan
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (12) : 7305 - 7361
[3] Probabilistic temporal semantic graph: a holistic framework for event detection in twitter
Bashiri, Hadis
Naderi, Hassan
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (12) : 7581 - 7607
[4] LexiSNTAGMM: an unsupervised framework for sentiment classification in data from distinct domains, synergistically integrating dictionary-based and machine learning approaches
Bashiri, Hadis
Naderi, Hassan
[J]. SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
[5] ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis
Basiri, Mohammad Ehsan
Nemati, Shahla
Abdar, Moloud
Cambria, Erik
Acharya, U. Rajendra
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 : 279 - 294
[6] Using VADER sentiment and SVM for predicting customer response sentiment
Borg, Anton
Boldt, Martin
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
[7] Joint multimodal sentiment analysis based on information relevance
Chen, Danlei
Su, Wang
Wu, Peng
Hua, Bolin
[J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
[8] Chen YC, 2016, CONF TECHNOL APPL, P78, DOI 10.1109/TAAI.2016.7880184
[9] Cheng W., 2006, Int. J. Corpus Linguist., V11, P411, DOI DOI 10.1075/IJCL.11.4.04CHE
[10] Emerging Trends Word2Vec
Church, Kenneth Ward
[J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) : 155 - 162

← 1 2 3 4 →