A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

被引:4
作者
Alqarafi, Abdulrahman [1 ,2 ]
Adeel, Ahsan [1 ]
Hawalah, Ahmed [2 ]
Swingler, Kevin [1 ]
Hussain, Amir [1 ]
机构
[1] Univ Stirling, Dept Comp Sci & Math, CogBID Lab, Stirling FK9 4LA, Scotland
[2] Univ Taibah, Medina, Saudi Arabia
来源
ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018 | 2018年 / 10989卷
关键词
Sentiment analysis; Saudi dialect; Word embedding;
D O I
10.1007/978-3-030-00563-4_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the literature, limited work has been conducted to develop sentiment resources for Saudi dialect. The lack of resources such as dialectical lexicons and corpora are some of the major bottlenecks to the successful development of Arabic sentiment analysis models. In this paper, a semi-supervised approach is presented to construct an annotated sentiment corpus for Saudi dialect using Twitter. The presented approach is primarily based on a list of lexicons built by using word embedding techniques such as word2vec. A huge corpus extracted from twitter is annotated and manually reviewed to exclude incorrect annotated tweets which is publicly available. For corpus validation, state-of-the-art classification algorithms (such as Logistic Regression, Support Vector Machine, and Naive Bayes) are applied and evaluated. Simulation results demonstrate that the Naive Bayes algorithm outperformed all other approaches and achieved accuracy up to 91%.
引用
收藏
页码:589 / 596
页数:8
相关论文
共 50 条
[21]   Set-Similarity Joins Based Semi-supervised Sentiment Analysis [J].
Dong, Xishuang ;
Zou, Qibo ;
Guan, Yi .
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 :176-183
[22]   Saudi Stock Market Sentiment Analysis using Twitter Data [J].
Alazba, Amal ;
Alturayeif, Nora ;
Alturaief, Nouf ;
Alhathloul, Zainab .
PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1, 2020, :36-47
[23]   Semi-supervised target-oriented sentiment classification [J].
Xu, Weidi ;
Tan, Ying .
NEUROCOMPUTING, 2019, 337 :120-128
[24]   Leveraging Emotional Consistency for Semi-supervised Sentiment Classification [J].
Minh Luan Nguyen .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 :369-381
[25]   NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis [J].
Muhammad, Shamsuddeen Hassan ;
Adelani, David Ifeoluwa ;
Ruder, Sebastian ;
Ahmad, Ibrahim Sa'id ;
Abdulmumin, Idris ;
Bello, Bello Shehu ;
Choudhury, Monojit ;
Emezue, Chris Chinenye ;
Abdullahi, Saheed Salahudeen ;
Aremu, Anuoluwapo ;
Jorge, Alipio ;
Brazdil, Pavel .
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, :590-602
[26]   Using Multiple Resources in Graph-Based Semi-supervised Sentiment Classification [J].
Xu, Ge ;
Wang, Houfeng .
2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, :132-136
[27]   Multimodal Consistency-Based Teacher for Semi-Supervised Multimodal Sentiment Analysis [J].
Yuan, Ziqi ;
Fang, Jingliang ;
Xu, Hua ;
Gao, Kai .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 :3669-3683
[28]   A Review on Corpus Annotation for Arabic Sentiment Analysis [J].
Almuqren, Latifah ;
Alzammam, Arwa ;
Alotaibi, Shahad ;
Cristea, Alexandra ;
Alhumoud, Sarah .
SOCIAL COMPUTING AND SOCIAL MEDIA: APPLICATIONS AND ANALYTICS, SCSM 2017, PT II, 2017, 10283 :215-225
[29]   Bootstrapping semi-supervised annotation method for potential suicidal messages [J].
Acuna Caicedo, Roberto Wellington ;
Gomez Soriano, Jose Manuel ;
Melgar Sasieta, Hector Andres .
INTERNET INTERVENTIONS-THE APPLICATION OF INFORMATION TECHNOLOGY IN MENTAL AND BEHAVIOURAL HEALTH, 2022, 28
[30]   A Semi-Supervised Topic Model Incorporating Sentiment and Dynamic Characteristic [J].
Zhang, Lanshan ;
Ding, Xi ;
Tian, Ye ;
Gong, Xiangyang ;
Wang, Wendong .
CHINA COMMUNICATIONS, 2016, 13 (12) :162-175