A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

被引:4
作者
Alqarafi, Abdulrahman [1 ,2 ]
Adeel, Ahsan [1 ]
Hawalah, Ahmed [2 ]
Swingler, Kevin [1 ]
Hussain, Amir [1 ]
机构
[1] Univ Stirling, Dept Comp Sci & Math, CogBID Lab, Stirling FK9 4LA, Scotland
[2] Univ Taibah, Medina, Saudi Arabia
来源
ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018 | 2018年 / 10989卷
关键词
Sentiment analysis; Saudi dialect; Word embedding;
D O I
10.1007/978-3-030-00563-4_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the literature, limited work has been conducted to develop sentiment resources for Saudi dialect. The lack of resources such as dialectical lexicons and corpora are some of the major bottlenecks to the successful development of Arabic sentiment analysis models. In this paper, a semi-supervised approach is presented to construct an annotated sentiment corpus for Saudi dialect using Twitter. The presented approach is primarily based on a list of lexicons built by using word embedding techniques such as word2vec. A huge corpus extracted from twitter is annotated and manually reviewed to exclude incorrect annotated tweets which is publicly available. For corpus validation, state-of-the-art classification algorithms (such as Logistic Regression, Support Vector Machine, and Naive Bayes) are applied and evaluated. Simulation results demonstrate that the Naive Bayes algorithm outperformed all other approaches and achieved accuracy up to 91%.
引用
收藏
页码:589 / 596
页数:8
相关论文
共 50 条
[41]   Multi-Objective Model Selection (MOMS)-based Semi-Supervised Framework for Sentiment Analysis [J].
Farhan Hassan Khan ;
Usman Qamar ;
Saba Bashir .
Cognitive Computation, 2016, 8 :614-628
[42]   AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets [J].
Al-Twairesh, Nora ;
Al-Khalifa, Hend ;
Al-Salman, AbdulMalik ;
Al-Ohali, Yousef .
ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 :63-72
[43]   Multi-Objective Model Selection (MOMS)-based Semi-Supervised Framework for Sentiment Analysis [J].
Khan, Farhan Hassan ;
Qamar, Usman ;
Bashir, Saba .
COGNITIVE COMPUTATION, 2016, 8 (04) :614-628
[44]   Feature selection for semi-supervised sentiment analysis of e-commerce reviews using CNN and neutrosophic fuzzy parameters [J].
Jena, Alok Kumar ;
Gopal, K. Murali ;
Tripathy, Abinash ;
Tripathy, Siba Prasada .
KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (08) :6645-6687
[45]   AraCust: a Saudi Telecom Tweets corpus for sentiment analysis [J].
Almuqren, Latifah ;
Cristea, Alexandra .
PEERJ COMPUTER SCIENCE, 2021,
[46]   AraCust: a Saudi Telecom Tweets corpus for sentiment analysis [J].
Almuqren L. ;
Cristea A. .
PeerJ Computer Science, 2021, 7 :1-30
[47]   Sentiment Analysis of Twitter Corpus Related to Artificial Intelligence Assistants [J].
Park, Chae Won ;
Seo, Dae Ryong .
2018 5TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS (ICIEA), 2018, :495-498
[48]   Assessment of Iterative Semi-Supervised Feature Selection Learning for Sentiment Analyses:Digital Currency Markets [J].
Akba, Firat ;
Medeni, Ihsan Tolga ;
Guzel, Mehmet Serdar ;
Askerzade, Iman .
2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, :459-463
[49]   Sentiment analysis via semi-supervised learning: a model based on dynamic threshold and multi-classifiers [J].
Han, Yue ;
Liu, Yuhong ;
Jin, Zhigang .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09) :5117-5129
[50]   SEML: A Semi-Supervised Multi-Task Learning Framework for Aspect-Based Sentiment Analysis [J].
Li, Ning ;
Chow, Chi-Yin ;
Zhang, Jia-Dong .
IEEE ACCESS, 2020, 8 :189287-189297