A Semi-Supervised Approach to Sentiment Analysis of Tweets during the 2022 Philippine Presidential Election

Cited by: 11
Authors
Macrohon, Julio Jerison E. [1]
Villavicencio, Charlyn Nayve [1, 2]
Inbaraj, X. Alphonse [1]
Jeng, Jyh-Horng [1]
Affiliations
[1] I Shou Univ, Dept Informat Engn, Kaohsiung 84001, Taiwan
[2] Bulacan State Univ, Coll Informat & Commun Technol, Bulacan 3000, Philippines
Keywords
2022 Philippine Presidential Election; semi-supervised learning; Natural Language Processing; sentiment analysis; Python; social media; Twitter; tweets
DOI
10.3390/info13100484
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the increasing popularity of Twitter as both a social media platform and a data source for companies, decision makers, advertisers, and researchers, the volume of data has grown so large that manual labeling is no longer feasible. This research applies a semi-supervised approach, built on a base classifier, to sentiment analysis of both English and Tagalog tweets. The study focuses on the Philippines, where social media played a central role in the campaigns of both candidates, and uses tweets posted during the widely contested race between the son of the Philippines' former President and Dictator and the outgoing Vice President of the Philippines. Using Natural Language Processing techniques, these tweets were annotated and processed, and a model was trained to classify them into three polarities: positive, neutral, and negative. Self-Training with Multinomial Naive Bayes as the base classifier and 30% unlabeled data yielded an accuracy of 84.83%, which exceeds the accuracy reported by other studies using Twitter data from the Philippines.
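The self-training idea named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the tiny `MultinomialNB` class, the `self_train` helper, the confidence `threshold`, and the toy tweets are all hypothetical and chosen only for illustration. The sketch assumes whitespace tokenization and Laplace smoothing, and it pseudo-labels an unlabeled tweet only when the base classifier's top class probability reaches the threshold.

```python
import math
from collections import Counter


class MultinomialNB:
    """Tiny multinomial Naive Bayes over whitespace tokens (Laplace smoothing)."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for c in self.classes for w in self.word_counts[c]}
        return self

    def predict_proba(self, text):
        total_docs = sum(self.class_counts.values())
        log_scores = {}
        for c in self.classes:
            n_words = sum(self.word_counts[c].values())
            score = math.log(self.class_counts[c] / total_docs)
            for w in text.lower().split():
                if w in self.vocab:  # skip words never seen during training
                    count = self.word_counts[c][w]
                    score += math.log((count + 1) / (n_words + len(self.vocab)))
            log_scores[c] = score
        # Normalize log scores into probabilities in a numerically stable way.
        top = max(log_scores.values())
        exp = {c: math.exp(s - top) for c, s in log_scores.items()}
        z = sum(exp.values())
        return {c: v / z for c, v in exp.items()}

    def predict(self, text):
        probs = self.predict_proba(text)
        return max(probs, key=probs.get)


def self_train(labeled, unlabeled, threshold=0.5, max_rounds=10):
    """Repeatedly pseudo-label confident unlabeled texts and retrain."""
    texts = [t for t, _ in labeled]
    labels = [y for _, y in labeled]
    pool = list(unlabeled)
    pseudo = []  # (text, pseudo_label) pairs added during self-training
    model = MultinomialNB().fit(texts, labels)
    for _ in range(max_rounds):
        confident, remaining = [], []
        for t in pool:
            probs = model.predict_proba(t)
            best = max(probs, key=probs.get)
            (confident if probs[best] >= threshold else remaining).append((t, best))
        if not confident:
            break  # nothing new to learn from
        for t, y in confident:
            texts.append(t)
            labels.append(y)
        pseudo.extend(confident)
        pool = [t for t, _ in remaining]
        model = MultinomialNB().fit(texts, labels)
    return model, pseudo


# Hypothetical toy data; the real study used annotated English and Tagalog tweets.
labeled = [
    ("I love this candidate", "positive"),
    ("great rally today", "positive"),
    ("I hate this campaign", "negative"),
    ("terrible speech", "negative"),
    ("the debate is on Monday", "neutral"),
    ("polls open at seven", "neutral"),
]
unlabeled = ["love the rally", "hate the speech", "debate polls Monday"]

model, pseudo = self_train(labeled, unlabeled, threshold=0.5)
print(model.predict("love great rally"))
```

The threshold trades coverage for label quality: a higher value admits fewer pseudo-labels per round but reduces the chance that early mistakes are reinforced in later rounds.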
Pages: 13
Related papers
50 in total
  • [1] Sentiment and Emotions Analysis of Tweets During the Second Round of 2021 Ecuadorian Presidential Election
    Minango Negrete, Juan Carlos
    Iano, Yuzo
    Minango Negrete, Pablo David
    Vaz, Gabriel Caumo
    de Oliveira, Gabriel Gomes
    PROCEEDINGS OF THE 7TH BRAZILIAN TECHNOLOGY SYMPOSIUM (BTSYM 21): EMERGING TRENDS IN HUMAN SMART AND SUSTAINABLE FUTURE OF CITIES, VOL 1, 2023, 207 : 257 - 268
  • [2] A large-scale sentiment analysis of tweets pertaining to the 2020 US presidential election
    Ali, Rao Hamza
    Pinto, Gabriela
    Lawrie, Evelyn
    Linstead, Erik J.
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [3] Sentiment based Analysis of Tweets during the US Presidential Elections
    Yaqub, Ussama
    Chun, Soon Ae
    Atluri, Vijayalakshmi
    Vaidya, Jaideep
    DG.O 2017: THE PROCEEDINGS OF THE 18TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH: INNOVATIONS AND TRANSFORMATIONS IN GOVERNMENT, 2017, : 1 - 10
  • [4] A hybrid semi-supervised boosting to sentiment analysis
    Tanha, Jafar
    Mahmudyan, Solmaz
    Farahi, Ahmad
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2021, 12 (02): : 1769 - 1784
  • [5] Semi-supervised Multi-view Sentiment Analysis
    Lazarova, Gergana
    Koychev, Ivan
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 181 - 190
  • [6] Semi-supervised distributed representations of documents for sentiment analysis
    Park, Saerom
    Lee, Jaewook
    Kim, Kyoungok
    NEURAL NETWORKS, 2019, 119 : 139 - 150
  • [7] Attention Aware Semi-supervised Framework for Sentiment Analysis
    Liu, Jingshuang
    Rong, Wenge
    Tian, Chuan
    Gao, Min
    Xiong, Zhang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 208 - 215
  • [8] A semi-supervised approach to sentiment analysis using revised sentiment strength based on SentiWordNet
    Khan, Farhan Hassan
    Qamar, Usman
    Bashir, Saba
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 851 - 872