Sentiment Analysis of Noisy Malay Text: State of Art, Challenges and Future Work

被引:8
|
作者
Abu Bakar, Muhammad Fakhrur Razi [1 ]
Idris, Norisma [1 ]
Shuib, Liyana [2 ]
Khamis, Norazlina [3 ]
机构
[1] Univ Malaya, Fac Comp Sci & IT, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia
[2] Univ Malaya, Fac Comp Sci & IT, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
[3] Univ Malaysia Sabah, Fac Comp & Informat, Language Engn & Applicat Dev Res Grp, Kota Kinabalu 88400, Sabah, Malaysia
关键词
Hybrid; lexicon-based; machine learning; noisy Malay text; sentiment analysis;
D O I
10.1109/ACCESS.2020.2968955
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis (SA) is a study where people & x2019;s opinions and emotions are automatically extracted in the form of sentiments from the natural language text. In social media monitoring, it is very useful because it allows user to gain an overall picture of the extensive public opinion behind many topics. Most works on SA are for the English text. Only a few works focus on the Malay language. Currently, a review on SA for the Malay language only focus on the SA approaches and the dataset. Some major issues such as the pre-processing techniques used to normalize the noisy text, the most employed performance measures for Malay SA, and the challenges for Malay SA has not been reviewed. Malaysians tend not to fully follow any abbreviations rules when writing on social media. Thus, a lot of noisy text can be found in social media sites like Facebook and Twitter which create some issues to SA process. Hence, the aim of this study is to investigate the state of the art, challenges and future works of SA for Malay social media text. This study provides a review on various approaches, datasets, performance measures, and pre-processing techniques used in the previous works on SA of the Malay text. More than 700 articles from journals and conference proceedings have been identified using the search keywords, however, only 17 relevant articles published from year 2013 to 2018 were reviewed. The findings from this review focus on three commonly used SA approaches which are lexicon-based, machine learning, and hybrid.
引用
收藏
页码:24687 / 24696
页数:10
相关论文
共 50 条
  • [1] Sentiment Analysis of Malay Social Media Text
    Chekima, Khalifa
    Alfred, Rayner
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 205 - 219
  • [2] Rule-Based Model for Malay Text Sentiment Analysis
    Chekima, Khalifa
    Alfred, Rayner
    Chin, Kim On
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 172 - 185
  • [3] Anonymisation Models for Text Data: State of the Art, Challenges and Future Directions
    Lison, Pierre
    Pilan, Ildiko
    Sanchez, David
    Batet, Montserrat
    Ovrelid, Lilja
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4188 - 4203
  • [4] Main Concepts, State of the Art and Future Research Questions in Sentiment Analysis
    Appel, Orestes
    Chiclana, Francisco
    Carter, Jenny
    ACTA POLYTECHNICA HUNGARICA, 2015, 12 (03) : 87 - 108
  • [6] Graph-Based Text Representation and Matching: A Review of the State of the Art and Future Challenges
    Osman, Ahmed Hamza
    Barukub, Omar Mohammed
    IEEE ACCESS, 2020, 8 : 87562 - 87583
  • [7] Facial Sentiment Analysis Using AI Techniques: State-of-the-Art, Taxonomies, and Challenges
    Patel, Keyur
    Mehta, Dev
    Mistry, Chinmay
    Gupta, Rajesh
    Tanwar, Sudeep
    Kumar, Neeraj
    Alazab, Mamoun
    IEEE ACCESS, 2020, 8 : 90495 - 90519
  • [8] A Review on Arabic Sentiment Analysis: State-of-the-Art, Taxonomy and Open Research Challenges
    Abo, Mohamed Elhag Mohamed
    Raj, Ram Gopal
    Qazi, Atika
    IEEE ACCESS, 2019, 7 : 162008 - 162024
  • [9] An Enhancement of Malay Social Media Text Normalization for Lexicon-Based Sentiment Analysis
    Abu Bakar, Muhammad Fakhrur Razi
    Idris, Norisma
    Shuib, Liyana
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 211 - 215
  • [10] A survey of blockchain consensus safety and security: State-of-the-art, challenges, and future work?
    Bao, Qihao
    Li, Bixin
    Hu, Tianyuan
    Sun, Xueyong
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 196