A survey on sentiment analysis and its applications

被引:13
作者
Al-Qablan, Tamara Amjad [1 ]
Noor, Mohd Halim Mohd [1 ]
Al-Betar, Mohammed Azmi [2 ,3 ]
Khader, Ahamad Tajudin [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town 11800, Malaysia
[2] Ajman Univ, Coll Engn & Informat Technol, Artificial Intelligence Res Ctr AIRC, 346, Ajman, U Arab Emirates
[3] Al Balqa Appl Univ, Al Huson Univ Coll, Dept Informat Technol, 50, Irbid, Jordan
基金
英国科研创新办公室;
关键词
Sentiment analysis; Feature selection; Deep learning; Machine learning; Optimization; LEXICON-BASED APPROACH; FEATURE-SELECTION; LEARNING APPROACH; SPEECH EMOTION; NEURAL-NETWORK; SOCIAL MEDIA; TWITTER; ENSEMBLE; MODEL; ALGORITHMS;
D O I
10.1007/s00521-023-08941-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analyzing and understanding the sentiments of social media documents on Twitter, Facebook, and Instagram has become a very important task at present. Analyzing the sentiment of these documents gives meaningful knowledge about the user opinions, which will help understand the overall view on these platforms. The problem of sentiment analysis (SA) can be regarded as a classification problem in which the text is classified as positive, negative, or neutral. This paper aims to give an intensive, but not exhaustive, review of the main concepts of SA and the state-of-the-art techniques; other aims are to make a comparative study of their performances, the main applications of SA as well as the limitations and the future directions for SA. Based on our analysis, researchers have utilized three main approaches for SA, namely lexicon/rules, machine learning (ML), and deep learning (DL). The performance of lexicon/rules-based models typically falls within the range of 55-85%. ML models, on the other hand, generally exhibit performance ranging from 55% to 90%, while DL models tend to achieve higher performance, ranging from 70% to 95%. These ranges are estimated and may be higher or lower depending on various factors, including the quality of the datasets, the chosen model architecture, the preprocessing techniques employed, as well as the quality and coverage of the lexicon utilized. Moreover, to further enhance models' performance, researchers have delved into the implementation of hybrid models and optimization techniques which have demonstrated an ability to enhance the overall performance of SA models.
引用
收藏
页码:21567 / 21601
页数:35
相关论文
共 270 条
[1]  
Adarsh M. J., 2019, 2019 1st International Conference on Advances in Information Technology (ICAIT). Proceedings, P94, DOI 10.1109/ICAIT47043.2019.8987393
[2]   Big data applications in operations/supply-chain management: A literature review [J].
Addo-Tenkorang, Richard ;
Helo, Petri T. .
COMPUTERS & INDUSTRIAL ENGINEERING, 2016, 101 :528-543
[3]  
Ahmad SR, 2019, INT J ADV COMPUT SC, V10, P240
[4]   A systematic survey on multimodal emotion recognition using learning algorithms [J].
Ahmed, Naveed ;
Al Aghbari, Zaher ;
Girija, Shini .
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 17
[5]  
Ahuja Ravinder, 2019, Procedia Computer Science, V152, P341, DOI [10.1016/j.procs.2019.05.008, 10.1016/j.procs.2019.05.008]
[6]  
Akhtar MS, 2019, ARXIV
[7]   Enhanced Video Analytics for Sentiment Analysis Based on Fusing Textual, Auditory and Visual Information [J].
Al-Azani, Sadam ;
El-Alfy, El-Sayed M. .
IEEE ACCESS, 2020, 8 :136843-136857
[8]  
Al-Moslmi T, 2017, Journal of Engineering and Applied Sciences, V12, P5226, DOI 10.3923/jeasci.2017.5226.5232
[9]   The impact of preprocessing steps on the accuracy of machine learning algorithms in sentiment analysis [J].
Alam, Saqib ;
Yao, Nianmin .
COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2019, 25 (03) :319-335
[10]  
Alayba AM, 2018, 2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), P13, DOI 10.1109/ASAR.2018.8480191