Unsupervised Semantic Approach of Aspect-Based Sentiment Analysis for Large-Scale User Reviews

被引:30
作者
Al-Ghuribi, Sumaia Mohammed [1 ,2 ]
Mohd Noah, Shahrul Azman [1 ]
Tiun, Sabrina [1 ]
机构
[1] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Malaysia
[2] Taiz Univ, Fac Appl Sci, Dept Comp Sci, Taizi, Yemen
关键词
Task analysis; Sentiment analysis; Data mining; Syntactics; Semantics; Feature extraction; Ontologies; Aspect; core terms; aspect extraction; aspect weight; aspect rating; domain-specific lexicon; total review score; real large-scale dataset; ASPECT EXTRACTION; RECOMMENDER; FRAMEWORK; LSTM;
D O I
10.1109/ACCESS.2020.3042312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aspect-based sentiment analysis (ABSA) has recently attracted increasing attention due to its extensive applications. Most of the existing ABSA methods been applied on small-sized labeled datasets. However, real datasets such as the Amazon and TripAdvisor contain a massive number of reviews. Thus, applying these methods on large-scale datasets may produce inefficient results. Furthermore, these existing methods extract huge number of aspects, most of which are not relevant to the domain of interest. But, on other hand, some of the infrequent relevant aspects are excluded during the extraction process. These limitations negatively affect the performance of the ABSA process. This article, therefore, aims to overcome such limitations by proposing an efficient approach that is suitable for real large-scale unlabeled datasets. The proposed approach is a combination of hybridizing a frequency-based approach (word level) and a syntactic-relation based approach (sentence level). It was enhanced further with a semantic similarity-based approach to extract aspects that are relevant to the domain, even terms (related to the aspects) are not frequently mentioned in the reviews. The extracted aspects according to the proposed approach are used to generate a total review sentiment score after estimating the weight and the rating of each extracted aspect mentioned in the review. The assignment of the weight of each extracted aspect is calculated based on a modified TF-IDF weighting scheme and the assignment of the aspect rating is calculated based on a domain-specific lexicon. Effectiveness of the extracted aspects is evaluated against two baselines available from existing literature: fixed aspect and extracted aspects. Evaluation was also performed by using a general lexicon and a domain-specific lexicon. Results in terms of F-measure and accuracy on Amazon and Yelp datasets show that the extracted aspects using the proposed approach with the domain-specific lexicon outperformed all the baselines.
引用
收藏
页码:218592 / 218613
页数:22
相关论文
共 84 条
[1]   Informed recommender: Basing recommendations on consumer product reviews [J].
Aciar, Silvana ;
Zhang, Debbie ;
Simoff, Simeon ;
Debenham, John .
IEEE INTELLIGENT SYSTEMS, 2007, 22 (03) :39-47
[2]   Automatic ontology construction from text: a review from shallow to deep learning trend [J].
Al-Aswadi, Fatima N. ;
Chan, Huah Yong ;
Gan, Keng Hoon .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (06) :3901-3928
[3]  
Al-Ghuribi S. M., 2020, P INT C ADV INT SYST, P204
[4]   Multi-Criteria Review-Based Recommender SystemThe State of the Art [J].
Al-Ghuribi, Sumaia Mohammed ;
Noah, Shahrul Azman Mohd .
IEEE ACCESS, 2019, 7 (169446-169468) :169446-169468
[5]   Sentiment Analysis in Tourism: Capitalizing on Big Data [J].
Alaei, Ali Reza ;
Becken, Susanne ;
Stantic, Bela .
JOURNAL OF TRAVEL RESEARCH, 2019, 58 (02) :175-191
[6]   Transportation sentiment analysis using word embedding and ontology-based topic modeling [J].
Ali, Farman ;
Kwak, Daehan ;
Khan, Pervez ;
El-Sappagh, Shaker ;
Ali, Amjad ;
Ullah, Sana ;
Kim, Kye Hyun ;
Kwak, Kyung-Sup .
KNOWLEDGE-BASED SYSTEMS, 2019, 174 :27-42
[7]   Aspect-based sentiment analysis using smart government review data [J].
Alqaryouti, Omar ;
Siyam, Nur ;
Monem, Azza Abdel ;
Shaalan, Khaled .
APPLIED COMPUTING AND INFORMATICS, 2024, 20 (1/2) :142-161
[8]   Term weighting scheme for short-text classification: Twitter corpuses [J].
Alsmadi, Issa ;
Hoon, Gan Keng .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08) :3819-3831
[9]   Semi-supervised Aspect Based Sentiment Analysis for Movies using Review Filtering [J].
Anand, Deepa ;
Naorem, Deepan .
PROCEEDING OF THE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2015), 2016, 84 :86-93
[10]  
[Anonymous], INF PROCESS MANAGE