A two-fold rule-based model for aspect extraction

被引:74
作者
Rana, Toqir A. [1 ]
Cheah, Yu-N [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, Usm Penang, Malaysia
关键词
Aspect-based sentiment analysis; Opinion mining; Aspect extraction; Explicit aspects; Sequential pattern-based rules; Aspect pruning; PRODUCT FEATURE-EXTRACTION; SENTIMENT ANALYSIS; LDA; FEATURES; DOMAIN; WORDS;
D O I
10.1016/j.eswa.2017.07.047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Opinion target extraction or aspect extraction is the most important subtask of the aspect-based sentiment analysis. This task focuses on the identification of the targets of user's opinions or sentiments from online reviews. In the recent years, syntactic patterns-based approaches have performed quite well and produced significant improvement in the aspect extraction task. However, these approaches are heavily dependent on the dependency parsers which produced syntactic relations following the grammatical rules and language constraints. In contemporary, users do not give much importance to these rules and constraints while expressing their opinions about particular product and neither reviewer websites restrict users to do so. This makes syntactic patterns-based approaches vulnerable. Therefore, in this paper, we are proposing a two-fold rules-based model (TF-RBM) which uses rules defined on the basis of sequential patterns mined from customer reviews. The first fold extracts aspects associated with domain independent opinions and the second fold extracts aspects associated with domain dependent opinions. We have also applied frequency- and similarity-based approaches to improve the aspect extraction accuracy of the proposed model. Our experimental evaluation has shown better results as compared with the state-of-the-art and most recent approaches. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:273 / 285
页数:13
相关论文
共 82 条
[1]  
[Anonymous], 1994, P INT C VERY LARGE D
[2]  
[Anonymous], 2010, Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid
[3]  
[Anonymous], 2012, PROC AAAI C ARTIF IN
[4]  
[Anonymous], 2014, ARXIV14041982
[5]  
[Anonymous], 2013, IJCAI
[6]  
[Anonymous], 1998, WordNet, DOI DOI 10.7551/MITPRESS/7287.001.0001
[7]  
[Anonymous], 2007, P 2007 JOINT C EMP M
[8]  
[Anonymous], 2009, P 2009 C EMPIRICAL M, DOI 10.3115/1699648.1699700
[9]  
Bagheri Ayoub, 2013, Natural Language Processing and Information Systems. 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013. Proceedings: LNCS 7934, P140, DOI 10.1007/978-3-642-38824-8_12
[10]   ADM-LDA: An aspect detection model based on topic modelling using the structure of review sentences [J].
Bagheri, Ayoub ;
Saraee, Mohamad ;
de Jong, Franciska .
JOURNAL OF INFORMATION SCIENCE, 2014, 40 (05) :621-636