Cross-domain aspect extraction for sentiment analysis: A transductive learning approach

Cited by: 39
Authors
Marcacini, Ricardo Marcondes [2]
Rossi, Rafael Geraldeli [2]
Matsuno, Ivone Penque [1]
Rezende, Solange Oliveira [1]
Affiliations
[1] Univ Sao Paulo, Inst Math & Comp Sci ICMC, Av Trabalhador Sao Carlense 400, BR-13566590 Sao Carlos, SP, Brazil
[2] Fed Univ Mato Grosso do Sul UFMS, Av Ranulpho Marques Leal 3484, BR-79613000 Tres Lagoas, MS, Brazil
Funding
São Paulo Research Foundation (FAPESP)
Keywords
Cross domain; Opinion mining; Aspect extraction; Feature selection; Regularization; Framework; Networks
DOI
10.1016/j.dss.2018.08.009
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Aspect-Based Sentiment Analysis (ABSA) is a promising approach to analyzing consumer reviews at a high level of detail, where the opinion about each feature of the product or service is considered. ABSA usually relies on supervised inductive learning algorithms, which require intense human effort for the labeling process. In this paper, we investigate Cross-Domain Transfer Learning approaches, in which aspects already labeled in some domains can be used to support aspect extraction in another domain where there are no labeled aspects. Existing cross-domain transfer learning approaches learn classifiers from labeled aspects in the source domain and then apply these classifiers in the target domain, i.e., two separate stages that may cause inconsistency due to different feature spaces. To overcome this drawback, we present an innovative approach called CD-ALPHN (Cross-Domain Aspect Label Propagation through Heterogeneous Networks). First, we propose a heterogeneous network-based representation that combines different features (labeled aspects, unlabeled aspects, and linguistic features) from the source and target domains as nodes in a single network. Second, we propose a label propagation algorithm for aspect extraction from heterogeneous networks, where the linguistic features are used as a bridge for this propagation. Our algorithm is based on a transductive learning process, where we explore both labeled and unlabeled aspects during the label propagation. Experimental results show that CD-ALPHN outperforms the state-of-the-art methods in scenarios where there is a high level of inconsistency between the source and target domains, the most common scenario in real-world applications.
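The idea of propagating labels through shared linguistic features can be illustrated with a minimal sketch. This is not the CD-ALPHN algorithm itself, whose details the abstract does not give; it is a generic label propagation over a bipartite term-feature network, assuming a simple averaging update and clamped source labels. The function name `propagate_labels`, the feature names (`dep:nsubj`, `pos:NN`, `dep:advmod`), and the toy terms are all hypothetical.

```python
from collections import defaultdict


def propagate_labels(edges, seed_labels, n_iter=50):
    """Generic bipartite label propagation (illustrative only, not CD-ALPHN).

    edges: iterable of (term, feature) pairs from source and target domains.
    seed_labels: term -> 1.0 (aspect) or 0.0 (non-aspect) for labeled source terms.
    Returns a dict mapping every term to a score in [0, 1].
    """
    term_to_feats = defaultdict(set)
    feat_to_terms = defaultdict(set)
    for term, feat in edges:
        term_to_feats[term].add(feat)
        feat_to_terms[feat].add(term)

    # Unlabeled (target) terms start at a neutral 0.5; labeled terms keep their seed.
    term_score = {t: seed_labels.get(t, 0.5) for t in term_to_feats}
    feat_score = {}

    for _ in range(n_iter):
        # Step 1: each linguistic feature takes the mean score of its terms.
        for feat, terms in feat_to_terms.items():
            feat_score[feat] = sum(term_score[t] for t in terms) / len(terms)
        # Step 2: each unlabeled term takes the mean score of its features.
        for term, feats in term_to_feats.items():
            if term in seed_labels:
                continue  # labeled source terms stay clamped to their seed
            term_score[term] = sum(feat_score[f] for f in feats) / len(feats)
    return term_score


# Toy network: "battery" and "yesterday" are labeled source-domain terms;
# "pizza" and "today" are unlabeled target-domain terms (all hypothetical).
edges = [
    ("battery", "dep:nsubj"), ("battery", "pos:NN"),
    ("yesterday", "dep:advmod"),
    ("pizza", "dep:nsubj"), ("pizza", "pos:NN"),
    ("today", "dep:advmod"),
]
seeds = {"battery": 1.0, "yesterday": 0.0}
scores = propagate_labels(edges, seeds)
```

In this toy network the target-domain term "pizza" shares features only with a labeled aspect, so its score rises toward 1, while "today" shares a feature only with a labeled non-aspect and falls toward 0. The features act as the bridge between domains, and unlabeled terms take part in every iteration, which is what makes the process transductive.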
Pages: 70-80
Page count: 11