A Weakly Supervised WordNet-Guided Deep Learning Approach to Extracting Aspect Terms from Online Reviews

Cited by: 9
Authors
Tao, Jie [1]
Zhou, Lina [2]
Affiliations
[1] Fairfield Univ, Charles F Dolan Sch Business, 1073 N Benson Rd, Fairfield, CT 06824 USA
[2] Univ North Carolina Charlotte, Belk Coll Business, 9201 Univ City Blvd, Charlotte, NC 28223 USA
Keywords
Aspect term extraction; continuous-space language model; deep learning; semantic knowledge; text analytics; sentiment analysis; feature-selection; product features; decision-support; classification; similarity; text
DOI
10.1145/3399630
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The unstructured nature of online reviews makes them inefficient and inconvenient for prospective consumers to research and use in support of purchase decision making. The aspects of products provide a fine-grained, meaningful perspective for understanding and organizing review texts. Traditional aspect term extraction approaches rely on discrete language models that treat words in isolation. Although continuous-space language models have demonstrated promise in addressing a wide range of problems, their application to aspect term extraction faces significant challenges. For instance, existing continuous-space language models typically require large collections of labeled data, which remain difficult to obtain in many domains. More importantly, previous methods are largely data driven but overlook the role of human knowledge in guiding model development. To address these limitations, this study designs and develops a weakly supervised, WordNet-guided deep learning approach to aspect term extraction. The approach draws on deep-level semantic information from WordNet to guide not only the selection of representative seed terms but also the pruning of aspect candidate terms. The weak supervision is provided by a very small set of labeled data. We conduct a comprehensive evaluation of the proposed method using both direct and indirect methods. The evaluation results with Yelp restaurant reviews demonstrate that the proposed method consistently outperforms all baseline methods, including discrete models and state-of-the-art continuous-space language models for aspect term extraction, across both direct and indirect evaluations. The findings have broad research, technical, and practical implications for various stakeholders of online reviews.
Pages: 22