A Semantic Relatedness Approach to Classifying Opinion from Web Reviews

被引:0
作者
Balahur, Alexandra [1 ]
Montoyo, Andres [1 ]
机构
[1] Univ Alicante, DLSI, Ap Correos 99, Alicante 03080, Spain
来源
PROCESAMIENTO DEL LENGUAJE NATURAL | 2009年 / 42期
关键词
Opinion mining; summarization; Normalized Google Distance; SVM machine learning;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Recent years have marked the beginning and rapid expansion of the social web, where people can freely express their opinion on different "objects", such as products, persons, topics etc. on blogs, forums or e-commerce sites. While the rapid growth of the information volume on the web allowed for better and more informed decisions from users, its expansion led to the need to develop specialized NLP systems that automatically mine the web for opinions (retrieve, extract and classify opinions of a query object). Opinion mining (sentiment analysis) has been proven to be a difficult problem, due to the large semantic variability of free text. In this article, we propose a method to extract, classify and summarize opinions on products from web reviews, based on the prior building of product characteristics taxonomy and on the semantic relatedness given by the Normalized Google Distance and SVM learning. We prove that our approach outperforms the baselines and has a high precision and classification confidence.
引用
收藏
页码:47 / 54
页数:8
相关论文
共 30 条
[1]  
Banerjee S., 2003, P 4 INT C INT TEXT P
[2]  
Chaovalit P, 2005, P HICSS 05 38 HAW IN
[3]  
Cilibrasi D., 2006, IEEE J T KNOWLEDGE D
[4]  
Datar, 2006, AAAI
[5]  
Dave K, 2003, P WWW 03
[6]  
Ding X., 2008, P WSDM 2008
[7]  
Fellbaum C., 1999, WORDNET ELECT LEXICA
[8]  
Ferrandez A., 1999, ANAPHORA RESOLUTION
[9]  
Gamon M., 2005, LECT NOTES COMPUTER
[10]  
Goldberg AB, 2006, HLT NAACL 2006 WORKS