'Long autonomy or long delay?' The importance of domain in opinion mining

Cited by: 54
Authors
Cruz, Fermin L. [1]
Troyano, Jose A. [1]
Enriquez, Fernando [1]
Javier Ortega, F. [1]
Vallejo, Carlos G. [1]
Affiliations
[1] Univ Seville, Dept Languages & Comp Syst, Seville, Spain
Keywords
Sentiment analysis; Opinion mining; Feature-based opinion extraction; User-generated contents; Information extraction
DOI
10.1016/j.eswa.2012.12.031
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Nowadays, people do not only navigate the web, but they also contribute contents to the Internet. Among other things, they write their thoughts and opinions in review sites, forums, social networks, blogs and other websites. These opinions constitute a valuable resource for businesses, governments and consumers. In the last years, some researchers have proposed opinion extraction systems, mostly domain-independent ones, to automatically extract structured representations of opinions contained in those texts. In this work, we tackle this task in a domain-oriented approach, defining a set of domain-specific resources which capture valuable knowledge about how people express opinions on a given domain. These resources are automatically induced from a set of annotated documents. Some experiments were carried out on three different domains (user-generated reviews of headphones, hotels and cars), comparing our approach to other state-of-the-art, domain-independent techniques. The results confirm the importance of the domain in order to build accurate opinion extraction systems. Some experiments on the influence of the dataset size and an example of aggregation and visualization of the extracted opinions are also shown. (C) 2012 Elsevier Ltd. All rights reserved.
Pages: 3174-3184
Page count: 11