Experiments in Cross-Lingual Sentiment Analysis in Discussion Forums

被引:0
作者
Ghorbel, Hatem [1 ]
机构
[1] HE Arc Ingn, HES SO, Informat & Commun Syst Lab ISIC, St Imier, Switzerland
来源
SOCIAL INFORMATICS, SOCINFO 2012 | 2012年 / 7710卷
关键词
Cross-Lingual Sentiment Analysis; Machine Translation; Supervised Classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the objectives of sentiment analysis is to classify the polarity of conveyed opinions from the perspective of textual evidence. Most of the work in the field has been intensively applied to the English language and only few experiments have explored other languages. In this paper, we present a supervised classification of posts in French online forums where sentiment analysis is based on shallow linguistic features such as POS tagging, chunking and common negation forms. Furthermore, we incorporate word semantic orientation extracted from the English lexical resource SentiWordNet as an additional feature. Since SentiWordNet is an English resource, lexical entries in the studied French corpus should be translated into English. For this purpose, we propose a number of French to English translation experiments such as machine translation and WordNet synset translation using EuroWordNet. Obtained results show that WordNet synset translation have not significantly improved the classification performance with respect to the bag of words baseline due to the shortage in coverage. Automatic translation haven't either significantly improved the results due to its insufficient quality. Propositions of improving the classification performance are given by the end of the article.
引用
收藏
页码:138 / 151
页数:14
相关论文
共 36 条
  • [1] [Anonymous], 2005, P HUM LANG TECHN C E
  • [2] [Anonymous], 2005, RHETORICAL QUESTIONS, DOI DOI 10.1075/SIDAG.16
  • [3] [Anonymous], P 20 INT C COMPUTATI, DOI DOI 10.3115/1220355.1220555
  • [4] [Anonymous], P REC ADV NAT LANG P
  • [5] [Anonymous], 2006, P HUM LANG TECHN C N
  • [6] [Anonymous], 2001, LINGUISTIC INQUIRY W
  • [7] [Anonymous], 2008, ACM Transactions on Information Systems (TOIS)
  • [8] [Anonymous], 2011, Multilingual Natural Language Processing
  • [9] Balahur Alexandra, 2012, Association for Computational Linguistics, P52
  • [10] Banea Carmen, 2008, P 2008 C EMP METH NA, P127