A simple approach to multilingual polarity classification in Twitter

被引:19
作者
Tellez, Eric S. [1 ,3 ]
Miranda-Jimenez, Sabino [1 ,3 ]
Graff, Mario [1 ,3 ]
Moctezuma, Daniela [1 ,2 ]
Suarez, Ranyart R. [4 ]
Siordia, Oscar S. [2 ]
机构
[1] CONACyT Consejo Nacl Ciencia & Tecnol, Direcc Cdaedras, Insurgentes Sur 1582, Ciudad De Mexico 03940, Mexico
[2] Ctr Invest Geog & Geomat Ing Jorge L Tamayo AC, Circuito Tecnopolo Norte 117,Tecnopolo Pocitos 2, Aguascalientes 20313, Mexico
[3] INFOTEC Ctr Invest & Innovac Tecnol Informac & Co, Circuito Tecnopolo Sur 112,Tecnopolo Pocitos 2, Aguascalientes 20313, Mexico
[4] Univ Michoacana de San Nicolas Hidalgo, Fac Ingn Elect, Div Estudios Posgrad, Santiago Tapia 403, Morelia 58000, Michoacan, Mexico
关键词
Multilingual sentiment analysis; Error-robust text representations; Opinion mining; TRANSLATION;
D O I
10.1016/j.patrec.2017.05.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, sentiment analysis has received a lot of attention due to the interest in mining opinions of social media users. Sentiment analysis consists in determining the polarity of a given text, i.e., its degree of positiveness or negativeness. Traditionally, Sentiment Analysis algorithms have been tailored to a specific language given the complexity of having a number of lexical variations and errors introduced by the people generating content. In this contribution, our aim is to provide a simple to implement and easy to use multilingual framework, that can serve as a baseline for sentiment analysis contests, and as a starting point to build new sentiment analysis systems. We compare our approach in eight different languages, three of them correspond to important international contests, namely, SemEval (English), TASS (Spanish), and SENTIPOLC (Italian). Within the competitions, our approach reaches from medium to high positions in the rankings; whereas in the remaining languages our approach outperforms the reported results. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:68 / 74
页数:7
相关论文
共 27 条
  • [1] [Anonymous], 2015, CEUR Workshop Proceedings
  • [2] [Anonymous], 2016, P ACM S APPL COMP 20
  • [3] [Anonymous], 2005, SEARCH METHODOLOGIES: Introductory Tutorials in Optimization and Decision Support Techniques, DOI DOI 10.1007/0-387-28356-0_6
  • [4] [Anonymous], 2014, OVERVIEW EVALITA 201
  • [5] [Anonymous], 2015, P 2015 C N AM CHAPT
  • [6] Bal D., 2011, SENTIMENT ANAL MULTI, P129
  • [7] Comparative experiments using supervised learning and machine translation for multilingual sentiment analysis
    Balahur, Alexandra
    Turchi, Marco
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01) : 56 - 75
  • [8] Balahur Alexandra, 2012, Association for Computational Linguistics, P52
  • [9] Basile P., 2014, P 4 EV CAMP NAT LANG
  • [10] Battiti R., 2008, SPRINGER SCI BUSINES, V45