Studying the Effects of Text Preprocessing and Ensemble Methods on Sentiment Analysis of Brazilian Portuguese Tweets

被引:3
作者
Gomes, Fernando Barbosa [1 ]
Adan-Coello, Juan Manuel [1 ]
Kintschner, Fernando Ernesto [1 ]
机构
[1] Pontificia Univ Catolica Campinas, Campinas, SP, Brazil
来源
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018 | 2018年 / 11171卷
关键词
D O I
10.1007/978-3-030-00810-9_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The analysis of social media posts can provide useful feedback regarding user experience for people and organizations. This task requires the use of computational tools due to the massive amount of content and the speed at which it is generated. In this article we study the effects of text preprocessing heuristics and ensembles of machine learning algorithms on the accuracy and polarity bias of classifiers when performing sentiment analysis on short text messages. The results of an experimental evaluation performed on a Brazilian Portuguese tweets dataset have shown that these strategies have significant impact on increasing classification accuracy, particularly when the ensembles include a deep neural net, but not always on reducing polarity bias.
引用
收藏
页码:167 / 177
页数:11
相关论文
共 17 条
  • [1] [Anonymous], 2015, P 9 INT WORKSH SEM E
  • [2] [Anonymous], 2006, PATTERN RECOGN
  • [3] [Anonymous], 2018, P 11 INT C LANG RES
  • [4] [Anonymous], 2012, Mining text data
  • [5] [Anonymous], P 6 WORKSH REC ADV R
  • [6] [Anonymous], J HLTH INFORM
  • [7] [Anonymous], 2002, ADV NEURAL INFORM PR
  • [8] [Anonymous], ENCONTRO LINGUISTICA
  • [9] Balage Filho Pedro, 2013, P 9 BRAZ S INF HUM L
  • [10] Performance Evaluation of Sentiment Analysis Methods for Brazilian Portuguese
    Cirqueira, Douglas
    Jacob, Antonio, Jr.
    Lobato, Fabio
    de Santana, Adamo Lima
    Pinheiro, Marcia
    [J]. BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2016, 2017, 263 : 245 - 251