Text Mining: Sentiment Analysis on news classification

被引:0
作者
Gomes, Helder [1 ]
Neto, Miguel de Castro [1 ]
Henriques, Roberto [1 ]
机构
[1] Univ Nova Lisboa, Inst Super Estat & Gestao Informacao, P-1200 Lisbon, Portugal
来源
PROCEEDINGS OF THE 2013 8TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2013) | 2013年
关键词
Sentiment Analysis; Text Mining; Natural Language Processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last few years, due to the emergence of social networks, the interaction between customers and companies has experienced major changes. This change, like others, has advantages but also disadvantages. One of the major disadvantages which arose from this modification is the fact that, currently, organizations have lost control over what customers say about them, since they can easily publish their negative opinions and spread them rapidly. However, some organizations have quickly realized this situation could promote important competitive advantages, through the analysis of what customers say about them in different communication channels. Besides that, the increasing use of internet allowed that a lot of information is available online and an example of it is that, nowadays, the majority of newspapers make their publications daily available, on their websites, on the internet. Therefore, the data volume daily available on the internet grows exponentially and all of the information produced through this data might be important, if treated and used correctly. That is how the challenge of creating knowledge through this information in an automated way, emerges. Thus, the goal of this project is to build a model able to evaluate the polarity (positive, negative or neutral) of economic news headlines, available on RSS Feeds addresses. In order to do that, software SAS was used and, consequently its methodology, whose detailed description is also a goal. In this way, section I introduces the subject for a better contextualization. Section II presents the goals for the project which originated this paper, followed by the state of art in the section III. The section IV portrays the methodology to Knowledge Discovery in Text as well as the methodology used in the creation of Sentiment Analysis model. The section V refers the results achieved with the implementation of this project and, for last, the conclusions are presented in the section VI.
引用
收藏
页数:6
相关论文
共 20 条
  • [1] [Anonymous], 2008, Introduction to information retrieval
  • [2] Aranha C.N., 2007, Uma Abordagem de Pre-Processamento Automatico para Mineracao de Textos em Portugues: Sob o Enfoque da Inteligencia Computacional
  • [3] Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
  • [4] Natural language processing
    Chowdhury, GG
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2003, 37 : 51 - 89
  • [5] Dorre J., 1999, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, P398
  • [6] Feldman R, 1998, P 2 INT C PRACT ASP
  • [7] Gang Li, 2010, Proceedings 2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010), P331, DOI 10.1109/ISKE.2010.5680859
  • [8] Hearst M.A., 1999, P ASS COMPUTATIONAL, P3, DOI DOI 10.3115/1034678.1034679
  • [9] Hotho A., 2005, GLDV Journal for Computational Linguistics and Language Technology
  • [10] Indurkhya N, 2010, CH CRC MACH LEARN PA, pXXI