Sentiment Analysis from Stock Market News in Romanian using Chaos Game Representation

被引:1
作者
Stoean, Catalin [1 ,2 ]
Lichtblau, Daniel [3 ]
机构
[1] Univ Craiova, Fac Sci, Dept Comp Sci, Craiova, Romania
[2] Univ Bucharest, Human Language Technol Res Ctr, Bucharest, Romania
[3] Wolfram Res, 100 Trade Ctr Dr, Champaign, IL 61820 USA
来源
2021 23RD INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2021) | 2021年
关键词
sentiment analysis; text processing; chaos game representation; classification;
D O I
10.1109/SYNASC54541.2021.00050
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A recently proposed methodology for authorship attribution is adapted in the current work for sentiment analysis. Furthermore, it is applied here for a non-English language, i.e. for Romanian. The procedure works at the character level, hence it does not depend on the language, although it is designed only for the languages that use the Latin alphabet. The data set used is taken from financial market news and it contains paragraphs that refer to two particular companies. In order to establish the ground truth for the sentiment scores, the text is translated into English and the sentiment analysis tool VADER is further used. The aim of the methodology is to build a regression model that fits the initial paragraphs with text in Romanian to the scores established by VADER and the results are encouraging.
引用
收藏
页码:252 / 258
页数:7
相关论文
共 17 条
[1]   Deep learning and multilingual sentiment analysis on social media data: An overview [J].
Aguero-Torales, Marvin M. ;
Salas, Jose I. Abreu ;
Lopez-Herrera, Antonio G. .
APPLIED SOFT COMPUTING, 2021, 107 (107)
[2]  
[Anonymous], 2014, Linear Algebra and Matrix Analysis for Statistics
[3]  
Barriere V., COLING, P266
[4]  
Bird S., 2009, Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit, P1
[5]   Multilingual Sentiment Analysis: State of the Art and Independent Comparison of Techniques [J].
Dashtipour, Kia ;
Poria, Soujanya ;
Hussain, Amir ;
Cambria, Erik ;
Hawalah, Ahmad Y. A. ;
Gelbukh, Alexander ;
Zhou, Qiang .
COGNITIVE COMPUTATION, 2016, 8 (04) :757-771
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]  
Hutto C., 2014, Proceedings of the International AAAI Conference on Web Social Media, V8, P216, DOI DOI 10.1609/ICWSM.V8I1.14550
[8]   CHAOS GAME VISUALIZATION OF SEQUENCES [J].
JEFFREY, HJ .
COMPUTERS & GRAPHICS, 1992, 16 (01) :25-33
[9]   CHAOS GAME REPRESENTATION OF GENE STRUCTURE [J].
JEFFREY, HJ .
NUCLEIC ACIDS RESEARCH, 1990, 18 (08) :2163-2170
[10]   An automatic non-English sentiment lexicon builder using unannotated corpus [J].
Kaity, Mohammed ;
Balakrishnan, Vimala .
JOURNAL OF SUPERCOMPUTING, 2019, 75 (04) :2243-2268