Correlations and Fractality in Sentence-Level Sentiment Analysis Based on VADER for Literary Texts

被引:1
作者
Hernandez-Perez, Ricardo [1 ]
Lara-Martinez, Pablo [2 ]
Obregon-Quintana, Bibiana [2 ]
Liebovitch, Larry S. [3 ,4 ]
Guzman-Vargas, Lev [5 ]
机构
[1] Inst Politecn Nacl, Escuela Super Fis & Matemat, Edif 9,2o Piso, Mexico City 07738, Mexico
[2] Univ Nacl Autonoma Mexico, Fac Ciencias, Mexico City 04510, Mexico
[3] CUNY Queens Coll, Dept Phys, New York, NY 11367 USA
[4] Columbia Univ, Climate Sch, Adv Consortium Cooperat Conflict & Complex AC4, New York, NY 10027 USA
[5] Inst Politecn Nacl, Unidad Interdisciplinaria Ingn & Tecnol Avanzadas, Av IPN 2580, Mexico City 07340, Mexico
关键词
sentiment analysis; opinion mining; social systems; TIME-SERIES; EMOTIONS; TRANSLATION; LANGUAGE; MEMORY; SCALE;
D O I
10.3390/info15110698
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We perform a sentence-level sentiment analysis study of different literary texts in English language. Each text is converted into a series in which the data points are the sentiment value of each sentence obtained using the sentiment analysis tool (VADER). By applying the Detrended Fluctuation Analysis (DFA) and the Higuchi Fractal Dimension (HFD) methods to these sentiment series, we find that they are monofractal with long-term correlations, which can be explained by the fact that the writing process has memory by construction, with a sentiment evolution that is self-similar. Furthermore, we discretize these series by applying a classification approach which transforms the series into a one on which each data point has only three possible values, corresponding to positive, neutral or negative sentiments. We map these three-states series to a Markov chain and investigate the transitions of sentiment from one sentence to the next, obtaining a state transition matrix for each book that provides information on the probability of transitioning between sentiments from one sentence to the next. This approach shows that there are biases towards increasing the probability of switching to neutral or positive sentences. The two approaches supplement each other, since the long-term correlation approach allows a global assessment of the sentiment of the book, while the state transition matrix approach provides local information about the sentiment evolution along the text.
引用
收藏
页数:14
相关论文
共 62 条
  • [1] How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble
    Akhtar, Md Shad
    Ekbal, Asif
    Cambria, Erik
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2020, 15 (01) : 64 - 75
  • [2] Measuring complexity with multifractals in texts. Translation effects
    Ausloos, M.
    [J]. CHAOS SOLITONS & FRACTALS, 2012, 45 (11) : 1349 - 1357
  • [3] Generalized Hurst exponent and multifractal function of original and translated texts mapped into frequency and length time series
    Ausloos, M.
    [J]. PHYSICAL REVIEW E, 2012, 86 (03):
  • [4] Comparison of detrending methods for fluctuation analysis
    Bashan, Amir
    Bartsch, Ronny
    Kantelhardt, Jan W.
    Havlin, Shlomo
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2008, 387 (21) : 5080 - 5090
  • [5] Bizzoni Y., 2022, P 2 INT WORKSH NAT L, P31
  • [6] Bradley M. M., 1999, TECHNICAL REPORT C1
  • [7] EMOTIONS IN TEXT: DIMENSIONAL AND CATEGORICAL MODELS
    Calvo, Rafael A.
    Mac Kim, Sunghwan
    [J]. COMPUTATIONAL INTELLIGENCE, 2013, 29 (03) : 527 - 543
  • [8] Cambria E., 2018, P AAAI C ART INT NEW, VVolume 32
  • [9] Generalization of Higuchi's fractal dimension for multifractal analysis of time series with limited length
    Carrizales-Velazquez, Carlos
    Donner, Reik V.
    Guzman-Vargas, Lev
    [J]. NONLINEAR DYNAMICS, 2022, 108 (01) : 417 - 431
  • [10] Collective Emotions Online and Their Influence on Community Life
    Chmiel, Anna
    Sienkiewicz, Julian
    Thelwall, Mike
    Paltoglou, Georgios
    Buckley, Kevan
    Kappas, Arvid
    Holyst, Janusz A.
    [J]. PLOS ONE, 2011, 6 (07):