An intelligent textual corpus big data computing approach for lexicons construction and sentiment classification of public emergency events

Cited by: 15
Authors
Zhang, Wei [1]
Zhu, Yan-chun [2]
Wang, Jia-peng [1]
Affiliations
[1] Cent Univ Finance & Econ, Sch Informat, Beijing 100081, Peoples R China
[2] Beijing Normal Univ, Business Sch, Beijing 100875, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Textual corpus; Big data; Lexicon construction; Sentiment computing; Public emergency events; EMOTIONS; EXTRACTION; MEDIA;
DOI
10.1007/s11042-018-7018-x
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Existing emotional lexicons rely on too much manual intervention, scale poorly, and ignore dependency parsing in emotion computing. To address these deficiencies, this paper first uses Word2Vec, cosine similarity between word vectors, and the SO-PMI algorithm to build a public event-oriented Weibo emotional lexicon. It then proposes a Weibo emotion computing method based on dependency parsing, designing an emotion binary tree and dependency-based emotion calculation rules. Finally, an experiment shows that this emotional lexicon has wider coverage and higher accuracy than existing ones, and a public opinion evolution analysis of an actual public event yields empirical results indicating that the algorithm is feasible and effective.
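The two scoring ingredients named in the abstract, cosine similarity between word vectors and SO-PMI, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the document-level co-occurrence counting, and the additive `smooth` constant are all assumptions made for illustration, and the word vectors would in practice come from a trained Word2Vec model rather than hand-written lists.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)

def pmi(a, b, docs, smooth=0.5):
    """Pointwise mutual information of words a and b.

    docs is a list of token sets; co-occurrence is counted per document.
    The additive `smooth` term is an assumption that avoids log(0).
    """
    n = len(docs)
    df_a = sum(1 for d in docs if a in d)
    df_b = sum(1 for d in docs if b in d)
    df_ab = sum(1 for d in docs if a in d and b in d)
    return math.log2(((df_ab + smooth) * n) / ((df_a + smooth) * (df_b + smooth)))

def so_pmi(word, pos_seeds, neg_seeds, docs):
    """Semantic orientation: association with positive seed words
    minus association with negative seed words."""
    pos = sum(pmi(word, s, docs) for s in pos_seeds)
    neg = sum(pmi(word, s, docs) for s in neg_seeds)
    return pos - neg
```

Under this sketch, a candidate word with a positive `so_pmi` score would be added to the lexicon as positive and one with a negative score as negative; any threshold around zero for neutral words is left to the reader.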
Pages: 30159-30174
Number of pages: 16
Related papers
40 in total
  • [1] [Anonymous], 2016, INT C INN TECHN APPL
  • [2] [Anonymous], 2018, ARXIV180700775
  • [3] Badaro G., 2018, P 7 JOINT C LEX COMP, P86, DOI 10.18653/V1/S18-2009
  • [4] Emotion-aware polarity lexicons for Twitter sentiment analysis
    Bandhakavi, Anil
    Wiratunga, Nirmalie
    Massie, Stewart
    Deepak, P.
    [J]. EXPERT SYSTEMS, 2021, 38 (07)
  • [5] Lexicon based feature extraction for emotion text classification
    Bandhakavi, Anil
    Wiratunga, Nirmalie
    Padmanabhan, Deepak
    Massie, Stewart
    [J]. PATTERN RECOGNITION LETTERS, 2017, 93 : 133 - 142
  • [6] Bestgen Y, 2008, SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, P496
  • [7] Che W, 2010, P COL 2010 DEM BEIJ, P13, DOI 10.5555/1944284.1944288
  • [8] Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network
    Ghiassi, M.
    Skinner, J.
    Zimbra, D.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) : 6266 - 6282
  • [9] Guan T, 2018, J CHIN POLIT SCI, P1
  • [10] The 2013 Boston marathon bombing: Publics' emotions, coping, and organizational engagement
    Guo, Sylvia Jiankun
    [J]. PUBLIC RELATIONS REVIEW, 2017, 43 (04) : 755 - 767