An Ensemble Method for Radicalization and Hate Speech Detection Online Empowered by Sentic Computing

被引:0
作者
Oscar Araque
Carlos A. Iglesias
机构
[1] Universidad Politécnica de Madrid,Intelligent Systems Group
来源
Cognitive Computation | 2022年 / 14卷
关键词
Sentic computing; Affective computing; Radicalization detection; Hate speech detection; Machine learning; Natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
The dramatic growth of the Web has motivated researchers to extract knowledge from enormous repositories and to exploit the knowledge in myriad applications. In this study, we focus on natural language processing (NLP) and, more concretely, the emerging field of affective computing to explore the automation of understanding human emotions from texts. This paper continues previous efforts to utilize and adapt affective techniques into different areas to gain new insights. This paper proposes two novel feature extraction methods that use the previous sentic computing resources AffectiveSpace and SenticNet. These methods are efficient approaches for extracting affect-aware representations from text. In addition, this paper presents a machine learning framework using an ensemble of different features to improve the overall classification performance. Following the description of this approach, we also study the effects of known feature extraction methods such as TF-IDF and SIMilarity-based sentiment projectiON (SIMON). We perform a thorough evaluation of the proposed features across five different datasets that cover radicalization and hate speech detection tasks. To compare the different approaches fairly, we conducted a statistical test that ranks the studied methods. The obtained results indicate that combining affect-aware features with the studied textual representations effectively improves performance. We also propose a criterion considering both classification performance and computational complexity to select among the different methods.
引用
收藏
页码:48 / 61
页数:13
相关论文
共 104 条
  • [1] Hendler J(2008)Web science: an interdisciplinary approach to understanding the web Commun ACM 51 60-69
  • [2] Shadbolt N(2014)Jumping NLP curves: A review of natural language processing research IEEE Comput Intell Mag 9 48-57
  • [3] Hall W(2016)Multilingual sentiment analysis: state of the art and independent comparison of techniques Cogn Comput 8 757-771
  • [4] Berners-Lee T(2012)Using natural language processing technology for qualitative data analysis Int J Soc Res Methodol 15 523-543
  • [5] Weitzner D(2015)Sentic computing: A common-sense-based framework for concept-level sentiment analysis Cogn Comput 7 183-185
  • [6] Cambria E(2019)A semantic similarity-based perspective of affect lexicons for sentiment analysis Knowl-Based Syst 165 346-359
  • [7] White B(2017)Sentiment analysis is a big suitcase IEEE Intell Syst 32 74-80
  • [8] Dashtipour K(2016)Affective computing and sentiment analysis IEEE Intell Syst 31 102-107
  • [9] Poria S(2018)Ontosenticnet: A commonsense ontology for sentiment analysis IEEE Intell Syst 33 77-85
  • [10] Hussain A(2017)Aspect-based extraction and analysis of affective knowledge from social media streams IEEE Intell Syst 32 80-88