Applying Machine Learning Techniques for Religious Extremism Detection on Online User Contents

被引:4
作者
Mussiraliyeva, Shynar [1 ]
Omarov, Batyrkhan [1 ]
Yoo, Paul [1 ,2 ]
Bolatbek, Milana [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
[2] Univ London, Birkbeck Coll, CSIS, London, England
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 01期
关键词
Extremism; religious extremism; machine learning; social media; social network; natural language processing; NLP; ISLAMIST;
D O I
10.32604/cmc.2022.019189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this research paper, we propose a corpus for the task of detecting religious extremism in social networks and open sources and compare various machine learning algorithms for the binary classification problem using a previously created corpus, thereby checking whether it is possible to detect extremist messages in the Kazakh language. To do this, the authors trained models using six classic machine-learning algorithms such as Support Vector Machine, Decision Tree, Random Forest, K Nearest Neighbors, Naive Bayes, and Logistic Regression. To increase the accuracy of detecting extremist texts, we used various characteristics such as Statistical Features, TF-IDF, POS, LIWC, and applied oversampling and undersampling techniques to handle imbalanced data. As a result, we achieved 98% accuracy in detecting religious extremism in Kazakh texts for the collected dataset. Testing the developed machine learning models in various databases that are often found in everyday life "Jokes", "News", "Toxic content", "Spam", "Advertising" has also shown high rates of extremism detection.
引用
收藏
页码:915 / 934
页数:20
相关论文
共 28 条
  • [1] The symbiotic relationship between Islamophobia and radicalisation
    Abbas, Tahir
    [J]. CRITICAL STUDIES ON TERRORISM, 2012, 5 (03) : 345 - 358
  • [2] Borders and sovereignty in Islamist and jihadist thought: past and present
    Adraoui, Mohamed-Ali
    [J]. INTERNATIONAL AFFAIRS, 2017, 93 (04) : 917 - +
  • [3] Detection and classification of social media-based extremist affiliations using sentiment analysis techniques
    Ahmad, Shakeel
    Asghar, Muhammad Zubair
    Alotaibi, Fahad M.
    Awan, Irfanullah
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2019, 9
  • [4] Spotting the Islamist Radical within: Religious Extremists Profiling in the United State
    Al-Zewairi, Malek
    Naymat, Ghazi
    [J]. 8TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2017) / 7TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2017) / AFFILIATED WORKSHOPS, 2017, 113 : 162 - 169
  • [5] [Anonymous], 2021, 5 KEY QUESTIONS ANSW
  • [6] An Approach for Radicalization Detection Based on Emotion Signals and Semantic Similarity
    Araque, Oscar
    Iglesias, Carlos A.
    [J]. IEEE ACCESS, 2020, 8 : 17877 - 17891
  • [7] Detecting Jihadist Messages on Twitter
    Ashcroft, Michael
    Fisher, Ali
    Kaati, Lisa
    Omer, Enghin
    Prucha, Nico
    [J]. 2015 EUROPEAN INTELLIGENCE AND SECURITY INFORMATICS CONFERENCE (EISIC), 2015, : 161 - 164
  • [8] Sentiment analysis of extremism in social media from textual information
    Asif, Muhammad
    Ishtiaq, Atiab
    Ahmad, Haseeb
    Aljuaid, Hanan
    Shah, Jalal
    [J]. TELEMATICS AND INFORMATICS, 2020, 48
  • [9] Azizan SA., 2017, J Eng Appl Sci, V12, P691
  • [10] Devyatkin D, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), P188, DOI 10.1109/ISI.2017.8004907