Applying Machine Learning Techniques for Religious Extremism Detection on Online User Contents

被引：4

作者：

Mussiraliyeva, Shynar ^{[1
]}

Omarov, Batyrkhan ^{[1
]}

Yoo, Paul ^{[1
,2
]}

Bolatbek, Milana ^{[1
]}

机构：

[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan

[2] Univ London, Birkbeck Coll, CSIS, London, England

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 01期

关键词：

Extremism; religious extremism; machine learning; social media; social network; natural language processing; NLP; ISLAMIST;

D O I：

10.32604/cmc.2022.019189

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this research paper, we propose a corpus for the task of detecting religious extremism in social networks and open sources and compare various machine learning algorithms for the binary classification problem using a previously created corpus, thereby checking whether it is possible to detect extremist messages in the Kazakh language. To do this, the authors trained models using six classic machine-learning algorithms such as Support Vector Machine, Decision Tree, Random Forest, K Nearest Neighbors, Naive Bayes, and Logistic Regression. To increase the accuracy of detecting extremist texts, we used various characteristics such as Statistical Features, TF-IDF, POS, LIWC, and applied oversampling and undersampling techniques to handle imbalanced data. As a result, we achieved 98% accuracy in detecting religious extremism in Kazakh texts for the collected dataset. Testing the developed machine learning models in various databases that are often found in everyday life "Jokes", "News", "Toxic content", "Spam", "Advertising" has also shown high rates of extremism detection.

引用

页码：915 / 934

页数：20

共 28 条

[1] The symbiotic relationship between Islamophobia and radicalisation
Abbas, Tahir
[J]. CRITICAL STUDIES ON TERRORISM, 2012, 5 (03) : 345 - 358
[2] Borders and sovereignty in Islamist and jihadist thought: past and present
Adraoui, Mohamed-Ali
[J]. INTERNATIONAL AFFAIRS, 2017, 93 (04) : 917 - +
[3] Detection and classification of social media-based extremist affiliations using sentiment analysis techniques
Ahmad, Shakeel
Asghar, Muhammad Zubair
Alotaibi, Fahad M.
Awan, Irfanullah
[J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2019, 9
[4] Spotting the Islamist Radical within: Religious Extremists Profiling in the United State
Al-Zewairi, Malek
Naymat, Ghazi
[J]. 8TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2017) / 7TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2017) / AFFILIATED WORKSHOPS, 2017, 113 : 162 - 169
[5] [Anonymous], 2021, 5 KEY QUESTIONS ANSW
[6] An Approach for Radicalization Detection Based on Emotion Signals and Semantic Similarity
Araque, Oscar
Iglesias, Carlos A.
[J]. IEEE ACCESS, 2020, 8 : 17877 - 17891
[7] Detecting Jihadist Messages on Twitter
Ashcroft, Michael
Fisher, Ali
Kaati, Lisa
Omer, Enghin
Prucha, Nico
[J]. 2015 EUROPEAN INTELLIGENCE AND SECURITY INFORMATICS CONFERENCE (EISIC), 2015, : 161 - 164
[8] Sentiment analysis of extremism in social media from textual information
Asif, Muhammad
Ishtiaq, Atiab
Ahmad, Haseeb
Aljuaid, Hanan
Shah, Jalal
[J]. TELEMATICS AND INFORMATICS, 2020, 48
[9] Azizan SA., 2017, J Eng Appl Sci, V12, P691
[10] Devyatkin D, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), P188, DOI 10.1109/ISI.2017.8004907

← 1 2 3 →