A graph proximity feature augmentation approach for identifying accounts of terrorists on Twitter

Cited by: 5
Authors
Aleroud, Ahmed [1, 2]
Abu-Alsheeh, Nisreen [1 ]
Al-Shawakfa, Emad [1 ]
Affiliations
[1] Yarmouk Univ, Dept Informat Syst, Irbid, Jordan
[2] Univ Maryland, Dept Informat Syst, Baltimore Cty UMBC, Baltimore, MD 21201 USA
Keywords
Feature augmentation; Latent Dirichlet allocation (LDA); Social network analysis; Temporal analysis; Terrorism informatics; Graph neighborhood; MEDIA;
DOI
10.1016/j.cose.2020.107056
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the popularity of social networks, terrorist groups such as ISIS have encouraged others to follow their activities, share their ideas, recruit fans, radicalize communities, and raise funds to support future attacks. This has led to the emergence of radicalized online accounts that belong to terrorists or their fans. Existing techniques for counter-terrorism investigations that aim to suspend such accounts are based on user reports or on syntactic sentiment analysis, which is not accurate on the short texts shared by terrorists, such as tweets. This work proposes a feature augmentation approach that enriches the content of tweets before they are examined for radicalized content. The augmented tweets are then used to classify accounts into Pro-ISIS or Anti-ISIS categories. We utilized topic modeling as a baseline method for feature augmentation. We studied the effects of utilizing tweets at different time intervals on the quality of the generated models that classify tweets and the corresponding accounts. We then introduced a novel feature augmentation approach that utilizes Neighborhood Overlap, a graph proximity technique that discovers terms having a strong relationship with the Pro-ISIS category. Terms extracted from tweets are represented as nodes in a graph, which is then partitioned into clusters containing different terms. Terms in strongly connected parts of each cluster are added to the original term vectors of the tweets based on the similarity between those terms and each tweet. We compared our approach with baseline augmentation techniques such as Term-to-Term correlation and Topic Modeling, as well as other existing techniques. Experimental results on a dataset containing Pro- and Anti-ISIS tweets show that our approach is promising for automating the identification of terrorist content online.
The results show that using graph proximity measures such as Neighborhood Overlap for term augmentation yields higher Precision, Recall, and F-measure than the typical approaches. In addition, we found that applying time-based analysis with term augmentation to identify radicalized accounts enhanced the Precision of the investigation process. (C) 2020 Elsevier Ltd. All rights reserved.
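As a rough illustration of the Neighborhood Overlap idea mentioned in the abstract, the sketch below builds a term co-occurrence graph and scores each edge by the standard definition: the number of neighbors two terms share, divided by the number of neighbors either term has, excluding the two terms themselves. This is not the authors' implementation. The function names, the toy tokens, and the 0.5 threshold are assumptions made for illustration, and the graph partitioning and term-to-tweet similarity steps described in the abstract are omitted.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(docs):
    """Map each term to the set of terms it co-occurs with in any document."""
    graph = defaultdict(set)
    for tokens in docs:
        for a, b in combinations(sorted(set(tokens)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def neighborhood_overlap(graph, a, b):
    """Shared neighbors of a and b over neighbors of either, excluding a and b."""
    union = (graph[a] | graph[b]) - {a, b}
    if not union:
        return 0.0
    return len(graph[a] & graph[b]) / len(union)


def augment(tokens, graph, threshold=0.5):
    """Add neighboring terms whose edge overlap meets the (assumed) threshold."""
    original = set(tokens)
    added = set()
    for term in original:
        for neighbor in graph.get(term, ()):
            if neighbor in original:
                continue
            if neighborhood_overlap(graph, term, neighbor) >= threshold:
                added.add(neighbor)
    return original | added


# Hypothetical tokenized documents standing in for preprocessed tweets.
docs = [["a", "b", "c"], ["a", "b", "d"], ["c", "e"]]
graph = build_cooccurrence_graph(docs)
print(neighborhood_overlap(graph, "a", "b"))
print(sorted(augment(["a"], graph)))
```

In this toy graph, a short document containing only `a` gains the terms that sit in the same tightly connected neighborhood, while weakly tied terms are left out. A real pipeline would apply the same scoring to term vectors built from labeled tweets.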
Pages: 25