Detecting and Classifying Crimes from Arabic Twitter Posts using Text Mining Techniques

Cited by: 0
Authors
Al-Saif, Hissah [1]
Al-Dossari, Hmood [1]
Affiliations
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
Keywords
Crimes; text mining; classification; feature extraction techniques; Arabic posts; Twitter
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Crime analysis has become critical in helping law enforcement agencies protect civilians. As populations have grown rapidly, crime rates have risen sharply, and thorough analysis has become a time-consuming effort. Text mining is an effective tool that can help address this problem by classifying crimes efficiently. The proposed system aims to detect and classify crimes in Twitter posts written in Arabic, one of the most widely spoken languages today. In this paper, several classification algorithms are used to detect crimes and identify their nature. The experiments evaluate different algorithms, such as SVM, DT, CNB, and KNN, in terms of accuracy and speed in the crime domain. Different feature extraction techniques are also evaluated, including root-based stemming, light stemming, and n-grams. The experiments showed that n-grams outperform the other techniques. Specifically, SVM with trigrams outperformed the other classifiers, reaching an accuracy of 91.55%.
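The n-gram feature extraction mentioned in the abstract can be illustrated with a minimal sketch. It assumes word-level n-grams over whitespace-separated tokens; the abstract does not say whether the paper uses word or character n-grams, and the function names and the sample sentence below are hypothetical, not taken from the paper.

```python
from collections import Counter


def word_ngrams(text, n=3):
    """Return the word-level n-grams of a whitespace-tokenized text.

    Assumption: plain whitespace tokenization, with no stemming or
    normalization applied beforehand.
    """
    tokens = text.split()
    # Slide a window of n tokens across the text; texts shorter than n
    # tokens yield an empty list.
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_counts(text, n=3):
    """Count how often each n-gram occurs, as a simple feature vector."""
    return Counter(word_ngrams(text, n))


# Hypothetical sample post (made up for illustration, six tokens).
sample = "تم القبض على المتهم في الرياض"
features = ngram_counts(sample, n=3)
```

Counts like these could then be fed to a classifier such as an SVM; that step is omitted here because the abstract gives no details of the paper's feature weighting or model settings.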
Pages: 377-387
Page count: 11