Detecting hate crimes through machine learning and natural language processing

被引:0
作者
Salazar, Ana Ortiz [1 ]
机构
[1] Performance Analyt & Res, Seattle Police Dept, Seattle, WA USA
关键词
Hate crimes; bias; NLP; machine learning; Seattle; BIAS;
D O I
10.1080/15614263.2024.2397363
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Misidentification and misreporting of hate crimes by victims and law enforcement are significant barriers to accurate data collection of hate crimes, and their consequent study and prevention. The use of machine learning in crime detection can improve the accuracy and speed at which reported incidents with bias elements are identified. This study develops a machine learning classifier that categorizes police reports as either events with bias elements or events with no bias elements. We use incident/offense reports from the Seattle Police Department to train a Natural Language Processing classification algorithm. We collect narratives, location data, and victim and suspect demographics to use as features. We evaluate the performance of logistic regression, random forest, and XGBoost algorithms, as well as several text embedding techniques. Despite substantial class imbalance, our model achieves a macro F1-score of 0.79, demonstrating the benefits of applied machine learning in accurately detecting and reporting hate crimes.
引用
收藏
页数:23
相关论文
共 50 条
[1]  
Alikhademi K, 2022, ARTIF INTELL LAW, V30, P1, DOI 10.1007/s10506-021-09286-4
[2]  
[Anonymous], 2014, Proceedings of the Second Australasian Web Conference-Volume 155, AWC'14
[3]  
[Anonymous], 2020, Hate crime statistics, 2019 [Report]
[4]  
[Anonymous], 2009, NATL CRIME VICTIMIZA
[5]  
BARNES A, 1994, SOC WORK, V39, P247
[6]   Unsupervised identification of crime problems from police free-text data [J].
Birks, Daniel ;
Coleman, Alex ;
Jackson, David .
CRIME SCIENCE, 2020, 9 (01)
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Brochu E, 2010, Arxiv, DOI [arXiv:1012.2599, DOI 10.48550/ARXIV.1012.2599]
[9]  
Bureau of Justice Statistics, 2023, Funding Awards. Retrieved from FY 2023 law enforcement transition to the National incident-based reporting system (NIBRS) to improve hate crime reporting
[10]   Where are we? Using Scopus to map the literature at the intersection between artificial intelligence and research on crime [J].
Campedelli, Gian Maria .
JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2021, 4 (02) :503-530