CrimeProfiler: Crime Information Extraction and Visualization from News Media

被引:11
作者
Dasgupta, Tirthankar [1 ]
Naskar, Abir [1 ]
Saha, Rupsa [1 ]
Dey, Lipika [2 ]
机构
[1] Tata Consultancy Serv Ltd, Innovat Lab, Kharagpur, W Bengal, India
[2] Tata Consultancy Serv Ltd, TCS Innovat Lab, Kharagpur, W Bengal, India
来源
2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017) | 2017年
关键词
Crime Pro. ling; Entity Extraction; Entity Resolution; Crime Ontology; Text Classification;
D O I
10.1145/3106426.3106476
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
News articles from different sources regularly report crime incidents that contain details of crime, information about accused entities, details of the investigation process and finally details of judgement. In this paper, we have proposed natural language processing techniques for extraction and curation of crime-related information from digitally published News articles. We have leveraged computational linguistics based methods to analyse crime related News documents to extract different crime related entities and events. This includes name of the criminal, name of the victim, nature of crime, geographic location, date and time, and action taken against the criminal. We have also proposed a semi-supervised learning technique to learn different categories of crime events from the News documents. This helps in continuous evolution of the crime dictionaries. Thus the proposed methods are not restricted to detecting known crimes only but contribute actively towards maintaining an updated crime dictionary. We have done experiments with a collection of 3000 crime-reporting News articles. The end-product of our experiments is a crime-register that contains details of crime committed across geographies and time. This register can be further utilized for analytical and reporting purposes.
引用
收藏
页码:541 / 549
页数:9
相关论文
共 28 条
[1]  
[Anonymous], 2005, P 43 ANN M ASS COMP
[2]  
[Anonymous], 2003, P 2003 ANN NATL C DI
[3]  
Arulanandam R., 2014, P 2 AUSTR WEB C, V155, P31
[4]  
Asaro C., 2003, P ICAIL 2003 WORKSH
[5]  
Bechhofer S., 2009, ENCY DATABASE SYSTEM, P2008, DOI DOI 10.1007/978-0-387-39940-9_1073
[6]  
Buczak A. L., 2010, ACM SIGKDD WORKSH IN, P1
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]  
Chau M., 2002, Proceedings of the 2002 annual national conference on Digital government research, P1
[9]   Crime data mining: A general framework and some examples [J].
Chen, HC ;
Chung, WY ;
Xu, JJ ;
Wang, G ;
Qin, Y ;
Chau, M .
COMPUTER, 2004, 37 (04) :50-+
[10]  
de Bruin JS, 2006, IEEE DATA MINING, P171