A Two-stage Approach of Named-Entity Recognition for Crime Analysis

被引:0
|
作者
Das, Priyanka [1 ]
Das, Asit Kumar [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur, Howrah, India
关键词
Crime reports; text mining; named entity recognition; modus operandi; precision; recall;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Crime against women in India is on increase over the past few years and enormous crime reports are being generated everyday. But it is difficult to manually access the crime reports to derive useful information that can provide insights to the law enforcement officers for analysing the crime trends. The present work emphasizes on a simple yet efficient two stage approach for analysing crime against women in India. Initially, the proposed framework extracts crime reports from online newspaper articles. Once the data is collected, the first stage approach provides an interesting aspect by identifying named entities like name of states, cities, person etc. from the dataset and a collection of top ten entities of various categories is ranked according to their frequency of occurrence. The preliminary assessment shows feasible results which are also compared with crime records drawn from National Crime Records Bureau. However, the identified subtypes of entities are mostly ignored whereas dealing only with the basic entities fails to provide in-depth recognition of crime trends. So considering the subtypes can really provide the prerequisites for finer distinction in the field of crime data mining. The second stage approach in the present work considers the sub-types of named entities as 'Modus Operandi' features (mode of operation) of the crime that caters exquisite perception of the crime performed against women in India. Though lot of research exists on crime analysis, considering modus operandi features is very less. The present work demonstrates the effectiveness of the method with high recall and precision for the identified named entities.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity
    Nadeau, David
    Turney, Peter D.
    Matwin, Stan
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 266 - 277
  • [42] An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition
    Krishnan, Vijay
    Manning, Christopher D.
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 1121 - 1128
  • [43] POSBIOTM-NER: a trainable biomedical named-entity recognition system
    Song, Y
    Kim, E
    Lee, GG
    Yi, BK
    BIOINFORMATICS, 2005, 21 (11) : 2794 - 2796
  • [44] Deep Learning with Word Embedding Improves Kazakh Named-Entity Recognition
    Haisa, Gulizada
    Altenbek, Gulila
    INFORMATION, 2022, 13 (04)
  • [45] An Active Co-Training Algorithm for Biomedical Named-Entity Recognition
    Munkhdalai, Tsendsuren
    Li, Meijing
    Yun, Unil
    Namsrai, Oyun-Erdene
    Ryu, Keun Ho
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2012, 8 (04): : 575 - 588
  • [46] Arabic named entity recognition in crime documents
    Asharef, M.
    Omar, N.
    Albared, M.
    Journal of Theoretical and Applied Information Technology, 2012, 44 (01) : 1 - 6
  • [47] KIND: an Italian Multi-Domain Dataset for Named-Entity Recognition
    Paccosi, Teresa
    Aprosio, Alessio Palmero
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 501 - 507
  • [48] A Perspective on Text Classification, Clustering, and Named-entity Recognition in Social Media
    Jahanbin, Kia
    Rahmanian, Fereshte
    Rahmanian, Vahid
    Shakeri, Masihollah
    Shakeri, Heshmatollah
    Rahmaniani, Zhila
    Jahromi, Abdolreza Sotoodeh
    AMBIENT SCIENCE, 2019, 6 (01) : 1 - 4
  • [49] GoalBERT: A Lightweight Named-Entity Recognition Model Based on Multiple Fusion
    Xu, Yingjie
    Tan, Xiaobo
    Wang, Mengxuan
    Zhang, Wenbo
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [50] An Enhanced Malay Named Entity Recognition using Combination Approach for Crime Textual Data Analysis
    Asmai, Siti Azirah
    Salleh, Muhammad Sharilazlan
    Basiron, Halizah
    Ahmad, Sabrina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 474 - 483