A Two-stage Approach of Named-Entity Recognition for Crime Analysis

被引:0
|
作者
Das, Priyanka [1 ]
Das, Asit Kumar [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur, Howrah, India
关键词
Crime reports; text mining; named entity recognition; modus operandi; precision; recall;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Crime against women in India is on increase over the past few years and enormous crime reports are being generated everyday. But it is difficult to manually access the crime reports to derive useful information that can provide insights to the law enforcement officers for analysing the crime trends. The present work emphasizes on a simple yet efficient two stage approach for analysing crime against women in India. Initially, the proposed framework extracts crime reports from online newspaper articles. Once the data is collected, the first stage approach provides an interesting aspect by identifying named entities like name of states, cities, person etc. from the dataset and a collection of top ten entities of various categories is ranked according to their frequency of occurrence. The preliminary assessment shows feasible results which are also compared with crime records drawn from National Crime Records Bureau. However, the identified subtypes of entities are mostly ignored whereas dealing only with the basic entities fails to provide in-depth recognition of crime trends. So considering the subtypes can really provide the prerequisites for finer distinction in the field of crime data mining. The second stage approach in the present work considers the sub-types of named entities as 'Modus Operandi' features (mode of operation) of the crime that caters exquisite perception of the crime performed against women in India. Though lot of research exists on crime analysis, considering modus operandi features is very less. The present work demonstrates the effectiveness of the method with high recall and precision for the identified named entities.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Label Prompt Guiding for Two-Stage Few-Shot Named Entity Recognition
    Qi, Rongzhi
    Lou, Jiazheng
    Mao, Yingchi
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 334 - 346
  • [32] Named Entity Recognition Approach for Malay Crime News Retrieval
    Saad, Saidah
    Mansor, Mohamed Kamil
    GEMA ONLINE JOURNAL OF LANGUAGE STUDIES, 2018, 18 (04): : 216 - 235
  • [33] Named Entity Recognition in Crime Using Machine Learning Approach
    Shabat, Hafedh
    Omar, Nazlia
    Rahem, Khmael
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014, 2014, 8870 : 280 - 288
  • [34] Named entity recognition in crime using machine learning approach
    Shabat, Hafedh (h2005_ali@yahoo.com), 1600, Springer Verlag (8870):
  • [35] Two Stage Genetic Approach for Bio-chemical Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 709 - 713
  • [36] Learning Subword Embedding to Improve Uyghur Named-Entity Recognition
    Saimaiti, Alimu
    Wang, Lulu
    Yibulayin, Tuergen
    INFORMATION, 2019, 10 (04)
  • [37] Named-Entity Recognition for Disaster Related Filipino News Articles
    Dela Cruz, Bern Maris
    Montalla, Cyril
    Manansala, Allysa
    Rodriguez, Ramon
    Octaviano, Manolito, Jr.
    Fabito, Bernie S.
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 1633 - 1636
  • [38] Re-ranking for Joint Named-Entity Recognition and Linking
    Sil, Avirup
    Yates, Alexander
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2369 - 2374
  • [39] A Survey of Named-Entity Recognition Methods for Food Information Extraction
    Popovski, Gorjan
    Seljak, Barbara Korousic
    Eftimov, Tome
    IEEE ACCESS, 2020, 8 : 31586 - 31594
  • [40] An Approach to Web-Scale Named-Entity Disambiguation
    Sarmento, Luis
    Kehlenbeck, Alexander
    Oliveira, Eugenio
    Ungar, Lyle
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 689 - +