Named Entity Recognition in User-Generated Text: A Systematic Literature Review

被引:0
|
作者
Esmaail, Naji [1 ,2 ]
Omar, Nazlia [1 ]
Mohd, Masnizah [1 ]
Fauzi, Fariza [1 ]
Mansur, Zainab [1 ,2 ]
机构
[1] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Artificial Intelligence Technol, Bangi 43600, Malaysia
[2] Omar Al Mukhtar Univ, Fac Sci, Dept Comp Sci, Al Bayda, Libya
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Social networking (online); Blogs; Reviews; Systematics; Surveys; Named entity recognition; Databases; Information retrieval; Natural language processing; NER; user-generated text; WNUT; X; systematic literature review; SLR; information extraction; natural language processing; social media; INFORMATION EXTRACTION; LINKING; TRENDS;
D O I
10.1109/ACCESS.2024.3427714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) in social media has received much research attention in the field of natural language processing (NLP) and information extraction. Research on this topic has grown dramatically in recent years. Hence, one of the objectives of this systematic literature review (SLR) is to present the outline techniques, approaches, and methods used to handle NER on X based on English datasets prepared for WNUT (Workshop on User-generated Text). This study could be used to develop more accurate models in the future. This SLR focuses on articles that had been published over the course of eight years, i.e., from July 2015 to the end of 2023. A total of 67 out of 316 articles published during the period were selected having met the set chosen criteria. Based on the analysis of the selected articles, challenges were identified and discussed. In this SLR, we aim to provide a better understanding of current viewpoints and highlight opportunities for research in NER in User-generated Text specifically for English usage on X. It can aid in identifying named entities, such as names, locations, companies, and groups, within a specific informal social media context like X. This research is notable for being the first systematic review that emphasizes the dearth of NER on X based on English datasets prepared for WNUT. The main contribution of this systematic review is a comprehensive study on NER in X messages for social media, entailing its challenges and opportunities. Moreover, new possible research directions are suggested for the researchers.
引用
收藏
页码:136330 / 136353
页数:24
相关论文
共 50 条
  • [1] Improving named entity recognition accuracy for gene and protein in biomedical text literature
    Tohidi, Hossein
    Ibrahim, Hamidah
    Murad, Masrah Azrifah Azmi
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 10 (03) : 239 - 268
  • [2] Improving named entity recognition in noisy user-generated text with local distance neighbor feature
    Wesam Al-Nabki, Mhd
    Fidalgo, Eduardo
    Alegre, Enrique
    Fernandez-Robles, Laura
    NEUROCOMPUTING, 2020, 382 : 1 - 11
  • [3] Expanding UlyssesNER-Br Named Entity Recognition Corpus with Informal User-Generated Text
    Costa, Rosimeire
    Albuquerque, Hidelberg Oliveira
    Silvestre, Gabriel
    Silva, Nadia Felix F.
    Souza, Ellen
    Vitorio, Douglas
    Nunes, Augusto
    Siqueira, Felipe
    Tarrega, Joao Pedro
    Beinotti, Joao Vitor
    Dias, Marcio de Souza
    Pereira, Fabiola S. F.
    Silva, Matheus
    Gardini, Miguel
    Silva, Vinicius
    de Carvalho, Andre C. P. L. F.
    Oliveira, Adriano L., I
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 767 - 779
  • [4] Advancements in Arabic Named Entity Recognition: A Comprehensive Review
    El Moussaoui, Taoufiq
    Loqman, Chakir
    IEEE ACCESS, 2024, 12 : 180238 - 180266
  • [5] Real-Time Text Classification of User-Generated Content on Social Media: Systematic Review
    Rogers, David
    Preece, Alun
    Innes, Martin
    Spasic, Irena
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (04) : 1154 - 1166
  • [6] An Automatically Generated Annotated Corpus for Albanian Named Entity Recognition
    Hoxha, Klesti
    Baxhaku, Artur
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (01) : 95 - 108
  • [7] An Empirical Analysis of Moroccan Dialectal User-Generated Text
    Tachicart, Ridouane
    Bouzoubaa, Karim
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, 2019, 11684 : 3 - 12
  • [8] Named Entity Recognition for Short Text Messages
    Ek, Tobias
    Kirkegaard, Camilla
    Jonsson, Hakan
    Nugues, Pierre
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 178 - 187
  • [9] A systematic literature review on travel planning through user-generated video
    Nguyen, Phuong Minh Binh
    Pham, Lan Xuan
    Tran, Dang Khoa
    Truong, Giang Nu To
    JOURNAL OF VACATION MARKETING, 2024, 30 (03) : 553 - 581
  • [10] Named Entity Recognition in Unstructured Medical Text Documents
    Pearson, Cole
    Seliya, Naeem
    Dave, Rushit
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 412 - 417