Fine Grained Named Entity Recognition via Seq2seq Framework

被引:10
|
作者
Zhu, Huiming [1 ,2 ]
He, Chunhui [1 ]
Fang, Yang [1 ]
Xiao, Weidong [1 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Peoples R China
[2] Changsha Commerce & Tourism Coll, Dept Econ & Trade, Changsha 410116, Peoples R China
关键词
Tagging; Encyclopedias; Electronic publishing; Internet; Task analysis; Decoding; Named entity recognition; fine-grained; seq2seq framework;
D O I
10.1109/ACCESS.2020.2980431
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained Named entity recognition (NER) is crucial to natural language processing (NLP) applications like relation extraction and knowledge graph construction. Most existing fine-grained NER systems suffer from inefficiency problem as they use manually annotated training datasets. To address such issue, our NER system could automatically generate datasets from Wikipedia in distant supervision paradigm through mapping hyperlinks in Wikipedia documents to Freebase. In addition, previous NER models can not effectively process fine-grained labels with more than 100 types. So we introduce a "BIO" tagging strategy which can identify the position and type attributes simultaneously. Such tagging scheme transfers NER problem into a sequence-to-sequence (seq2seq) based issue. We propose a seq2seq framework to comprehend the input sentence in a comprehensive way. Specifically, we adopt a Bi-LSTM as the encoder to equally process the past and future information of the input. Then we add a self-attention mechanism to handle the long-term dependency problem in a long sequence. When classifying the entity tags, we choose CRF model as it adds more constraints to avoid position logical problem. Experiments are performed on large-scale datasets for fine-grained NER tasks. Experimental results verify the effectiveness of FSeqC, and it outperforms other state-of-the-art alternatives consistently and significantly.
引用
收藏
页码:53953 / 53961
页数:9
相关论文
共 50 条
  • [1] Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing
    Feng, Yanlin
    Pratapa, Adithya
    Mortensen, David
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15550 - 15560
  • [2] Combination of explicit segmentation with Seq2Seq recognition for fine analysis of children handwriting
    Krichen, Omar
    Corbille, Simon
    Anquetil, Eric
    Girard, Nathalie
    Fromont, Elisa
    Nerdeux, Pauline
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (04) : 339 - 350
  • [3] Combination of explicit segmentation with Seq2Seq recognition for fine analysis of children handwriting
    Omar Krichen
    Simon Corbillé
    Éric Anquetil
    Nathalie Girard
    Élisa Fromont
    Pauline Nerdeux
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 339 - 350
  • [4] Fine-Grained Multimodal Named Entity Recognition and Grounding with a Generative Framework
    Wang, Jieming
    Li, Ziyan
    Yu, Jianfei
    Yang, Li
    Xia, Rui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3934 - 3943
  • [5] Minimize Exposure Bias of Seq2Seq Models in Joint Entity and Relation Extraction
    Zhang, Ranran Haoran
    Liu, Qianying
    Fan, Aysa Xuemo
    Ji, Heng
    Zeng, Daojian
    Cheng, Fei
    Kawahara, Daisuke
    Kurohashi, Sadao
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 236 - 246
  • [6] Fine-Grained Named Entity Recognition for Sinhala
    Azeez, Rameela
    Ranathunga, Surangika
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 295 - 300
  • [7] Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework
    Makris, Dimos
    Agres, Kat R.
    Herremans, Dorien
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Fine-grained Named Entity Recognition for Turkish
    Khudoyberdieva, Lola
    Diri, Banu
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [9] Multilingual Fine-Grained Named Entity Recognition
    Lupancu, Viorica-Camelia
    Iftene, Adrian
    COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2023, 31 (03) : 321 - 339
  • [10] Fine-grained Dutch named entity recognition
    Bart Desmet
    Véronique Hoste
    Language Resources and Evaluation, 2014, 48 : 307 - 343