Fine-Grained Named Entity Recognition for Sinhala

被引:0
|
作者
Azeez, Rameela [1 ]
Ranathunga, Surangika [1 ]
机构
[1] Univ Moratuwa, Dept Comp Sci & Engn, Katubedda 10400, Sri Lanka
关键词
named entity recognition; sinhala; named entity; conditional random fields;
D O I
10.1109/mercon50084.2020.9185296
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
For English, Named Entity Recognition (NER) is more or less a solved problem. However, for low-resourced and morphologically rich languages such as Sinhala, minimal research has been done. In this paper, we present a novel fine-grained Named Entity (NE) tag set and an NE annotated Sinhala corpus of 70k word tokens. We trained a custom NER model for Sinhala based on Conditional Random Fields (CRF). Despite the low-resourced setting, this NER model could achieve an micro-averaged F1 score of 84.8.
引用
收藏
页码:295 / 300
页数:6
相关论文
共 50 条
  • [21] Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition
    Kato, Takuma
    Abe, Kaori
    Ouchi, Hiroki
    Miyawaki, Shumpei
    Suzuki, Jun
    Inui, Kentaro
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, 2020, : 222 - 229
  • [22] CHEMNER: Fine-Grained Chemistry Named Entity Recognition with Ontology-Guided Distant Supervision
    Wang, Xuan
    Hu, Vivian
    Song, Xiangchen
    Garg, Shweta
    Xiao, Jinfeng
    Han, Jiawei
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5227 - 5240
  • [23] Fine-grained multimodal named entity recognition with heterogeneous image-text similarity graphs
    Wang, Yongpeng
    Jiang, Chunmao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, : 2401 - 2415
  • [24] Enhancing Legal Named Entity Recognition Using RoBERTa-GCN with CRF: A Nuanced Approach for Fine-Grained Entity Recognition
    Jain, Arihant
    Sharma, Raksha
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 261 - 267
  • [25] NumER: A Fine-Grained Numeral Entity Recognition Dataset
    Julavanich, Thanakrit
    Aizawa, Akiko
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2021), 2021, 12801 : 64 - 75
  • [26] Enhancing fine-grained geographic named entity recognition by Multi-scale Siamese Reconstruction Network
    Huang, Guanhua
    Gao, Bofei
    Chen, Jiaze
    Zhang, Yuchen
    Yang, Zhouwang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
  • [27] MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition
    Fetahu, Besnik
    Chen, Zhiyu
    Kar, Sudipta
    Rokhlenko, Oleg
    Malmasi, Shervin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2027 - 2051
  • [28] Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-Grained Student Ensemble
    Qu, Xiaoye
    Zeng, Jun
    Liu, Daizong
    Wang, Zhefeng
    Huai, Baoxing
    Zhou, Pan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13501 - 13509
  • [29] Fine-Grained Chinese Named Entity Recognition Based on MacBERT-Attn-BiLSTM-CRF Model
    Wang, Jueyang
    Li, Shuzhen
    Agyemang-Duah, Edward
    Feng, Xingyu
    Xu, Chun
    Ji, Yuao
    Liu, Junqiang
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 125 - 131
  • [30] SemEval-2023 Task 2: Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2)
    Fetahu, Besnik
    Kar, Sudipta
    Chen, Zhiyu
    Rokhlenko, Oleg
    Malmasi, Shervin
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2247 - 2265