Named Entity Recognition for Mongolian Language

被引:9
作者
Munkhjargal, Zoljargal [1 ]
Bella, Gabor [2 ]
Chagnaa, Altangerel [1 ]
Giunchiglia, Fausto [2 ]
机构
[1] Natl Univ Mongolia, DICS, Ulaanbaatar 14200, Mongolia, Mongolia
[2] Univ Trent, DISI, I-38100 Trento, Italy
来源
TEXT, SPEECH, AND DIALOGUE (TSD 2015) | 2015年 / 9302卷
关键词
Mongolian named entity recognition; Genetic algorithm; Machine learning; String matching;
D O I
10.1007/978-3-319-24033-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a pioneering work on building a Named Entity Recognition system for the Mongolian language, with an agglutinative morphology and a subject-object-verb word order. Our work explores the fittest feature set from a wide range of features and a method that refines machine learning approach using gazetteers with approximate string matching, in an effort for robust handling of out-of-vocabulary words. As well as we tried to apply various existing machine learning methods and find optimal ensemble of classifiers based on genetic algorithm. The classifiers uses different feature representations. The resulting system constitutes the first-ever usable software package for Mongolian NER, while our experimental evaluation will also serve as a much-needed basis of comparison for further research.
引用
收藏
页码:243 / 251
页数:9
相关论文
共 50 条
  • [21] A comparative study for biomedical named entity recognition
    Xu Wang
    Chen Yang
    Renchu Guan
    [J]. International Journal of Machine Learning and Cybernetics, 2018, 9 : 373 - 382
  • [22] A comparative study for biomedical named entity recognition
    Wang, Xu
    Yang, Chen
    Guan, Renchu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 373 - 382
  • [23] Named Entity Recognition in Manipuri: A Hybrid Approach
    Jimmy, L.
    Kaur, Darvinder
    [J]. LANGUAGE PROCESSING AND KNOWLEDGE IN THE WEB, 2013, 8105 : 104 - 110
  • [24] Domain Adaptation for Named Entity Recognition Using CRFs
    Tian, Tian
    Dinarelli, Marco
    Tellier, Isabelle
    Cardoso, Pedro Dias
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 561 - 565
  • [25] Study of Named Entity Recognition methods in biomedical field
    Sniegula, Anna
    Poniszewska-Maranda, Aneta
    Chomatek, Lukasz
    [J]. 10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS, 2019, 160 : 260 - 265
  • [26] Towards the Named Entity Recognition Methods in Biomedical Field
    Sniegula, Anna
    Poniszewska-Maranda, Aneta
    Chomatek, Lukasz
    [J]. SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 375 - 387
  • [27] Named Entity Recognition using Conditional Random Fields
    Patil, Nita
    Patil, Ajay
    Pawar, B., V
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1181 - 1188
  • [28] JAPANESE NAMED ENTITY RECOGNITION FOR QUESTION ANSWERING SYSTEM
    Liu, Ye
    Ren, Fuji
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 402 - 406
  • [29] A Hybrid Named Entity Recognition System for Aviation Text
    Bharathi, A.
    Ramdin, Robin
    Babu, Preeja
    Menon, Vijay Krishna
    Jayaramakrishnan, Chandrasekhar
    Lakshmikumar, Sudarsan
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01)
  • [30] Named Entity Recognition Using Conditional Random Fields
    Khan, Wahab
    Daud, Ali
    Shahzad, Khurram
    Amjad, Tehmina
    Banjar, Ameen
    Fasihuddin, Heba
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):