Named Entity Recognition for Mongolian Language

被引:9
|
作者
Munkhjargal, Zoljargal [1 ]
Bella, Gabor [2 ]
Chagnaa, Altangerel [1 ]
Giunchiglia, Fausto [2 ]
机构
[1] Natl Univ Mongolia, DICS, Ulaanbaatar 14200, Mongolia, Mongolia
[2] Univ Trent, DISI, I-38100 Trento, Italy
来源
TEXT, SPEECH, AND DIALOGUE (TSD 2015) | 2015年 / 9302卷
关键词
Mongolian named entity recognition; Genetic algorithm; Machine learning; String matching;
D O I
10.1007/978-3-319-24033-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a pioneering work on building a Named Entity Recognition system for the Mongolian language, with an agglutinative morphology and a subject-object-verb word order. Our work explores the fittest feature set from a wide range of features and a method that refines machine learning approach using gazetteers with approximate string matching, in an effort for robust handling of out-of-vocabulary words. As well as we tried to apply various existing machine learning methods and find optimal ensemble of classifiers based on genetic algorithm. The classifiers uses different feature representations. The resulting system constitutes the first-ever usable software package for Mongolian NER, while our experimental evaluation will also serve as a much-needed basis of comparison for further research.
引用
收藏
页码:243 / 251
页数:9
相关论文
共 50 条
  • [1] Learning Morpheme Representation for Mongolian Named Entity Recognition
    Weihua Wang
    Feilong Bao
    Guanglai Gao
    Neural Processing Letters, 2019, 50 : 2647 - 2664
  • [2] Cyrillic Mongolian Named Entity Recognition with Rich Features
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 497 - 505
  • [3] MTNER: A Corpus for Mongolian Tourism Named Entity Recognition
    Cheng, Xiao
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    MACHINE TRANSLATION, CCMT 2020, 2020, 1328 : 11 - 23
  • [4] Mongolian Named Entity Recognition using Suffixes Segmentation
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 169 - 172
  • [5] Learning Morpheme Representation for Mongolian Named Entity Recognition
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    NEURAL PROCESSING LETTERS, 2019, 50 (03) : 2647 - 2664
  • [6] Named Entity Recognition in Marathi Language
    Kale, Shrutika
    Govilkar, Sharvari
    INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 371 - 377
  • [7] Named Entity Recognition for Nepali Language
    Singh, Oyesh Mann
    Padia, Ankur
    Joshi, Anupam
    2019 IEEE 5TH INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC 2019), 2019, : 184 - 190
  • [8] Named entity recognition for the Kazakh language
    Kozhirbayev, Z. M.
    Yessenbayev, Z. A.
    JOURNAL OF MATHEMATICS MECHANICS AND COMPUTER SCIENCE, 2020, 107 (03): : 57 - 66
  • [9] Named Entity Recognition for Sinhala Language
    Dahanayaka, J. K.
    Weerasinghe, A. R.
    14TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) 2014, 2014, : 215 - 220
  • [10] Named Entity Recognition for the Azerbaijani Language
    Akhundova, Natavan
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,