Named Entity Recognition for Hungarian Using Various Machine Learning Algorithms

被引:0
|
作者
Farkas, Richard [1 ]
Szarvast, Gyorgy [2 ]
Kocsor, Andras [1 ]
机构
[1] MTA SZTE Res Grp Artificial Intelligence, Aradi Vertanuk Tere 1, H-6720 Szeged, Hungary
[2] Univ Szeged, Dept Informat, H-6720 Szeged, Hungary
来源
ACTA CYBERNETICA | 2006年 / 17卷 / 03期
关键词
named entity recognition; statistical models; machine learning;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we introduce a statistical Named Entity recognizer (NER) system for the Hungarian language. We examined three methods for identifying and disambiguating proper nouns (Artificial Neural Network, Support Vector Machine, C4.5 Decision Tree), their combinations and the effects of dimensionality reduction as well. We used a segment of Szeged Corpus [5] for training and validation purposes, which consists of short business news articles collected from MTI (Hungarian News Agency, www.mti.hu). Our results were presented at the Second Conference on Hungarian Computational Linguistics [7]. Our system makes use of both language dependent features (describing the orthography of proper nouns in Hungarian) and other, language independent information such as capitalization. Since we avoided the inclusion of large gazetteers of pre-classified entities, the system remains portable across languages without requiring any major modification, as long as the few specialized orthographical and syntactic characteristics are collected for a new target language. The best performing model achieved an F measure accuracy of 91.95%.
引用
收藏
页码:633 / 646
页数:14
相关论文
共 50 条
  • [21] Ensemble Learning for Named Entity Recognition
    Speck, Rene
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB - ISWC 2014, PT I, 2014, 8796 : 519 - 534
  • [22] Automatic Configuration of Deep Learning Algorithms for an Arabic Named Entity Recognition System
    Azroumahli, Chaimae
    Mouhib, Ibtihal
    El Younoussi, Yacine
    Badir, Hassan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 106 - 113
  • [23] Joint Learning of Named Entity Recognition and Entity Linking
    Martins, Pedro Henrique
    Marinho, Zita
    Martins, Andre F. T.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 190 - 196
  • [24] MetaboListem and TABoLiSTM: Two Deep Learning Algorithms for Metabolite Named Entity Recognition
    Yeung, Cheng S.
    Beck, Tim
    Posma, Joram M.
    METABOLITES, 2022, 12 (04)
  • [25] A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity
    Dasgupta, Soham
    Piplai, Aritran
    Kotal, Anantaa
    Joshi, Anupam
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2596 - 2604
  • [26] Named entity recognition in a Hungarian NL based QA system
    Tikk, Domonkos
    Szidarovszky, P. Ferenc
    Kardkovacs, Zsolt T.
    Magyar, Gabor
    ADVANCES IN INFORMATION SYSTEMS DEVELOPMENT, VOL 1 AND 2: BRIDGING THE GAP BETWEEN ACADEMIA AND INDUSTRY, 2006, : 879 - +
  • [27] Medical Named Entity Recognition Using Weakly Supervised Learning
    Long-Long Ma
    Jie Yang
    Bo An
    Shuaikang Liu
    Gaijuan Huang
    Cognitive Computation, 2022, 14 : 1068 - 1079
  • [28] Named entity recognition using point prediction and active learning
    Kobayashi, Koga
    Wakabayashi, Kei
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 287 - 293
  • [29] Medical Named Entity Recognition Using Weakly Supervised Learning
    Ma, Long-Long
    Yang, Jie
    An, Bo
    Liu, Shuaikang
    Huang, Gaijuan
    COGNITIVE COMPUTATION, 2022, 14 (03) : 1068 - 1079
  • [30] Named Entity Recognition in Malayalam using Fuzzy Support Vector Machine
    Lakshmi, G.
    Panicker, Janu R.
    Meera, M.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), 2016, : 201 - 206