Named Entity Recognition for Hungarian Using Various Machine Learning Algorithms

被引:0
|
作者
Farkas, Richard [1 ]
Szarvast, Gyorgy [2 ]
Kocsor, Andras [1 ]
机构
[1] MTA SZTE Res Grp Artificial Intelligence, Aradi Vertanuk Tere 1, H-6720 Szeged, Hungary
[2] Univ Szeged, Dept Informat, H-6720 Szeged, Hungary
来源
ACTA CYBERNETICA | 2006年 / 17卷 / 03期
关键词
named entity recognition; statistical models; machine learning;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we introduce a statistical Named Entity recognizer (NER) system for the Hungarian language. We examined three methods for identifying and disambiguating proper nouns (Artificial Neural Network, Support Vector Machine, C4.5 Decision Tree), their combinations and the effects of dimensionality reduction as well. We used a segment of Szeged Corpus [5] for training and validation purposes, which consists of short business news articles collected from MTI (Hungarian News Agency, www.mti.hu). Our results were presented at the Second Conference on Hungarian Computational Linguistics [7]. Our system makes use of both language dependent features (describing the orthography of proper nouns in Hungarian) and other, language independent information such as capitalization. Since we avoided the inclusion of large gazetteers of pre-classified entities, the system remains portable across languages without requiring any major modification, as long as the few specialized orthographical and syntactic characteristics are collected for a new target language. The best performing model achieved an F measure accuracy of 91.95%.
引用
收藏
页码:633 / 646
页数:14
相关论文
共 50 条
  • [41] Character Feature Learning for Named Entity Recognition
    Zeng, Ping
    Tan, Qingping
    Zhang, Haoyu
    Meng, Xiankai
    Zhang, Zhuo
    Xu, Jianjun
    Lei, Yan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (07) : 1811 - 1815
  • [42] Named entity recognition based on deep learning
    Ji Z.
    Kong D.
    Liu W.
    Dong W.
    Sang Y.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (06): : 1603 - 1615
  • [43] Turkish Named Entity Recognition with Deep Learning
    Gunes, Asim
    Tantug, A. Cuneyd
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [44] Transfer Learning for Indonesian Named Entity Recognition
    Kosasih, Joshua Aditya
    Khodra, Masayu Leylia
    2018 INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT INFORMATICS (SAIN), 2018, : 173 - 178
  • [45] Multitask Learning for Chinese Named Entity Recognition
    Zhang, Qun
    Li, Zhenzhen
    Feng, Dawei
    Li, Dongsheng
    Huang, Zhen
    Peng, Yuxing
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 653 - 662
  • [46] Deep learning for named entity recognition: a survey
    Hu Z.
    Hou W.
    Liu X.
    Neural Comput. Appl., 16 (8995-9022): : 8995 - 9022
  • [47] A Deep Learning Solution to Named Entity Recognition
    Murthy, V. Rudra
    Bhattacharyya, Pushpak
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 427 - 438
  • [48] A multilingual Named Entity Recognition system using boosting and C4.5 decision tree learning algorithms
    Szarvas, Gyorgy
    Farkas, Richard
    Kocsor, Andras
    DISCOVERY SCIENCE, PROCEEDINGS, 2006, 4265 : 267 - 278
  • [49] Urdu Named Entity Recognition System Using Deep Learning Approaches
    Haq, Rafiul
    Zhang, Xiaowang
    Khan, Wahab
    Feng, Zhiyong
    COMPUTER JOURNAL, 2023, 66 (08): : 1856 - 1869
  • [50] The Named Entity Recognition of Chinese Cybersecurity Using an Active Learning Strategy
    Xie, Bo
    Shen, Guowei
    Guo, Chun
    Cui, Yunhe
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021