Information Extraction: Evaluating Named Entity Recognition from Classical Malay Documents

被引:0
|
作者
Sazali, Siti Syakirah [1 ]
Rahman, Nurazzah Abdul [1 ]
Abu Bakar, Zainab [2 ]
机构
[1] Univ Teknol MARA, Fac Comp & Math Sci, Shah Alam, Selangor, Malaysia
[2] Al Madinah Int Univ, Fac Comp & Informat Technol, Shah Alam, Selangor, Malaysia
来源
2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP) | 2016年
关键词
component; bahasa melayu; information extraction; malay language; named entity recognition; natural language processing; nouns; nouns extraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural Language Processing (NLP) is an important field of research in Computer Science. NLP is the process of analyzing texts based on a set of theories and technologies, and recent studies focused more on Information Extraction (IE). In Information Extraction, there are few steps or commonly known as task to be followed, which are named entity recognition, relation detection and classification, temporal and event processing, and template filling. Recent researches in Malay languages mainly focused on newspaper articles and since this research experiment is experimenting on classical documents, there is a need to identify the best way to extract noun from existing methods. This paper proposes to conduct a research about extracting nouns from Malay classical documents. The result shows that experiment using the Noun Extraction using Morphological Rules (Verb, Adjective and Noun Affixes) that has 77.61% chances of identifying a noun to contribute to the existing Malay noun list. As there is not any existing completed Malay noun list or dictionary that can be used as a guide, the results extracted still need to be judged by the language experts.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [21] Transformer based named entity recognition for place name extraction from unstructured text
    Berragan, Cillian
    Singleton, Alex
    Calafiore, Alessia
    Morley, Jeremy
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2023, 37 (04) : 747 - 766
  • [22] Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents
    Francis, Sumam
    Van Landeghem, Jordy
    Moens, Marie-Francine
    INFORMATION, 2019, 10 (08)
  • [23] Benchmarking Named Entity Recognition Approaches for Extracting Research Infrastructure Information from Text
    Cheirmpos, Georgios
    Tabatabaei, Seyed Amin
    Kanoulas, Evangelos
    Tsatsaronis, Georgios
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 : 131 - 141
  • [24] A Named Entity and Relationship Extraction Method from Trouble-Shooting Documents in Korean
    Jeong, Minkyu
    Suh, Hyowon
    Lee, Heejung
    Lee, Jae Hyun
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [25] Joint Learning of Named Entity Recognition and Relation Extraction
    Xu, Qiuyan
    Li, Fang
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1978 - 1982
  • [26] Evaluating named entity recognition tools for extracting social networks from novels
    Dekker, Niels
    Kuhn, Tobias
    van Erp, Marieke
    PEERJ COMPUTER SCIENCE, 2019, 2019 (04)
  • [27] Named Entity Recognition Through Learning from Experts
    Andrews, Martin
    INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2015, 2016, 5 : 281 - 292
  • [28] Learning multilingual named entity recognition from Wikipedia
    Nothman, Joel
    Ringland, Nicky
    Radford, Will
    Murphy, Tara
    Curran, James R.
    ARTIFICIAL INTELLIGENCE, 2013, 194 : 151 - 175
  • [29] Sensitive Information Detection Adopting Named Entity Recognition: A Proposed Methodology
    Campanile, Lelio
    de Biase, Maria Stella
    Marrone, Stefano
    Marulli, Fiammetta
    Raimondo, Mariapia
    Verde, Laura
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV, 2022, 13380 : 377 - 388
  • [30] Named entity recognition for Chinese judgment documents based on BiLSTM and CRF
    Wenming Huang
    Dengrui Hu
    Zhenrong Deng
    Jianyun Nie
    EURASIP Journal on Image and Video Processing, 2020