Information Extraction: Evaluating Named Entity Recognition from Classical Malay Documents

被引:0
|
作者
Sazali, Siti Syakirah [1 ]
Rahman, Nurazzah Abdul [1 ]
Abu Bakar, Zainab [2 ]
机构
[1] Univ Teknol MARA, Fac Comp & Math Sci, Shah Alam, Selangor, Malaysia
[2] Al Madinah Int Univ, Fac Comp & Informat Technol, Shah Alam, Selangor, Malaysia
来源
2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP) | 2016年
关键词
component; bahasa melayu; information extraction; malay language; named entity recognition; natural language processing; nouns; nouns extraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural Language Processing (NLP) is an important field of research in Computer Science. NLP is the process of analyzing texts based on a set of theories and technologies, and recent studies focused more on Information Extraction (IE). In Information Extraction, there are few steps or commonly known as task to be followed, which are named entity recognition, relation detection and classification, temporal and event processing, and template filling. Recent researches in Malay languages mainly focused on newspaper articles and since this research experiment is experimenting on classical documents, there is a need to identify the best way to extract noun from existing methods. This paper proposes to conduct a research about extracting nouns from Malay classical documents. The result shows that experiment using the Noun Extraction using Morphological Rules (Verb, Adjective and Noun Affixes) that has 77.61% chances of identifying a noun to contribute to the existing Malay noun list. As there is not any existing completed Malay noun list or dictionary that can be used as a guide, the results extracted still need to be judged by the language experts.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [41] A Bank Information Extraction System Based on Named Entity Recognition with CRFs from Noisy Customer Order Texts in Turkish
    Emekligil, Erdem
    Arslan, Secil
    Agin, Onur
    KNOWLEDGE ENGINEERING AND SEMANTIC WEB, KESW 2016, 2016, 649 : 93 - 102
  • [42] Named entity recognition on bio-medical literature documents using hybrid based approach
    Ramachandran, R.
    Arutchelvan, K.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021,
  • [43] On the Assessment of Deep Learning Models for Named Entity Recognition of Brazilian Legal Documents
    Albuquerque, Hidelberg O.
    Souza, Ellen
    Oliveira, Adriano L. I.
    Macedo, David
    Zanchettin, Cleber
    Vitorio, Douglas
    da Silva, Nadia F. F.
    de Carvalho, Andre C. P. L. F.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 93 - 104
  • [44] Named-entity recognition from Greek and English texts
    Karkaletsis, V
    Paliouras, G
    Petasis, G
    Manousopoulou, N
    Spyropoulos, CD
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1999, 26 (02) : 123 - 135
  • [45] Named Entity Recognition From Biomedical Data
    Refaat, Maged
    Rafea, Ahmed
    Gaballah, Nada
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 838 - 844
  • [46] Named-Entity Recognition from Greek and English Texts
    Vangelis Karkaletsis
    Georgios Paliouras
    Georgios Petasis
    Natasa Manousopoulou
    Constantine D. Spyropoulos
    Journal of Intelligent and Robotic Systems, 1999, 26 : 123 - 135
  • [47] Arabic Named Entity Recognition from diverse text types
    Shaalan, Khaled
    Raza, Hafsa
    ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 440 - 451
  • [48] A RoBERTa-GlobalPointer-Based Method for Named Entity Recognition of Legal Documents
    Zhang, Xinrui
    Luo, Xudong
    Wu, Jiaye
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [49] Named Entity Recognition in Semi Structured Documents Using Neural Tensor Networks
    Shehzad, Khurram
    Ul-Hasan, Adnan
    Malik, Muhammad Imran
    Shafait, Faisal
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 398 - 409
  • [50] FINDING BABIES ON SOCIAL MEDIA: A CASE OF NAMED ENTITY RECOGNITION IN VIETNAMESE DOCUMENTS
    Van Pham Hoai
    Loc Nguyen Tan
    Quoc Phan Phu
    Trung Mai Duc
    Tho Quan Thanh
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2018, : 209 - 213