Information Extraction: Evaluating Named Entity Recognition from Classical Malay Documents

被引:0
|
作者
Sazali, Siti Syakirah [1 ]
Rahman, Nurazzah Abdul [1 ]
Abu Bakar, Zainab [2 ]
机构
[1] Univ Teknol MARA, Fac Comp & Math Sci, Shah Alam, Selangor, Malaysia
[2] Al Madinah Int Univ, Fac Comp & Informat Technol, Shah Alam, Selangor, Malaysia
来源
2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP) | 2016年
关键词
component; bahasa melayu; information extraction; malay language; named entity recognition; natural language processing; nouns; nouns extraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural Language Processing (NLP) is an important field of research in Computer Science. NLP is the process of analyzing texts based on a set of theories and technologies, and recent studies focused more on Information Extraction (IE). In Information Extraction, there are few steps or commonly known as task to be followed, which are named entity recognition, relation detection and classification, temporal and event processing, and template filling. Recent researches in Malay languages mainly focused on newspaper articles and since this research experiment is experimenting on classical documents, there is a need to identify the best way to extract noun from existing methods. This paper proposes to conduct a research about extracting nouns from Malay classical documents. The result shows that experiment using the Noun Extraction using Morphological Rules (Verb, Adjective and Noun Affixes) that has 77.61% chances of identifying a noun to contribute to the existing Malay noun list. As there is not any existing completed Malay noun list or dictionary that can be used as a guide, the results extracted still need to be judged by the language experts.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [11] A step towards information extraction: Named entity recognition in Bangla using deep learning
    Karim, Redwanul
    Islam, M. A. Muhiminul
    Simanto, Sazid Rahman
    Chowdhury, Saif Ahmed
    Roy, Kalyan
    Al Neon, Adnan
    Hasan, Md. Sajid
    Firoze, Adnan
    Rahman, Rashedur M.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (06) : 7401 - 7413
  • [12] A Survey of Named-Entity Recognition Methods for Food Information Extraction
    Popovski, Gorjan
    Seljak, Barbara Korousic
    Eftimov, Tome
    IEEE ACCESS, 2020, 8 : 31586 - 31594
  • [13] An Enhanced Malay Named Entity Recognition using Combination Approach for Crime Textual Data Analysis
    Asmai, Siti Azirah
    Salleh, Muhammad Sharilazlan
    Basiron, Halizah
    Ahmad, Sabrina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 474 - 483
  • [14] Named Entity Recognition and Classification in Historical Documents: A Survey
    Ehrmann, Maud
    Hamdi, Ahmed
    Pontes, Elvys Linhares
    Romanello, Matteo
    Doucet, Antoine
    ACM COMPUTING SURVEYS, 2024, 56 (02)
  • [15] A Dataset of German Legal Documents for Named Entity Recognition
    Leitner, Elena
    Rehm, Georg
    Moreno-Schneider, Julian
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4478 - 4485
  • [16] ArRaNER: A novel named entity recognition model for biomedical literature documents
    R. Ramachandran
    K. Arutchelvan
    The Journal of Supercomputing, 2022, 78 : 16498 - 16511
  • [17] IMPROVING CHINESE NAMED ENTITY RECOGNITION WITH LEXICAL INFORMATION
    Fu, Guo-Hong
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 3487 - 3491
  • [18] ArRaNER: A novel named entity recognition model for biomedical literature documents
    Ramachandran, R.
    Arutchelvan, K.
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (14): : 16498 - 16511
  • [19] Named Entity Recognition and Relation Extraction: State-of-the-Art
    Nasar, Zara
    Jaffry, Syed Waqar
    Malik, Muhammad Kamran
    ACM COMPUTING SURVEYS, 2021, 54 (01)
  • [20] Named Entity Recognition in Classical Chinese by Lexicon Enhancement
    Yu, Jianye
    Feng, Xiangyilan
    Li, Jie
    Liu, Jialin
    2023 IEEE INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2023, : 463 - 468