Thai Knowledge-Augmented Language Model Adaptation (ThaiKALA)

被引:0
作者
Ruangchutiphophan, Pavaris [1 ]
Saetia, Chanatip [1 ]
Ayutthaya, Thititorn Seneewong Na [1 ]
Chalothorn, Tawunrat [1 ]
机构
[1] Kasikorn Business Technol Grp, Bangkok, Thailand
来源
2023 18TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING, ISAI-NLP | 2023年
关键词
Knowledge-Augmented; Language Model; Question Answering;
D O I
10.1109/iSAI-NLP60301.2023.10355001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models have exhibited considerable prowess in diverse NLP tasks, demonstrating promising performance. However, they still have limitations in effectively capturing domain-specific knowledge and contextually-relevant information, resulting in hallucination issues. To address these challenges, This paper presents ThaiKALA, a framework designed for the Thai language to augment domain-specific knowledge into the language model. The framework utilizes three modules to handle Thai language specifically: event extraction, a self-defined ID database, and a multilingual language model. To confirm the performance, the framework is also evaluated with strong generative baselines like GPT-3 and GPT-3.5-turbo-16k. As a result, ThaiKALA, with only Entity Memory, outperforms all baselines including GPT-3 and GPT-3.5 in extractive Question Answering (EQA) tasks, achieving a higher exact match (42.48%) and competitive F1 scores (67.07%). These results demonstrate that ThaiKALA is effective in enhancing the language model's performance on Thai extractive QA by augmenting the extracted knowledge.
引用
收藏
页数:6
相关论文
共 44 条
  • [21] Language model adaptation in Tamil language using cross-lingual latent semantic analysis with document aligned corpora
    Selvam, M.
    Natarajan, A. M.
    CURRENT SCIENCE, 2010, 98 (07): : 922 - 929
  • [22] Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources
    Quynh Thi Ngoc Do
    Bethard, Steven
    Moens, Marie-Francine
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1812 - 1823
  • [23] LANGUAGE MODEL ADAPTATION FOR ACADEMIC LECTURES USING CHARACTER RECOGNITION RESULT OF PRESENTATION SLIDES
    Akita, Yuya
    Tong, Yizheng
    Kawahara, Tatsuya
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5431 - 5435
  • [24] Language model adaptation based on PLSA of topics and speakers for automatic transcription of panel discussions
    Akita, Y
    Kawahara, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 439 - 445
  • [25] Language Model Adaptation Using Latent Dirichlet Allocation and an Efficient Topic Inference Algorithm
    Heidel, Aaron
    Chang, Hung-an
    Lee, Lin-shan
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1145 - +
  • [26] UNSUPERVISED CV LANGUAGE MODEL ADAPTATION BASED ON DIRECT LIKELIHOOD MAXIMIZATION SENTENCE SELECTION
    Shinozaki, Takahiro
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5029 - 5032
  • [27] Enriching contextualized language model from knowledge graph for biomedical information extraction
    Fei, Hao
    Ren, Yafeng
    Zhang, Yue
    Ji, Donghong
    Liang, Xiaohui
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [28] KGGLM: A Generative Language Model for Generalizable Knowledge Graph Representation Learning in Recommendation
    Balloccu, Giacomo
    Boratto, Ludovico
    Fenu, Gianni
    Marras, Mirko
    Soccol, Alessandro
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 1079 - 1084
  • [29] KNOWLEDGE DISTILLATION FROM LANGUAGE MODEL TO ACOUSTIC MODEL: A HIERARCHICAL MULTI-TASK LEARNING APPROACH
    Lee, Mun-Hak
    Chang, Joon-Hyuk
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8392 - 8396
  • [30] Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages
    Jensson, Arnar Thor
    Iwano, Koji
    Furui, Sadaoki
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)