Thai Knowledge-Augmented Language Model Adaptation (ThaiKALA)

被引:0
作者
Ruangchutiphophan, Pavaris [1 ]
Saetia, Chanatip [1 ]
Ayutthaya, Thititorn Seneewong Na [1 ]
Chalothorn, Tawunrat [1 ]
机构
[1] Kasikorn Business Technol Grp, Bangkok, Thailand
来源
2023 18TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING, ISAI-NLP | 2023年
关键词
Knowledge-Augmented; Language Model; Question Answering;
D O I
10.1109/iSAI-NLP60301.2023.10355001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models have exhibited considerable prowess in diverse NLP tasks, demonstrating promising performance. However, they still have limitations in effectively capturing domain-specific knowledge and contextually-relevant information, resulting in hallucination issues. To address these challenges, This paper presents ThaiKALA, a framework designed for the Thai language to augment domain-specific knowledge into the language model. The framework utilizes three modules to handle Thai language specifically: event extraction, a self-defined ID database, and a multilingual language model. To confirm the performance, the framework is also evaluated with strong generative baselines like GPT-3 and GPT-3.5-turbo-16k. As a result, ThaiKALA, with only Entity Memory, outperforms all baselines including GPT-3 and GPT-3.5 in extractive Question Answering (EQA) tasks, achieving a higher exact match (42.48%) and competitive F1 scores (67.07%). These results demonstrate that ThaiKALA is effective in enhancing the language model's performance on Thai extractive QA by augmenting the extracted knowledge.
引用
收藏
页数:6
相关论文
共 44 条
  • [1] Knowledge-Augmented Mutation-Based Bug Localization for Hardware Design Code
    Wu, Jiang
    Zhang, Zhuo
    Yang, Deheng
    Xu, Jianjun
    He, Jiayu
    Mao, Xiaoguang
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 21 (03)
  • [2] Semantic Similarity Measurement Using Knowledge-Augmented Multiple-prototype Distributed Word Vector
    Lu, Wei
    Shi, Kailun
    Cai, Yuanyuan
    Che, Xiaoping
    INTERNATIONAL JOURNAL OF INTERDISCIPLINARY TELECOMMUNICATIONS AND NETWORKING, 2016, 8 (02) : 45 - 57
  • [3] Language Model Supervision for Handwriting Recognition Model Adaptation
    Tensmeyer, Chris
    Wigington, Curtis
    Davis, Brian
    Stewart, Seth
    Martinez, Tony
    Barrett, William
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 133 - 138
  • [4] Reliable feature selection for language model adaptation
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5089 - 5092
  • [5] Language Model Adaptation for Relevance Feedback in Information Retrieval
    Chang, Ying-Lang
    Chien, Jen-Tzung
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 289 - 292
  • [6] Sentence Selection by Direct Likelihood Maximization for Language Model Adaptation
    Shinozaki, Takahiro
    Kubota, Yu
    Furui, Sadaoki
    Utsunomiya, Eiji
    Shindoh, Yasutaka
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 620 - +
  • [7] Knowledge Distillation Approach for Efficient Internal Language Model Estimation
    Chen, Zhipeng
    Xu, Haihua
    Khassanov, Yerbolat
    He, Yi
    Lu, Lu
    Ma, Zejun
    Wu, Ji
    INTERSPEECH 2023, 2023, : 1339 - 1343
  • [8] CodeKGC: Code Language Model for Generative Knowledge Graph Construction
    Bi, Zhen
    Chen, Jing
    Jiang, Yinuo
    Xiong, Feiyu
    Guo, Wei
    Chen, Huajun
    Zhang, Ningyu
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (03)
  • [9] Adaptation of language model of Information Retrieval for empty answers Problem in databases
    Chellal, Abdelhamid
    Amrouche, Karima
    2015 12TH IEEE INTERNATIONAL CONFERENCE ON PROGRAMMING AND SYSTEMS (ISPS), 2015, : 142 - 148
  • [10] Supervised and unsupervised Web-based language model domain adaptation
    Lecorve, Gwenole
    Dines, John
    Hain, Thomas
    Motlicek, Petr
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 182 - 185