Survey on Thai NLP Language Resources and Tools

被引:0
|
作者
Arreerard, Ratchakrit [1 ]
Mander, Stephen [1 ]
Piao, Scott [1 ]
机构
[1] Univ Lancaster, Sch Comp & Commun, Lancaster LA1 4WA, England
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
Natural Language Processing; Thai NLP; Survey; Language Resource; NLP tools;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Over the past decades, Natural Language Processing (NLP) research has been expanding to cover more languages. Recently particularly, NLP community has paid increasing attention to under-resourced languages. However, there are still many languages for which NLP research is limited in terms of both language resources and software tools. Thai language is one of the under-resourced languages in the NLP domain, although it is spoken by nearly 70 million people globally. In this paper, we report on our survey on the past development of Thai NLP research to help understand its current state and future research directions. Our survey shows that, although Thai NLP community has achieved a significant achievement over the past three decades, particularly on NLP upstream tasks such as tokenisation, research on downstream tasks such as syntactic parsing and semantic analysis is still limited. But we foresee that Thai NLP research will advance rapidly as richer Thai language resources and more robust NLP techniques become available.
引用
收藏
页码:6495 / 6505
页数:11
相关论文
共 50 条
  • [1] A Survey on NLP Resources, Tools, and Techniques for Marathi Language Processing
    Lahoti, Pawan
    Mittal, Namita
    Singh, Girdhari
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (02)
  • [2] A Survey on NLP Resources, Tools, and Techniques for Marathi Language Processing
    Lahoti, Pawan
    Mittal, Namita
    Singh, Girdhari
    ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, 22 (02)
  • [3] Developing Language Resources and NLP Tools for the North Korean Language
    Akdemir, Arda
    Jeon, Yeojoo
    Shibuya, Tetsuo
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5595 - 5600
  • [4] Language resources for Maghrebi Arabic dialects’ NLP: a survey
    Jihene Younes
    Emna Souissi
    Hadhemi Achour
    Ahmed Ferchichi
    Language Resources and Evaluation, 2020, 54 : 1079 - 1142
  • [5] Language resources for Maghrebi Arabic dialects' NLP: a survey
    Younes, Jihene
    Souissi, Emna
    Achour, Hadhemi
    Ferchichi, Ahmed
    LANGUAGE RESOURCES AND EVALUATION, 2020, 54 (04) : 1079 - 1142
  • [6] Language Resources and Tools for Swedish: A Survey
    Elenius, Kjell
    Forsbom, Eva
    Megyesi, Beata
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 600 - 604
  • [7] A survey of Konkani NLP resources
    Rajan, Annie
    Salgaonkar, Ambuja
    Joshi, Ramprasad
    COMPUTER SCIENCE REVIEW, 2020, 38
  • [8] Semantic Information Retrieval: A comparative experimental study of NLP Tools and Language Resources for Arabic
    Soudani, Nadia
    Bounhas, Ibrahim
    Slimani, Yahya
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 879 - 887
  • [9] Resources and components for gujarati NLP systems: a survey
    Desai, Nikita P.
    Dabhi, Vipul K.
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (07) : 5391 - 5409
  • [10] Resources and components for gujarati NLP systems: a survey
    Nikita P. Desai
    Vipul K. Dabhi
    Artificial Intelligence Review, 2022, 55 : 1 - 19