Survey on Thai NLP Language Resources and Tools

Cited by: 0
Authors
Arreerard, Ratchakrit [1]
Mander, Stephen [1]
Piao, Scott [1]
Affiliation
[1] Univ Lancaster, Sch Comp & Commun, Lancaster LA1 4WA, England
Source
LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022
Keywords
Natural Language Processing; Thai NLP; Survey; Language Resource; NLP tools
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Over the past decades, Natural Language Processing (NLP) research has expanded to cover more languages. Recently in particular, the NLP community has paid increasing attention to under-resourced languages. However, there are still many languages for which NLP research is limited in terms of both language resources and software tools. Thai is one of the under-resourced languages in the NLP domain, although it is spoken by nearly 70 million people globally. In this paper, we report on our survey of the past development of Thai NLP research to help understand its current state and future research directions. Our survey shows that, although the Thai NLP community has made significant progress over the past three decades, particularly on upstream NLP tasks such as tokenisation, research on downstream tasks such as syntactic parsing and semantic analysis is still limited. Nevertheless, we foresee that Thai NLP research will advance rapidly as richer Thai language resources and more robust NLP techniques become available.
Pages: 6495-6505
Number of pages: 11