Survey on Thai NLP Language Resources and Tools

Cited by: 0
Authors
Arreerard, Ratchakrit [1]
Mander, Stephen [1]
Piao, Scott [1]
Affiliations
[1] Univ Lancaster, Sch Comp & Commun, Lancaster LA1 4WA, England
Source
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022
Keywords
Natural Language Processing; Thai NLP; Survey; Language Resource; NLP tools;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Over the past decades, Natural Language Processing (NLP) research has been expanding to cover more languages. In recent years in particular, the NLP community has paid increasing attention to under-resourced languages. However, there are still many languages for which NLP research is limited in terms of both language resources and software tools. Thai is one of the under-resourced languages in the NLP domain, although it is spoken by nearly 70 million people globally. In this paper, we report on our survey of the past development of Thai NLP research to help understand its current state and future research directions. Our survey shows that, although the Thai NLP community has made significant progress over the past three decades, particularly on upstream NLP tasks such as tokenisation, research on downstream tasks such as syntactic parsing and semantic analysis is still limited. Nevertheless, we foresee that Thai NLP research will advance rapidly as richer Thai language resources and more robust NLP techniques become available.
Pages: 6495 - 6505
Number of pages: 11
Related papers
50 in total
  • [21] Opinion Mining and Thought Pattern Classification with Natural Language Processing (NLP) Tools
    Naqvi, Sayyada Muntaha Azim
    Awais, Muhammad
    Saeed, Muhammad Yahya
    Mohsin, Muhammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (10) : 485 - 493
  • [22] Italian NLP for Everyone: Resources and Models from EVALITA to the European Language Grid
    Basile, Valerio
    Bosco, Cristina
    Fell, Michael
    Patti, Viviana
    Varvara, Rossella
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 174 - 180
  • [23] A Survey of Tools and Resources for the Next Generation Analyst
    Hall, David L.
    Graham, Jake
    Catherman, Emily
    NEXT-GENERATION ANALYST III, 2015, 9499
  • [24] Problems and resources in analyzing Northern Thai conversation for English language readers
    Bilmes, J
    JOURNAL OF PRAGMATICS, 1996, 26 (02) : 171 - 188
  • [25] Language resources and tools for supporting the system engineering process
    Onditi, VO
    Rayson, P
    Ransom, B
    Ramduny, D
    Sommerville, I
    Dix, A
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2004, 3136 : 147 - 158
  • [26] Developing New Linguistic Resources and Tools for the Galician Language
    Agerri, Rodrigo
    Gomez Guinovart, Xavier
    Rigau, German
    Solla Portela, Miguel Anxo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2322 - 2325
  • [27] Introduction to the special issue on Resources and Tools for Language Learners
    Sharoff, Serge
    Spina, Stefania
    Kokkinakis, Sofie Johansson
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (01) : 1 - 3
  • [28] Detecting Gaps in Language Resources and Tools in the Project CESAR
    Tadic, Marko
    Varadi, Tamas
    Garabik, Radovan
    Koeva, Svetla
    Ogrodniczuk, Maciej
    Vitas, Dusko
    HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 471 - 478
  • [29] The Landscape of Accessibility Tools Requiring Sign Language Resources
    Efthimiou, Eleni
    Fotinea, Stavroula-Evita
    THE 14TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2021, 2021, : 260 - 261
  • [30] Latvian Language Resources and Tools: Assessment, Description and Sharing
    Vasiljevs, Andrejs
    Skadina, Inguna
    HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 265 - 272