Translating Natural Language Queries to SQL Using the T5 Model

被引:0
|
作者
Wong, Albert [1 ]
Pham, Lien [1 ]
Lee, Young [2 ]
Chan, Shek [1 ]
Sadaya, Razel [1 ]
Khmelevsky, Youry [3 ]
Clement, Mathias [3 ]
Cheng, Florence Wing Yau [1 ]
Mahony, Joe [4 ]
Ferri, Michael [4 ]
机构
[1] Langara Coll, Math & Stat, Vancouver, BC, Canada
[2] Okanagan Coll, Math & Stat, Kelowna, BC, Canada
[3] Okanagan Coll, Comp Sci, Kelowna, BC, Canada
[4] Harris SmartWorks, Res & Dev, Ottawa, ON, Canada
来源
18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024 | 2024年
基金
加拿大自然科学与工程研究理事会;
关键词
Natural Language Processing; Data Query System; Text-to-SQL; Speech-to-SQL; Deep Learning; Machine Learning; T5; Model; Human-Machine-Systems; Energy Systems;
D O I
10.1109/SysCon61195.2024.10553509
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents the development process of a natural language to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have a 73% and 84% exact match accuracy respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used successfully on a daily basis. The approach used in the model development could be implemented in a similar fashion for other database environments and with a more powerful pre-trained language model.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Natural Language Inference Using LSTM Model with Sentence Fusion
    Zhang, Senlin
    Liu, Siyang
    Liu, Meiqin
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11081 - 11085
  • [32] Enhancing extractive text summarization using natural language processing with an optimal deep learning model
    Hassan, Abdulkhaleq Q. A.
    Al-onazi, Badriyya B.
    Maashi, Mashael
    Darem, Abdulbasit A.
    Abunadi, Ibrahim
    Mahmud, Ahmed
    AIMS MATHEMATICS, 2024, 9 (05): : 12588 - 12609
  • [33] Development of a predictive model for retention in HIV care using natural language processing of clinical notes
    Oliwa, Tomasz
    Furner, Brian
    Schmitt, Jessica
    Schneider, John
    Ridgway, Jessica P.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (01) : 104 - 112
  • [34] Automatic text summarisation system for scientific papers on the basis of T5 model, on-the-fly constructed corpus and citations
    Mosbah M.
    International Journal of Web Engineering and Technology, 2024, 19 (02) : 170 - 193
  • [35] Terminology model discovery using natural language processing and visualization techniques
    Zhou, Li
    Tao, Ying
    Cimino, James J.
    Chen, Elizabeth S.
    Liu, Hongfang
    Lussier, Yves A.
    Hripcsak, George
    Friedman, Carol
    JOURNAL OF BIOMEDICAL INFORMATICS, 2006, 39 (06) : 626 - 636
  • [36] Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing
    Orangi-Fard, Negar
    Akhbardeh, Alireza
    Sagreiya, Hersh
    INFORMATICS-BASEL, 2022, 9 (01):
  • [37] Empirical Auto-Evaluation of Python']Python Code for Performance Analysis of Transformer Network Using T5 Architecture
    Ganguli, Isha
    Bhowmick, Rajat Subhra
    Biswas, Shivam
    Sil, Jaya
    2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 75 - 79
  • [38] Predicting Discharge Disposition Following Meningioma Resection Using a Multi-Institutional Natural Language Processing Model
    Muhlestein, Whitney E.
    Monsour, Meredith A.
    Friedman, Gabriel N.
    Zinzuwadia, Aniket
    Zachariah, Marcus A.
    Coumans, Jean-Valery
    Carter, Bob S.
    Chambless, Lola B.
    NEUROSURGERY, 2021, 88 (04) : 838 - 845
  • [39] Natural Language Processing Based Question Answering Using Vector Space Model
    Jayashree, R.
    Niveditha, N.
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2016, VOL 2, 2017, 547 : 368 - 375
  • [40] Commentary: Predicting Discharge Disposition Following Meningioma Resection Using a Multi-Institutional Natural Language Processing Model
    Zaki, Mark M.
    NEUROSURGERY, 2021, 88 (04) : E321 - E322