Decoupling SQL query hardness parsing for text-to-SQL

被引:2
作者
Yi, Jiawen [1 ]
Chen, Guo [1 ]
Zhou, Xiaojun [1 ]
机构
[1] Cent South Univ, Dept Automat, Changsha 410083, Hunan, Peoples R China
关键词
Semantic parsing; Natural language process; Decouple analysis;
D O I
10.1016/j.neucom.2024.129293
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fundamental goal of the Text-to-SQL task is to translate natural language question into SQL query. Current research, which has made significant progress, primarily emphasizes the information coupling between natural language questions and schemas. However, the correlation between natural language questions and SQL queries is often overlooked. Decoupling this correlation may simplify the task. In this paper, we introduce an innovative framework based on Decoupling SQL Query Hardness Parsing for Text-to-SQL. The framework decouples the Text-to-SQL task based on query hardness by analyzing natural language questions and highly relevant schemas. It introduces a distributed structure to simplify multi-hardness task into single-hardness task, thereby significantly reducing the parsing burden on language models. We have evaluated our proposed framework and achieved anew state-of-the-art performance in the methods with model scales less than 3B on the Spider 1.0 dev set. Our code is shown at https://github.com/JarvenYi/DQHP.
引用
收藏
页数:13
相关论文
共 47 条
[1]  
Bogin B, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3659
[2]  
Cai RC, 2021, ADV NEUR IN, V34
[3]   A Heterogeneous Graph to Abstract Syntax Tree Framework for Text-to-SQL [J].
Cao, Ruisheng ;
Chen, Lu ;
Li, Jieyu ;
Zhang, Hanchong ;
Xu, Hongshen ;
Zhang, Wangyou ;
Yu, Kai .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13796-13813
[4]  
Cao RS, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), P2541
[5]  
Chen DL, 2020, AAAI CONF ARTIF INTE, V34, P3438
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]  
Fan Yuankai, 2023, 2023 IEEE 39th International Conference on Data Engineering (ICDE), P110, DOI 10.1109/ICDE55515.2023.00016
[8]  
Gan YJ, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, P2030
[9]  
Gao C., 2022, Findings of the Association for Computational Linguistics: EMNLP 2022, P2113
[10]   Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation [J].
Gao, Dawei ;
Wang, Haibin ;
Li, Yaliang ;
Sun, Xiuyu ;
Qian, Yichen ;
Ding, Bolin ;
Zhou, Jingren .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (05) :1132-1145