Towards Text-to-SQL over Aggregate Tables

被引:0
作者
Shuqin Li [1 ]
Kaibin Zhou [2 ]
Zeyang Zhuang [2 ]
Haofen Wang [1 ]
Jun Ma [3 ]
机构
[1] College of Design and Innovation, Tongji University
[2] School of Software, Tongji University
[3] School of Automotive Studies, Tongji
关键词
D O I
暂无
中图分类号
TP391.1 [文字信息处理]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-to-SQL aims at translating textual questions into the corresponding SQL queries. Aggregate tables are widely created for high-frequent queries. Although text-to-SQL has emerged as an important task, recent studies paid little attention to the task over aggregate tables. The increased aggregate tables bring two challenges:(1) mapping of natural language questions and relational databases will suffer from more ambiguity,(2) modern models usually adopt self-attention mechanism to encode database schema and question. The mechanism is of quadratic time complexity, which will make inferring more time-consuming as input sequence length grows. In this paper, we introduce a novel approach named WAGG for text-to-SQL over aggregate tables. To effectively select among ambiguous items, we propose a relation selection mechanism for relation computing. To deal with high computation costs, we introduce a dynamical pruning strategy to discard unrelated items that are common for aggregate tables. We also construct a new large-scale dataset Spiderw AGG extended from Spider dataset for validation, where extensive experiments show the effectiveness and efficiency of our proposed method with 4% increase of accuracy and 15% decrease of inference time w.r.t a strong baseline RAT-SQL.
引用
收藏
页码:457 / 474
页数:18
相关论文
共 1 条
  • [1] Break It Down: A Question Understanding Benchmark.[J].Tomer Wolfson;Mor Geva;Ankit Gupta;Matt Gardner;Yoav Goldberg;Daniel Deutch;Jonathan Berant.Transactions of the Association for Computational Linguistics.2020,