An Empirical Study of Model Errors and User Error Discovery and Repair Strategies in Natural Language Database Queries

被引:5
作者
Ning, Zheng [1 ]
Zhang, Zheng [1 ]
Sun, Tianyi [2 ]
Tian, Yuan [3 ]
Zhang, Tianyi [3 ]
Li, Toby Jia-Jun [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] Univ Chicago, Chicago, IL 60637 USA
[3] Purdue Univ, W Lafayette, IN 47907 USA
来源
PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023 | 2023年
关键词
Empirical study; human-computer interaction; database systems; SQL;
D O I
10.1145/3581641.3584067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in machine learning (ML) and natural language processing (NLP) have led to significant improvement in natural language interfaces for structured databases (NL2SQL). Despite the great strides, the overall accuracy of NL2SQL models is still far from being perfect (similar to 75% on the Spider benchmark). In practice, this requires users to discern incorrect SQL queries generated by a model and manually fix them when using NL2SQL models. Currently, there is a lack of comprehensive understanding about the common errors in auto-generated SQLs and the effective strategies to recognize and fix such errors. To bridge the gap, we (1) performed an in-depth analysis of errors made by three state-of-the-art NL2SQL models; (2) distilled a taxonomy of NL2SQL model errors; and (3) conducted a within-subjects user study with 26 participants to investigate the effectiveness of three representative interactive mechanisms for error discovery and repair in NL2SQL. Findings from this paper shed light on the design of future error discovery and repair strategies for natural language data query interfaces.
引用
收藏
页码:633 / 649
页数:17
相关论文
共 70 条
  • [1] Al Shuaily Huda Salim, 2016, International Journal of Social, Behavioral, Educational, Economic, Business and Industrial Engineering, V10, P3095
  • [2] Harnessing Evolution of Multi-Turn Conversations for Effective Answer Retrieval
    Aliannejadi, Mohammad
    Chakraborty, Manajit
    Rissola, Esteban Andres
    Crestani, Fabio
    [J]. CHIIR'20: PROCEEDINGS OF THE 2020 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL, 2020, : 33 - 42
  • [3] Allen James, 2007, P 22 NATL C ARTIFICI, V7, P1514
  • [4] Allen James F., 1996, P 34 ANN M ASS COMP, P62, DOI DOI 10.3115/981863.981872
  • [5] Antoine Axel, 2021, P 2021 CHI C HUM FAC, P1
  • [6] Bridging the Semantic Gap with SQL Query Logs in Natural Language Interfaces to Databases
    Baik, Christopher
    Jagadish, H. V.
    Li, Yunyao
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 374 - 385
  • [7] Explaining Queries over Web Tables to Non-Experts
    Berant, Jonathan
    Deutch, Daniel
    Globerson, Amir
    Milo, Tova
    Wolfson, Tomer
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1570 - 1573
  • [8] Bergamaschi Sonia, 2013, QUEST: a keyword search system for relational data based on semantic and machine learning techniques
  • [9] Bogin B, 2019, Arxiv, DOI arXiv:1905.06241
  • [10] Braun V., 2006, QUAL RES PSYCHOL, V3, P77, DOI [DOI 10.1191/1478088706QP063OA, 10.1191/1478088706qp063oa]