Comparative study on the customization of natural language interfaces to databases

被引：2

作者：

Pazos R, Rodolfo A. ^{[1
]}

Aguirre L, Marco A. ^{[1
]}

Gonzalez B, Juan J. ^{[1
]}

Martinez F, Jose A. ^{[1
]}

Perez O, Joaquin ^{[2
]}

Verastegui O, Andres A. ^{[1
]}

机构：

[1] Tecnol Nacl Mexico Inst Tecnol Ciudad Madero, Div Estudios Posgrad Invest, Av 1o Mayo S-No, Ciudad Madero, Tamaulipas, Mexico

[2] Tecnol Nacl Mexico Ctr Nacl Invest & Desarrollo T, Dept Ciencias Comp, Interior Internado Palmira S-N, Cuernavaca, Morelos, Mexico

来源：

SPRINGERPLUS | 2016年 / 5卷

关键词：

Natural language processing; Natural language interface; Databases; Semantic modelling;

D O I：

10.1186/s40064-016-2164-y

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

In the last decades the popularity of natural language interfaces to databases (NLIDBs) has increased, because in many cases information obtained from them is used for making important business decisions. Unfortunately, the complexity of their customization by database administrators make them difficult to use. In order for a NLIDB to obtain a high percentage of correctly translated queries, it is necessary that it is correctly customized for the database to be queried. In most cases the performance reported in NLIDB literature is the highest possible; i.e., the performance obtained when the interfaces were customized by the implementers. However, for end users it is more important the performance that the interface can yield when the NLIDB is customized by someone different from the implementers. Unfortunately, there exist very few articles that report NLIDB performance when the NLIDBs are not customized by the implementers. This article presents a semantically-enriched data dictionary (which permits solving many of the problems that occur when translating from natural language to SQL) and an experiment in which two groups of undergraduate students customized our NLIDB and English language frontend (ELF), considered one of the best available commercial NLIDBs. The experimental results show that, when customized by the first group, our NLIDB obtained a 44.69 % of correctly answered queries and ELF 11.83 % for the ATIS database, and when customized by the second group, our NLIDB attained 77.05 % and ELF 13.48 %. The performance attained by our NLIDB, when customized by ourselves was 90 %.

引用

页数：30

共 27 条

[1] Aguirre M. A., 2014, THESIS I TECNOLOGICO
[2] ANDROUTSOPOULOS I, 1993, P 6 INT C IND ENG AP, P327
[3] [Anonymous], 2001, MACHINE LEARNING ECM
[4] Atis, 2015, ATIS2 LING DAT CONS
[5] Blum A, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P247
[6] C-Phrase, 2010, CONFIGURATION FILE
[7] The economics of natural language interfaces: natural language processing technology as a scarce resource
Conlon, SJ
Conlon, JR
James, TL
[J]. DECISION SUPPORT SYSTEMS, 2004, 38 (01) : 141 - 159
[8] Elf, 2015, NAT LANG DAT INT ELF
[9] Giordani A, 2009, P EUR C MACH LEARN K, P391
[10] Giordani A., 2012, P COLING 2012 POST D, P401

← 1 2 3 →