A Bayesian network approach to searching Web databases through keyword-based queries

Cited by: 11
Authors
Calado, P [1]
da Silva, AS
Laender, AHF
Ribeiro-Neto, BA
Vieira, RC
Affiliations
[1] Univ Fed Minas Gerais, Dept Comp Sci, BR-30123970 Belo Horizonte, MG, Brazil
[2] Univ Fed Amazonas, Dept Comp Sci, BR-69077000 Manaus, Amazonas, Brazil
Keywords
Web databases; Bayesian networks; query structuring
DOI
10.1016/j.ipm.2004.03.002
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
On-line information services have become widespread on the Web. However, Web users are non-specialized and have a great variety of interests. Interfaces for Web databases must, therefore, be both simple and uniform. In this paper, we present a solution for querying Web databases using keywords only. A Bayesian network model is used to generate a set of one or more plausible structured queries derived from the initial user input. These queries can then be submitted to Web databases and the retrieved results presented as a set of ranked answers. To structure the user queries, full access to the database is not required. Instead, only a small portion of its content, extracted through a public Web interface, is used by the network model. This approach not only reduces the complexity of existing on-line interfaces, but also offers a solution to the problem of querying several distinct Web databases with a single interface. (C) 2004 Elsevier Ltd. All rights reserved.
Pages: 773-790
Number of pages: 18