An Approach Based on Bayesian Networks for Query Selectivity Estimation

被引:5
|
作者
Halford, Max [1 ,2 ]
Saint-Pierre, Philippe [2 ]
Morvan, Franck [1 ]
机构
[1] Paul Sabatier Univ, IRIT Lab, Toulouse, France
[2] Paul Sabatier Univ, IMT Lab, Toulouse, France
关键词
Query optimisation; Cardinality estimation; Bayesian networks; COMPLEXITY; SIZE;
D O I
10.1007/978-3-030-18579-4_1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The efficiency of a query execution plan depends on the accuracy of the selectivity estimates given to the query optimiser by the cost model. The cost model makes simplifying assumptions in order to produce said estimates in a timely manner. These assumptions lead to selectivity estimation errors that have dramatic effects on the quality of the resulting query execution plans. A convenient assumption that is ubiquitous among current cost models is to assume that attributes are independent with each other. However, it ignores potential correlations which can have a huge negative impact on the accuracy of the cost model. In this paper we attempt to relax the attribute value independence assumption without unreasonably deteriorating the accuracy of the cost model. We propose a novel approach based on a particular type of Bayesian networks called Chow-Liu trees to approximate the distribution of attribute values inside each relation of a database. Our results on the TPC-DS benchmark show that our method is an order of magnitude more precise than other approaches whilst remaining reasonably efficient in terms of time and space.
引用
收藏
页码:3 / 19
页数:17
相关论文
共 50 条
  • [21] Selectivity Estimation in Spatial Networks
    Tiakas, E.
    Papadopoulos, A. N.
    Nanopoulos, A.
    Manolopoulos, Y.
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 852 - 856
  • [22] Selectivity-based XML Query Processing in Structured Peer-to-Peer Networks
    Comito, Carmela
    Talia, Domenico
    Trunfio, Paolo
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 236 - 244
  • [23] Bayesian Regression Based Approach for Beam Deflection Estimation
    Batbooti, Raed S.
    Mohammed, Bassam A.
    Jabbar, Tahseen Ali
    Faisal, Safa H.
    ADVANCES IN SCIENCE AND TECHNOLOGY-RESEARCH JOURNAL, 2023, 17 (03) : 206 - 213
  • [24] Learning Bayesian networks by constrained Bayesian estimation
    Gao Xiaoguang
    Yang Yu
    Guo Zhigao
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2019, 30 (03) : 511 - 524
  • [25] Learning Bayesian networks by constrained Bayesian estimation
    GAO Xiaoguang
    YANG Yu
    GUO Zhigao
    JournalofSystemsEngineeringandElectronics, 2019, 30 (03) : 511 - 524
  • [26] Applying Advanced Methods of Query Selectivity Estimation in Oracle DBMS
    Augustyn, Dariusz R.
    MAN-MACHINE INTERACTIONS, 2009, 59 : 585 - 593
  • [27] The Method of Query Selectivity Estimation for Selection Conditions Based on Sum of Sub-Independent Attributes
    Augustyn, Dariusz Rafal
    MAN-MACHINE INTERACTIONS 3, 2014, 242 : 601 - 609
  • [28] Group recommending:: A methodological approach based on Bayesian networks
    De Campos, Luis M.
    Fernandez-Luna, Juan M.
    Huete, Juan F.
    Rueda-Morales, Miguel A.
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1-2, 2007, : 835 - 844
  • [29] An optimization-based approach for the design of Bayesian networks
    Martinez-Rodriguez, Ana M.
    May, Jerrold H.
    Vargas, Luis G.
    MATHEMATICAL AND COMPUTER MODELLING, 2008, 48 (7-8) : 1265 - 1278
  • [30] Motion estimation in vehicular environments based on Bayesian dynamic networks
    Reyes-Cocoletzi, Lauro
    Olmos-Pineda, Ivan
    Arturo Olvera-Lopez, J.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4673 - 4684