Learning Set Cardinality in Distance Nearest Neighbours

被引:19
作者
Anagnostopoulos, Christos [1 ]
Triantafillou, Peter [1 ]
机构
[1] Univ Glasgow, Sch Comp Sci, Glasgow G12 8QQ, Lanark, Scotland
来源
2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2015年
关键词
Query-driven set cardinality prediction; distance nearest neighbors analytics; hetero-associative competitive learning; local regression vector quantization;
D O I
10.1109/ICDM.2015.17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are important for exploratory data analytics. We focus on the Set Cardinality Prediction (SCP) problem for the answer set of dNN queries. We contribute a novel, query-driven perspective for this problem, whereby answers to previous dNN queries are used to learn the answers to incoming dNN queries. The proposed novel machine learning (ML) model learns the dynamically changing query patterns space and thus it can focus only on the portion of the data being queried. The model enjoys several comparative advantages in prediction error and space requirements. This is in addition to being applicable in environments with sensitive data and/or environments where data accesses are too costly to execute, where the data-centric state-of-the-art is inapplicable and/or too costly. A comprehensive performance evaluation of our model is conducted, evaluating its comparative advantages versus acclaimed methods (i.e., different self-tuning histograms, sampling, multidimensional histograms, and the power-method).
引用
收藏
页码:691 / 696
页数:6
相关论文
共 14 条
[1]  
Bottou L., NIPS 2008, P161
[2]  
Bruno N., ACM SIGMOD02, P263
[3]  
Bruno N., ACM SIGMOD 01, P211
[4]   FUZZY ART - FAST STABLE LEARNING AND CATEGORIZATION OF ANALOG PATTERNS BY AN ADAPTIVE RESONANCE SYSTEM [J].
CARPENTER, GA ;
GROSSBERG, S ;
ROSEN, DB .
NEURAL NETWORKS, 1991, 4 (06) :759-771
[5]   Selectivity estimators for multidimensional range queries over real attributes [J].
Gunopulos, D ;
Kollios, G ;
Tsotras, VJ ;
Domeniconi, C .
VLDB JOURNAL, 2005, 14 (02) :137-154
[6]  
Ioannidis Y., VLDB 03, P19
[7]  
Merz C., 1998, UCI repository of machine learning databases
[8]  
Olken F., SSDBM 90, P92
[9]  
Poosala V., VLDB 97, P486
[10]  
Srivastava U., IEEE ICDE 06, P39