Similarity queries: their conceptual evaluation, transformations, and processing

Cited by: 24
Authors
Silva, Yasin N. [1]
Aref, Walid G. [2]
Larson, Per-Ake [3]
Pearson, Spencer S. [1]
Ali, Mohamed H. [4]
Affiliations
[1] Arizona State Univ, Phoenix, AZ 85069 USA
[2] Purdue Univ, W Lafayette, IN 47907 USA
[3] Microsoft Res, Redmond, WA USA
[4] Microsoft Corp, Redmond, WA 98052 USA
Source
VLDB JOURNAL | 2013 / Volume 22 / Issue 03
Funding
U.S. National Science Foundation;
Keywords
Similarity queries; Query processing; Query transformations; Conceptual evaluation;
DOI
10.1007/s00778-012-0296-4
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Many application scenarios can benefit significantly from identifying and processing similarities in the data. Although some work has extended the semantics of certain operators, such as join and selection, to be aware of data similarities, the role and implementation of similarity-aware operations as first-class database operators have received little study. Furthermore, very little work has addressed the problem of evaluating and optimizing queries that combine several similarity operations. This paper studies similarity queries that contain one or more first-class similarity database operators, such as Similarity Selection, Similarity Join, and Similarity Group-by. In particular, we analyze the implementation techniques of several similarity operators, introduce a consistent and comprehensive conceptual evaluation model for similarity queries, and present a rich set of transformation rules that extend cost-based query optimization to similarity queries.
Pages: 395 - 420
Page count: 26