Examining the Additivity of Top-k Query Processing Innovations

被引:15
作者
Mackenzie, Joel [1 ]
Moffat, Alistair [1 ]
机构
[1] Univ Melbourne, Melbourne, Vic, Australia
来源
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT | 2020年
基金
澳大利亚研究理事会;
关键词
Query Processing; Dynamic Pruning; Experimentation; Additivity; DOCUMENT; STRATEGIES;
D O I
10.1145/3340531.3412000
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research activity spanning more than five decades has led to index organizations, compression schemes, and traversal algorithms that allow extremely rapid response to ranked queries against very large text collections. However, little attention has been paid to the interactions between these many components, and the additivity of algorithmic improvements has not been explored. Here we examine the extent to which efficiency improvements add up. We employ four query processing algorithms, four compression codecs, and all possible combinations of four distinct further optimizations, and compare the performance of the 256 resulting systems to determine when and how different optimizations interact. Our results over two test collections show that efficiency enhancements are, for the most part, additive, and that there is little risk of negative interactions. In addition, our detailed profiling across this large pool of systems leads to key insights as to why the various individual enhancements work well, and indicates that optimizing "simpler" implementations can result in higher query throughput than is available from non-optimized versions of the more "complex" techniques, with clear implications for the choices needing to be made by practitioners.
引用
收藏
页码:1085 / 1094
页数:10
相关论文
共 65 条
[1]   On the Additivity and Weak Baselines for Search Result Diversification Research [J].
Akcay, Mehmet ;
Altingovde, Ismail Sengor ;
Macdonald, Craig ;
Ounis, Iadh .
ICTIR'17: PROCEEDINGS OF THE 2017 ACM SIGIR INTERNATIONAL CONFERENCE THEORY OF INFORMATION RETRIEVAL, 2017, :109-116
[2]  
Allan J., 2007, P TREC
[3]  
Allan J., 2008, P 17 TEXT RETR C TRE
[4]  
[Anonymous], 2010, NIST SPECIAL PUBLICA
[5]  
Armstrong Timothy G., 2009, P 18 INT C INF KNOWL, P601, DOI DOI 10.1145/1645953.1646031
[6]   Sources of dissolved inorganic carbon in rivers from the Changbaishan area, an active volcanic zone in North Eastern China [J].
Bai X. ;
Chetelat B. ;
Song Y. .
Acta Geochimica, 2017, 36 (03) :410-415
[7]  
Bartolini N, 2009, MSWIM09
[8]  
PROCEEDINGS OF THE 12TH ACM INTERNATIONAL CONFERENCE ON MODELING, ANALYSIS, AND SYSTEMS, P305
[9]   TSP and cluster-based solutions to the reassignment of document identifiers [J].
Blanco, Roi ;
Barreiro, Alvaro .
INFORMATION RETRIEVAL, 2006, 9 (04) :499-517
[10]  
Broder A Z., 2003, Proc. the 12th International Conference on Information and Knowledge Management, P426