Cube query interestingness: Novelty, relevance, peculiarity and surprise

被引:0
作者
Gkitsakis, Dimos [1 ]
Kaloudis, Spyridon [1 ]
Mouselli, Eirini [2 ]
Peralta, Veronika [3 ]
Marcel, Patrick [4 ]
Vassiliadis, Panos [1 ]
机构
[1] Univ Ioannina, Ioannina, Greece
[2] Natech SA, Ioannina, Greece
[3] Univ Tours, Blois, France
[4] Univ Orleans, Orleans, France
关键词
Interestingness; Data cubes; Business intelligence; Novelty; Surprise; Relevance; Peculiarity; User study; SIMILARITY MEASURES; OLAP; EXPLORATION; PSYCHOLOGY; DISCOVERY; ANALYTICS; CURIOSITY;
D O I
10.1016/j.is.2024.102381
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we discuss methods to assess the interestingness of a query in an environment of data cubes. We assume a hierarchical multidimensional database, storing data cubes and level hierarchies. We start with a comprehensive review of related work in the fields of human behavior studies and computer science. We define the interestingness of a query as a vector of scores along different aspects, like novelty, relevance, surprise and peculiarity and complement this definition with a taxonomy of the information that can be used to assess each of these aspects of interestingness. We provide both syntactic (result-independent) and extensional (result-dependent) checks, measures and algorithms for assessing the different aspects of interestingness in a quantitative fashion. We also report our findings from a user study that we conducted, analyzing the significance of each aspect, its evolution over time and the behavior of the study's participants.
引用
收藏
页数:29
相关论文
共 59 条
[51]   Hints for Reviewing Empirical Work in Software Engineering [J].
Walter F. Tichy .
Empirical Software Engineering, 2000, 5 (4) :309-312
[52]  
Vassiliadis P., 2018, P DOLAP
[53]  
Vassiliadis P, 2023, Arxiv, DOI arXiv:2203.09390
[54]   Beyond roll-up's and drill-down's: An intentional analytics model to reinvent OLAP [J].
Vassiliadis, Panos ;
Marcel, Patrick ;
Rizzi, Stefano .
INFORMATION SYSTEMS, 2019, 85 :68-91
[55]   DataShot: Automatic Generation of Fact Sheets from Tabular Data [J].
Wang, Yun ;
Sun, Zhida ;
Zhang, Haidong ;
Cui, Weiwei ;
Xu, Ke ;
Ma, Xiaojuan ;
Zhang, Dongmei .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (01) :895-905
[56]  
Yao YY, 2006, STUD COMPUT INTELL, V9, P41
[57]   Guided Exploration of Data Summaries [J].
Youngmann, Brit ;
Amer-Yahia, Sihem ;
Personnaz, Aurelien .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (09) :1798-1807
[58]   Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis [J].
Zgraggen, Emanuel ;
Zhao, Zheguang ;
Zeleznik, Robert ;
Kraska, Tim .
PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018), 2018,
[59]   Controlling False Discoveries During Interactive Data Exploration [J].
Zhao, Zheguang ;
De Stefani, Lorenzo ;
Zgraggen, Emanuel ;
Binnig, Carsten ;
Upfal, Eli ;
Kraska, Tim .
SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, :527-540