Visplause: Visual Data Quality Assessment of Many Time Series Using Plausibility Checks

被引:47
作者
Arbesser, Clemens [1 ]
Spechtenhauser, Florian [1 ]
Muehlbacher, Thomas [1 ]
Piringer, Harald [1 ]
机构
[1] VrVis Res Ctr, Donau City Str 1, A-1220 Vienna, Austria
关键词
Data Quality Assessment; High-Dimensional Data; Hierarchical Aggregation; Linked Views; ANOMALY DETECTION; VISUALIZATION;
D O I
10.1109/TVCG.2016.2598592
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Trends like decentralized energy production lead to an exploding number of time series from sensors and other sources that need to be assessed regarding their data quality (DQ). While the identification of DQ problems for such routinely collected data is typically based on existing automated plausibility checks, an efficient inspection and validation of check results for hundreds or thousands of time series is challenging. The main contribution of this paper is the validated design of Visplause, a system to support an efficient inspection of DQ problems for many time series. The key idea of Visplause is to utilize meta- information concerning the semantics of both the time series and the plausibility checks for structuring and summarizing results of DQ checks in a flexible way. Linked views enable users to inspect anomalies in detail and to generate hypotheses about possible causes. The design of Visplause was guided by goals derived from a comprehensive task analysis with domain experts in the energy sector. We reflect on the design process by discussing design decisions at four stages and we identify lessons learned. We also report feedback from domain experts after using Visplause for a period of one month. This feedback suggests significant efficiency gains for DQ assessment, increased confidence in the DQ, and the applicability of Visplause to summarize indicators also outside the context of DQ.
引用
收藏
页码:641 / 650
页数:10
相关论文
共 45 条
[1]  
Alsallakh B., 2014, EuroVis (STARs), DOI DOI 10.2312/EUROVISSTAR.20141170
[2]  
[Anonymous], 2008, VISUAL ANAL SCOPE CH
[3]  
[Anonymous], 2000, IEEE Data Eng. Bull.
[4]  
Batini C., 2006, Data_Quality:_Concepts, Methodologies_and_Techniques, P19, DOI DOI 10.1007/3-540-33173-5_2
[5]   Matches, Mismatches, and Methods: Multiple-View Workflows for Energy Portfolio Analysis [J].
Brehmer, Matthew ;
Ng, Jocelyn ;
Tate, Kevin ;
Munzner, Tamara .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) :449-458
[6]  
Dasu T., 2003, Exploratory Data Mining and Data Cleaning, Vfirst
[7]   Parallel Prototyping Leads to Better Design Results, More Divergence, and Increased Self-Efficacy [J].
Dow, Steven P. ;
Glassco, Alana ;
Kass, Jonathan ;
Schwarz, Melissa ;
Schwartz, Daniel L. ;
Klemmer, Scott R. .
ACM TRANSACTIONS ON COMPUTER-HUMAN INTERACTION, 2010, 17 (04)
[8]  
Eaton C, 2005, LECT NOTES COMPUT SC, V3585, P861, DOI 10.1007/11555261_68
[9]   Hierarchical Aggregation for Information Visualization: Overview, Techniques, and Design Guidelines [J].
Elmqvist, Niklas ;
Fekete, Jean-Daniel .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (03) :439-454
[10]  
Fernstad S. J., 2014, P IEEE INFOVIS