Employing artificial neural networks for constructing metadata-based model to automatically select an appropriate data visualization technique

Cited by: 33
Authors
Muhammad, Tufail [1]
Halim, Zahid [1]
Affiliations
[1] Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Topi, Pakistan
Keywords
Automated visualization selection; Visualization techniques classification; Neural networks; Metadata-based visualization selection; INFORMATION VISUALIZATION; DESIGN; TOOL
DOI
10.1016/j.asoc.2016.08.039
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Advances in computing technology have been instrumental in creating an assortment of powerful information visualization techniques. However, selecting a suitable and effective visualization technique for a specific dataset and data mining task is not trivial. This work automatically selects an appropriate visualization technique based on the given metadata and the task that a user intends to perform. The appropriate visualization is predicted by an artificial neural network (ANN)-based model that classifies the input data into one of eight predefined classes. A purpose-built dataset extracted from existing knowledge in the discipline is used to train the neural network. The dataset covers eight visualization techniques: histogram, line chart, pie chart, scatter plot, parallel coordinates, map, treemap, and linked graph. Various architectures using different numbers of hidden units, hidden layers, and input and output data formats have been evaluated to find the optimal neural network architecture. The performance of the neural networks is measured using the confusion matrix, accuracy, precision, and sensitivity of the classification. The optimal neural network architecture is determined by convergence time and number of iterations. The results obtained from the best ANN architecture are compared with five other classifiers: k-nearest neighbor, naive Bayes, decision tree, random forest, and support vector machine. The proposed system outperforms four classifiers in terms of accuracy and all five classifiers in terms of execution time. The trained neural network is also tested on twenty real-world benchmark datasets, where the proposed approach provides two alternate visualizations, in addition to the most suitable one, for a particular dataset. A qualitative comparison with state-of-the-art approaches is also presented. The results show that the proposed technique assists in selecting an appropriate visualization technique for a given dataset with high accuracy. (C) 2016 Elsevier B.V. All rights reserved.
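The evaluation measures named in the abstract (confusion matrix, accuracy, precision, sensitivity) and the idea of returning two alternates alongside the top prediction can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the integer class encoding, and the example score vector are all hypothetical, and only the eight class labels come from the abstract.

```python
# Illustrative sketch only. Function names, class encoding, and sample
# values are assumptions for this example, not taken from the paper.
CLASSES = ["histogram", "line chart", "pie chart", "scatter plot",
           "parallel coordinates", "map", "treemap", "linked graph"]


def confusion_matrix(y_true, y_pred, n_classes):
    """Build a matrix where rows are true classes and columns are predictions."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def accuracy(cm):
    """Fraction of all samples that lie on the diagonal."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total if total else 0.0


def precision(cm, k):
    """Of the samples predicted as class k, the fraction that truly are k."""
    predicted_k = sum(row[k] for row in cm)
    return cm[k][k] / predicted_k if predicted_k else 0.0


def sensitivity(cm, k):
    """Of the samples that truly are class k, the fraction predicted as k."""
    actual_k = sum(cm[k])
    return cm[k][k] / actual_k if actual_k else 0.0


def top_k(scores, k=3):
    """Return the k class labels with the highest scores, best first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [CLASSES[i] for i in order[:k]]
```

With a hypothetical vector of per-class output scores, `top_k(scores, 3)` yields one primary suggestion followed by two alternates, which mirrors the behavior the abstract describes for the benchmark datasets.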
Pages: 365-384
Page count: 20