An Algorithm for Visualization of Big Data in a Two-Dimensional Space

被引:0
作者
Wu, Bo [1 ]
Wilamowski, B. M. [1 ]
机构
[1] Auburn Univ, Dept Elect & Comp Engn, Auburn, AL 36849 USA
来源
IECON 2015 - 41ST ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY | 2015年
关键词
big data; visualization; clustering; classification; EIGENMAPS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new algorithm for visualization of high-multidimensional data is described. The algorithm follows several steps. At first, centers representing several categories are selected, and Euclidean distances between these centers are calculated in a high-dimensional space. Then these centers are placed in a 2-dimensional space in such a way that distances in this 2-dimensional space are similar to distances in the high dimensional space. Next individual patterns are placed one-by one in the 2-dimensional space trying to keep the similar distances in a high-dimensional and 2-dimensional space. With this algorithm, it was possible to visualize many high-dimensional data sets. The algorithm was successfully verified in several real life problems. It turned out that in some cases, which were until now considered as not linearly separable, became easily separable once patterns were transformed in the 2-dimensional space using the proposed algorithm.
引用
收藏
页码:53 / 58
页数:6
相关论文
共 20 条
[1]  
Anguita D., 2013, EUROPEANSYMPOSIUMONA, P1
[2]  
[Anonymous], IEEE INT JOINT C P J
[3]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[4]   Deep, Big, Simple Neural Nets for Handwritten Digit Recognition [J].
Ciresan, Dan Claudiu ;
Meier, Ueli ;
Gambardella, Luca Maria ;
Schmidhuber, Juergen .
NEURAL COMPUTATION, 2010, 22 (12) :3207-3220
[5]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[6]   Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data [J].
Donoho, DL ;
Grimes, C .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (10) :5591-5596
[7]  
Dunn J. C., 1973, Journal of Cybernetics, V3, P32, DOI 10.1080/01969727308546046
[8]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[9]   Topological grammars for data approximation [J].
Gorban, A. N. ;
Sumner, N. R. ;
Zinovyev, A. Y. .
APPLIED MATHEMATICS LETTERS, 2007, 20 (04) :382-386
[10]  
Hartigan J., J ROYAL STAT SOC C, V28, P100