Adaptive Interpolation of Multidimensional Scaling

被引:4
作者
Bae, Seung-Hee [1 ]
Qiu, Judy [1 ]
Fox, Geoffrey [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012 | 2012年 / 9卷
关键词
dimension reduction; multidimensional scaling; interpolation; adaptation; MDS;
D O I
10.1016/j.procs.2012.04.042
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The recent explosion of publicly available biology gene sequences and chemical compounds offers an unprecedented opportunity for data mining. To make data analysis feasible for such vast volume and high-dimensional scientific data, we apply high performance dimension reduction algorithms. It facilitates the investigation of unknown structures in a three dimensional visualization. Among the known dimension reduction algorithms, we utilize the multidimensional scaling (MDS) algorithm to configure the given high-dimensional or abstract data into a target dimension. However, the MDS algorithm requires large physical memory as well as computational resources. In order to reduce computational complexity and memory requirement effectively, the interpolation method of the MDS was proposed in 2010. With minor trade-off of approximation, the MDS interpolation method enables us to process millions of data points with modest amounts of computation and memory requirement. In this paper, we would like to improve the mapping quality of the MDS interpolation approach by adapting the original dissimilarity based on the ratio between the original dissimilarity and the corresponding mapping distances. Our experimental results illustrate that the quality of interpolated mapping results are improved by adding the adaptation step without runtime loss compared to the original interpolation method. With the proposed adaptive interpolation method, we construct a better configuration of millions of out-of-sample data into a target dimension than the previous interpolation method.
引用
收藏
页码:393 / 402
页数:10
相关论文
共 22 条
[1]  
[Anonymous], 1978, Multidimensional scaling
[2]  
[Anonymous], 1952, Psychometrika
[3]  
Bae S.-H., 2010, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, Chicago, Illinois, P203, DOI DOI 10.1145/1851476.1851501
[4]  
Bengio Y, 2004, ADV NEUR IN, V16, P177
[5]   GTM: The generative topographic mapping [J].
Bishop, CM ;
Svensen, M ;
Williams, CKI .
NEURAL COMPUTATION, 1998, 10 (01) :215-234
[6]  
Bishop CM, 1997, ADV NEUR IN, V9, P354
[7]  
Borg I., 2005, Modern multidimensional scaling: theory and applications
[8]  
Choi J. Y., P 10 IEEE ACM INT C, P331
[9]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[10]  
De Leeuw J., 1977, RECENT DEV STAT, V1, P133