Dimensionality Reduction via Graph Structure Learning

被引:59
作者
Mao, Qi [1 ]
Wang, Li [2 ]
Goodison, Steve [3 ]
Sun, Yijun [1 ,4 ]
机构
[1] SUNY Buffalo, Bioinformat Lab, Buffalo, NY 14201 USA
[2] Univ Victoria, Dept Math & Stat, Victoria, BC V8P 5C2, Canada
[3] Mayo Clin, Dept Hlth Sci Res, Jacksonville, FL 32224 USA
[4] SUNY Buffalo, Dept Microbiol & Immunol, Buffalo, NY 14201 USA
来源
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2015年
基金
美国国家科学基金会;
关键词
Dimensionality Reduction; Graph Structure Learning; Clustering; Unsupervised Learning; KERNEL;
D O I
10.1145/2783258.2783309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new dimensionality reduction setting for a large family of real-world problems. Unlike traditional methods, the new setting aims to explicitly represent and learn an intrinsic structure from data in a high-dimensional space, which can greatly facilitate data visualization and scientific discovery in downstream analysis. We propose a new dimensionality-reduction framework that involves the learning of a mapping function that projects data points in the original high-dimensional space to latent points in a low dimensional space that are then used directly to construct a graph. Local geometric information of the projected data is naturally captured by the constructed graph. As a showcase, we develop a new method to obtain a discriminative and compact feature representation for clustering problems. In contrast to assumptions used in traditional clustering methods, we assume that centers of clusters should be close to each other if they are connected in a learned graph, and other cluster centers should be distant. Extensive experiments are performed that demonstrate that the proposed method is able to obtain discriminative feature representations yielding superior clustering performance, and correctly recover the intrinsic structures of various real-world datasets including curves, hierarchies and a cancer progression path.
引用
收藏
页码:765 / 774
页数:10
相关论文
共 34 条
[1]  
Ando RK, 2005, J MACH LEARN RES, V6, P1817
[2]  
[Anonymous], 2012, Matrix Analysis
[3]  
[Anonymous], 2005, Technical report
[4]  
[Anonymous], A Self-organizing Maps application
[5]  
Belkin M., 2001, NIPS
[6]   GTM: The generative topographic mapping [J].
Bishop, CM ;
Svensen, M ;
Williams, CKI .
NEURAL COMPUTATION, 1998, 10 (01) :215-234
[7]  
Boyd S., 2004, CONEX OPTIMIZATION
[8]   Dimension Reduction: A Guided Tour [J].
Burges, Christopher J. C. .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2010, 2 (04) :275-365
[9]  
Cacabelos R, 1999, ALZHEIMER'S DISEASE AND RELATED DISORDERS, P93
[10]   MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799