Stochastic proximity embedding

被引:109
|
作者
Agrafiotis, DK [1 ]
机构
[1] 3 Dimens Pharmaceut Inc, Exton, PA 19341 USA
关键词
stochastic proximity embedding; multidimensional scaling; nonlinear mapping; Sammon mapping; stochastic descent; self-organizing; dimensionality reduction; feature extraction; combinatorial chemistry; data mining; data analysis; pattern recognition; molecular descriptor; molecular similarity; molecular diversity;
D O I
10.1002/jcc.10234
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We introduce stochastic proximity embedding (SPE), a novel self-organizing algorithm for producing meaningful underlying dimensions from proximity data. SPE attempts to generate low-dimensional Euclidean embeddings that best preserve the similarities between a set of related observations. The method starts with an initial configuration, and iteratively refines it by repeatedly selecting pairs of objects at random, and adjusting their coordinates so that their distances on the map match more closely their respective proximities. The magnitude of these adjustments is controlled by a learning rate parameter, which decreases during the course of the simulation to avoid oscillatory behavior. Unlike classical multidimensional scaling (MDS) and nonlinear mapping (NLM), SPE scales linearly with respect to sample size, and can be applied to very large data sets that are intractable by conventional embedding procedures. The method is programmatically simple, robust, and convergent, and can be applied to a wide range of scientific problems involving exploratory data analysis and visualization. (C) 2003 Wiley Periodicals, Inc.
引用
收藏
页码:1215 / 1221
页数:7
相关论文
共 50 条
  • [1] A modified update rule for stochastic proximity embedding
    Rassokhin, DN
    Agrafiotis, DK
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2003, 22 (02) : 133 - 140
  • [2] Stochastic Proximity Embedding: Methods and Applications
    Agrafiotis, Dimitris K.
    Xu, Huafeng
    Zhu, Fangqiang
    Bandyopadhyay, Deepak
    Liu, Pu
    MOLECULAR INFORMATICS, 2010, 29 (11) : 758 - 770
  • [3] Distributed wireless sensor network localization using stochastic proximity embedding
    Aloor, Gopakumar
    Jacob, Lillykutty
    COMPUTER COMMUNICATIONS, 2010, 33 (06) : 745 - 755
  • [4] Unrolr: Structural analysis of protein conformations using stochastic proximity embedding
    Eberhardt, Jerome
    Stote, Roland H.
    Dejaegere, Annick
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2018, 39 (30) : 2551 - 2557
  • [5] Application of stochastic proximity embedding to distance geometry problems
    Kashima, Hiroyuki
    Doi, Shinji
    Kumagai, Sadatoshi
    2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 1321 - +
  • [6] Underwater localization using stochastic proximity embedding and multi-dimensional scaling
    Ameer, P. M.
    Jacob, Lillykutty
    WIRELESS NETWORKS, 2013, 19 (07) : 1679 - 1690
  • [7] Underwater localization using stochastic proximity embedding and multi-dimensional scaling
    P. M. Ameer
    Lillykutty Jacob
    Wireless Networks, 2013, 19 : 1679 - 1690
  • [8] On the effects of permuted input on conformational sampling of drug-like molecules: an evaluation of stochastic proximity embedding
    Agrafiotis, Dirnitris K.
    Bandyopadhyay, Deepak
    Carta, Giorgio
    Knox, Andrew J. S.
    Lloyd, David G.
    CHEMICAL BIOLOGY & DRUG DESIGN, 2007, 70 (02) : 123 - 133
  • [9] PPPNE: Personalized proximity preserved network embedding
    Fan, Ge
    Geng, Biao
    Tao, Jianrong
    Wang, Kai
    Fan, Changjie
    Zeng, Wei
    NEUROCOMPUTING, 2022, 472 (103-112) : 103 - 112
  • [10] Proximity-aware heterogeneous information network embedding
    Zhang, Chen
    Wang, Guodong
    Yu, Bin
    Xie, Yu
    Pan, Ke
    KNOWLEDGE-BASED SYSTEMS, 2020, 193