A Dimensionality Reduction and Reconstruction Method for Data with Multiple Connected Components

Cited by: 0
Authors
Yao, Yuqin [1]
Gao, Yang [2]
Long, Zhiguo [2]
Meng, Hua [1]
Sioutis, Michael [3]
Affiliations
[1] Southwest Jiaotong Univ, Sch Math, Chengdu, Peoples R China
[2] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
[3] Univ Bamberg, Fac Informat Syst & Appl Comp Sci, Bavaria, Germany
Source
2022 IEEE THE 5TH INTERNATIONAL CONFERENCE ON BIG DATA AND ARTIFICIAL INTELLIGENCE (BDAI 2022) | 2022
Keywords
LE; Dimensionality reduction; Manifold learning; Topological connectivity;
DOI
10.1109/BDAI56143.2022.9862787
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the literature on dimensionality reduction, including Spectral Clustering and Laplacian Eigenmaps, one of the core ideas is to reconstruct data based on similarities between data points, which makes the choice of similarity matrix a key factor in the performance of a dimensionality reduction model. Traditional methods for constructing similarity matrices from the characteristics of the data distribution, such as K-nearest neighbor, ε-neighborhood, and Gaussian kernel, have been extensively studied. However, these methods usually focus on only one specific level of the data when considering the similarity between data points, which can seriously impair data reconstruction when the data have a hierarchical, multi-group structure. Specifically, such methods can characterize only the similarity between data points within a group and ignore the similarity between different groups. To overcome this deficiency, this paper proposes a hierarchical approach to similarity matrix construction that introduces strong, weak, intra-cluster, and inter-cluster similarities to describe relations across multiple levels. The proposed method adapts better to complex data with multiple connected components, and its effectiveness is verified in a series of experiments on synthetic and real-world datasets.
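The deficiency described in the abstract can be illustrated with a minimal sketch. This is not the authors' hierarchical method: it only builds the three traditional similarity matrices named above on a hypothetical two-cluster toy dataset and inspects the block of similarities between the two groups. All function names, the toy data, and the parameter values (`k`, `eps`, `sigma`) are assumptions chosen for illustration.

```python
import numpy as np


def pairwise_sq_dists(X):
    """Squared Euclidean distances between all rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sum(diff ** 2, axis=-1)


def knn_similarity(X, k):
    """Symmetric 0/1 K-nearest-neighbor similarity matrix."""
    D = pairwise_sq_dists(X)
    n = X.shape[0]
    W = np.zeros((n, n))
    # Column 0 of the sorted order is the point itself, so skip it.
    idx = np.argsort(D, axis=1)[:, 1:k + 1]
    rows = np.repeat(np.arange(n), k)
    W[rows, idx.ravel()] = 1.0
    # Keep an edge if either endpoint lists the other as a neighbor.
    return np.maximum(W, W.T)


def epsilon_similarity(X, eps):
    """0/1 similarity: connect points closer than eps."""
    W = (pairwise_sq_dists(X) <= eps ** 2).astype(float)
    np.fill_diagonal(W, 0.0)
    return W


def gaussian_similarity(X, sigma):
    """Dense Gaussian-kernel similarity matrix."""
    W = np.exp(-pairwise_sq_dists(X) / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    return W


def count_components(W):
    """Number of connected components of the graph with edges W > 0."""
    n = W.shape[0]
    seen = np.zeros(n, dtype=bool)
    count = 0
    for start in range(n):
        if seen[start]:
            continue
        count += 1
        seen[start] = True
        stack = [start]
        while stack:
            u = stack.pop()
            for v in np.nonzero(W[u] > 0)[0]:
                if not seen[v]:
                    seen[v] = True
                    stack.append(v)
    return count


# Hypothetical toy data: two tight, well-separated groups of 20 points each.
rng = np.random.default_rng(0)
group_a = rng.normal(loc=0.0, scale=0.1, size=(20, 2))
group_b = rng.normal(loc=10.0, scale=0.1, size=(20, 2))
X = np.vstack([group_a, group_b])

W_knn = knn_similarity(X, k=3)
W_eps = epsilon_similarity(X, eps=1.0)
W_gauss = gaussian_similarity(X, sigma=0.5)

# Similarities between the two groups (upper-right block of each matrix).
cross_knn = W_knn[:20, 20:].sum()
cross_eps = W_eps[:20, 20:].sum()
cross_gauss_max = W_gauss[:20, 20:].max()

print("kNN components:", count_components(W_knn), "cross-group edges:", cross_knn)
print("eps components:", count_components(W_eps), "cross-group edges:", cross_eps)
print("largest cross-group Gaussian weight:", cross_gauss_max)
```

In this toy setting the K-nearest-neighbor and ε-neighborhood graphs contain no edges between the two groups, and the Gaussian kernel assigns them weights that are numerically negligible, so a spectral embedding built from any of these matrices carries essentially no information about how the groups relate to one another.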
Pages: 87 - 92
Number of pages: 6