Proposing a Dimensionality Reduction Technique With an Inequality for Unsupervised Learning from High-Dimensional Big Data

被引:0
作者
Ismkhan, Hassan [1 ]
Izadi, Mohammad [1 ]
机构
[1] Sharif Univ Technol, Fac Comp Engn, Tehran 1458889694, Iran
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2023年 / 53卷 / 06期
关键词
Clustering algorithms; Task analysis; Feature extraction; Unsupervised learning; Dimensionality reduction; Transforms; Standards; Big data; dimensionality reduction (DR); high-dimensional data; k-means; nearest neighbor (NN); K-MEANS;
D O I
10.1109/TSMC.2023.3234227
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
task can be considered as the most important unsupervised learning algorithms. For about all clustering algorithms, finding the Nearest Neighbors of a point within a certain radius r (NN -r), is a critical task. For a high-dimensional dataset, this task becomes too time consuming. This article proposes a simple dimensionality reduction (DR) technique. For point p in d-dimensional space, it produces point p' in d'-dimensional space, where d' << d. In addition, for any pair of points p and q, and their maps p' and q' in the target space, it is proved that |p, q| > |p', q'| is preserved, where |, | used to denote the Euclidean distance between a pair of points. This property can speed up finding NN -r. For a certain radius r, and a pair of points p and q, whenever |p', q'| > r, then q can not be in NN -r of p. Using this trick, the task of finding the NN -r is speeded up. Then, as a case study, it is applied to accelerate the k-means, one of the most famous unsupervised learning algorithms, where it can automatically determine the d'. The proposed NN -r method and the accelerated k-means are compared with recent state-of-the-arts, and both yield favorable results.
引用
收藏
页码:3880 / 3889
页数:10
相关论文
共 50 条
  • [21] Overview and comparative study of dimensionality reduction techniques for high dimensional data
    Ayesha, Shaeela
    Hanif, Muhammad Kashif
    Talib, Ramzan
    INFORMATION FUSION, 2020, 59 : 44 - 58
  • [22] A Novel Angle-Based Learning Framework on Semi-supervised Dimensionality Reduction in High-Dimensional Data with Application to Action Recognition
    Ramezani, Zahra
    Pourdarvish, Ahmad
    Teymourian, Kiumars
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2020, 45 (12) : 11051 - 11063
  • [23] A Novel Angle-Based Learning Framework on Semi-supervised Dimensionality Reduction in High-Dimensional Data with Application to Action Recognition
    Zahra Ramezani
    Ahmad Pourdarvish
    Kiumars Teymourian
    Arabian Journal for Science and Engineering, 2020, 45 : 11051 - 11063
  • [24] Robust locally nonlinear embedding (RLNE) for dimensionality reduction of high-dimensional data with noise
    Xu, Yichen
    Li, Eric
    NEUROCOMPUTING, 2024, 596
  • [25] Data-Efficient Dimensionality Reduction and Surrogate Modeling of High-Dimensional Stress Fields
    Samaddar, Anirban
    Ravi, Sandipp Krishnan
    Ramachandra, Nesar
    Luan, Lele
    Madireddy, Sandeep
    Bhaduri, Anindya
    Pandita, Piyush
    Sun, Changjie
    Wang, Liping
    JOURNAL OF MECHANICAL DESIGN, 2025, 147 (03)
  • [26] Using synthetic data and dimensionality reduction in high-dimensional classification via logistic regression
    Zarei, Shaho
    Mohammadpour, Adel
    COMPUTATIONAL METHODS FOR DIFFERENTIAL EQUATIONS, 2019, 7 (04): : 626 - 634
  • [27] Dimensionality reduction for density ratio estimation in high-dimensional spaces
    Sugiyama, Masashi
    Kawanabe, Motoaki
    Chui, Pui Ling
    NEURAL NETWORKS, 2010, 23 (01) : 44 - 59
  • [28] An adaptive and dynamic dimensionality reduction method for high-dimensional indexing
    Heng Tao Shen
    Xiaofang Zhou
    Aoying Zhou
    The VLDB Journal, 2007, 16 : 219 - 234
  • [29] Effective indexing and searching with dimensionality reduction in high-dimensional space
    Jeong, Seungdo
    Kim, Sang-Wook
    Choi, Byung-Uk
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2016, 31 (04): : 291 - 302
  • [30] An adaptive and dynamic dimensionality reduction method for high-dimensional indexing
    Shen, Heng Tao
    Zhou, Xiaofang
    Zhou, Aoying
    VLDB JOURNAL, 2007, 16 (02) : 219 - 234