On dimensionality reduction of high dimensional data sets

被引:0
作者
Chizi, B [1 ]
Shmilovici, A [1 ]
Maimon, O [1 ]
机构
[1] Tel Aviv Univ, Dept Ind Engn, IL-69978 Tel Aviv, Israel
来源
INTELLIGENT TECHNOLOGIES - THEORY AND APPLICATIONS: NEW TRENDS IN INTELLIGENT TECHNOLOGIES | 2002年 / 76卷
关键词
dimensionality reduction; data mining; logistic regression;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High dimensional databases are demanding in terms of the computational power required for their processing. Dimensionality reduction can effectively reduce the costs of various operations (e.g. classification), This research presents an explanation why dimensionality reduction is often possible with minimum information loss. Three kinds of greedy dimensionality reduction techniques are presented: Information Gain (Entropy), Polytomous Logistic Regression and random removal of attributes. An empirical comparison of the effect of the above methods on 10 benchmark data-sets revealed that a relatively simple logistic regression method provided mostly the best results.
引用
收藏
页码:233 / 238
页数:6
相关论文
共 50 条
  • [21] A sparse grid based method for generative dimensionality reduction of high-dimensional data
    Bohn, Bastian
    Garcke, Jochen
    Griebel, Michael
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 : 1 - 17
  • [22] Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints
    Yan, Su
    Bouaziz, Sofien
    Lee, Dongwon
    Barlow, Jesse
    [J]. NEUROCOMPUTING, 2012, 76 (01) : 114 - 124
  • [23] Linear regression for dimensionality reduction and classification of multi dimensional data
    Rangarajan, L
    Nagabhushan, P
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 193 - 199
  • [24] Robust locally nonlinear embedding (RLNE) for dimensionality reduction of high-dimensional data with noise
    Xu, Yichen
    Li, Eric
    [J]. NEUROCOMPUTING, 2024, 596
  • [25] Self-taught dimensionality reduction on the high-dimensional small-sized data
    Zhu, Xiaofeng
    Huang, Zi
    Yang, Yang
    Shen, Heng Tao
    Xu, Changsheng
    Luo, Jiebo
    [J]. PATTERN RECOGNITION, 2013, 46 (01) : 215 - 229
  • [26] Data-Efficient Dimensionality Reduction and Surrogate Modeling of High-Dimensional Stress Fields
    Samaddar, Anirban
    Ravi, Sandipp Krishnan
    Ramachandra, Nesar
    Luan, Lele
    Madireddy, Sandeep
    Bhaduri, Anindya
    Pandita, Piyush
    Sun, Changjie
    Wang, Liping
    [J]. JOURNAL OF MECHANICAL DESIGN, 2025, 147 (03)
  • [27] An Optimized Dimensionality Reduction Model for High-dimensional Data Based on Restricted Boltzmann Machines
    Zhang, Ke
    Liu, Jianhuan
    Chai, Yi
    Qian, Kun
    [J]. 2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 2963 - 2968
  • [28] Using synthetic data and dimensionality reduction in high-dimensional classification via logistic regression
    Zarei, Shaho
    Mohammadpour, Adel
    [J]. COMPUTATIONAL METHODS FOR DIFFERENTIAL EQUATIONS, 2019, 7 (04): : 626 - 634
  • [29] Storage and Retrieval of Large Data Sets: Dimensionality Reduction and Nearest Neighbour Search
    Chandrasekhar, A. Poorna
    Rani, T. Sobha
    [J]. CONTEMPORARY COMPUTING, 2012, 306 : 262 - 272
  • [30] Proposing a Dimensionality Reduction Technique With an Inequality for Unsupervised Learning from High-Dimensional Big Data
    Ismkhan, Hassan
    Izadi, Mohammad
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (06): : 3880 - 3889