On dimensionality reduction of high dimensional data sets

被引:0
作者
Chizi, B [1 ]
Shmilovici, A [1 ]
Maimon, O [1 ]
机构
[1] Tel Aviv Univ, Dept Ind Engn, IL-69978 Tel Aviv, Israel
来源
INTELLIGENT TECHNOLOGIES - THEORY AND APPLICATIONS: NEW TRENDS IN INTELLIGENT TECHNOLOGIES | 2002年 / 76卷
关键词
dimensionality reduction; data mining; logistic regression;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High dimensional databases are demanding in terms of the computational power required for their processing. Dimensionality reduction can effectively reduce the costs of various operations (e.g. classification), This research presents an explanation why dimensionality reduction is often possible with minimum information loss. Three kinds of greedy dimensionality reduction techniques are presented: Information Gain (Entropy), Polytomous Logistic Regression and random removal of attributes. An empirical comparison of the effect of the above methods on 10 benchmark data-sets revealed that a relatively simple logistic regression method provided mostly the best results.
引用
收藏
页码:233 / 238
页数:6
相关论文
共 50 条
[21]   Comparing and Exploring High-Dimensional Data with Dimensionality Reduction Algorithms and Matrix Visualizations [J].
Cutura, Rene ;
Aupetit, Michael ;
Fekete, Jean-Daniel ;
Sedlmair, Michael .
PROCEEDINGS OF THE WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES AVI 2020, 2020,
[22]   A sparse grid based method for generative dimensionality reduction of high-dimensional data [J].
Bohn, Bastian ;
Garcke, Jochen ;
Griebel, Michael .
JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 :1-17
[23]   Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints [J].
Yan, Su ;
Bouaziz, Sofien ;
Lee, Dongwon ;
Barlow, Jesse .
NEUROCOMPUTING, 2012, 76 (01) :114-124
[24]   Linear regression for dimensionality reduction and classification of multi dimensional data [J].
Rangarajan, L ;
Nagabhushan, P .
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 :193-199
[25]   Robust locally nonlinear embedding (RLNE) for dimensionality reduction of high-dimensional data with noise [J].
Xu, Yichen ;
Li, Eric .
NEUROCOMPUTING, 2024, 596
[26]   Data-Efficient Dimensionality Reduction and Surrogate Modeling of High-Dimensional Stress Fields [J].
Samaddar, Anirban ;
Ravi, Sandipp Krishnan ;
Ramachandra, Nesar ;
Luan, Lele ;
Madireddy, Sandeep ;
Bhaduri, Anindya ;
Pandita, Piyush ;
Sun, Changjie ;
Wang, Liping .
JOURNAL OF MECHANICAL DESIGN, 2025, 147 (03)
[27]   Self-taught dimensionality reduction on the high-dimensional small-sized data [J].
Zhu, Xiaofeng ;
Huang, Zi ;
Yang, Yang ;
Shen, Heng Tao ;
Xu, Changsheng ;
Luo, Jiebo .
PATTERN RECOGNITION, 2013, 46 (01) :215-229
[28]   An Optimized Dimensionality Reduction Model for High-dimensional Data Based on Restricted Boltzmann Machines [J].
Zhang, Ke ;
Liu, Jianhuan ;
Chai, Yi ;
Qian, Kun .
2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, :2963-2968
[29]   Using synthetic data and dimensionality reduction in high-dimensional classification via logistic regression [J].
Zarei, Shaho ;
Mohammadpour, Adel .
COMPUTATIONAL METHODS FOR DIFFERENTIAL EQUATIONS, 2019, 7 (04) :626-634
[30]   Storage and Retrieval of Large Data Sets: Dimensionality Reduction and Nearest Neighbour Search [J].
Chandrasekhar, A. Poorna ;
Rani, T. Sobha .
CONTEMPORARY COMPUTING, 2012, 306 :262-272