On dimensionality reduction of high dimensional data sets

Cited by: 0
Authors
Chizi, B [1]
Shmilovici, A [1]
Maimon, O [1]
Affiliation
[1] Tel Aviv Univ, Dept Ind Engn, IL-69978 Tel Aviv, Israel
Source
INTELLIGENT TECHNOLOGIES - THEORY AND APPLICATIONS: NEW TRENDS IN INTELLIGENT TECHNOLOGIES | 2002 / Vol. 76
Keywords
dimensionality reduction; data mining; logistic regression;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High-dimensional databases are demanding in terms of the computational power required to process them. Dimensionality reduction can effectively reduce the cost of various operations (e.g., classification). This research explains why dimensionality reduction is often possible with minimal information loss. Three kinds of greedy dimensionality reduction techniques are presented: Information Gain (Entropy), Polytomous Logistic Regression, and random removal of attributes. An empirical comparison of these methods on 10 benchmark data sets revealed that a relatively simple logistic regression method gave the best results in most cases.
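The abstract names Information Gain (Entropy) as one of the greedy techniques but gives no procedure. The following is a minimal sketch of one common reading of that idea, not the authors' implementation: score each categorical attribute by the reduction in class entropy it yields, then keep the k highest-scoring attributes. The function names (entropy, information_gain, rank_attributes, keep_top_k) and the toy data are assumptions made for illustration only.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(column, labels):
    """Class entropy minus the entropy remaining after splitting on one attribute."""
    n = len(labels)
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_attributes(rows, labels):
    """Return attribute indices ordered from highest to lowest information gain."""
    n_attrs = len(rows[0])
    gains = [information_gain([r[j] for r in rows], labels) for j in range(n_attrs)]
    return sorted(range(n_attrs), key=lambda j: gains[j], reverse=True)


def keep_top_k(rows, labels, k):
    """Keep only the k attributes with the highest information gain."""
    keep = sorted(rank_attributes(rows, labels)[:k])
    return [[r[j] for j in keep] for r in rows], keep


# Hypothetical toy data: attribute 0 determines the class, attribute 1 is noise.
rows = [[0, "a"], [0, "b"], [1, "a"], [1, "b"]]
labels = [0, 0, 1, 1]
reduced, kept = keep_top_k(rows, labels, k=1)
```

On this toy data the first attribute separates the classes perfectly while the second carries no class information, so only the first is retained. The sketch assumes discrete attribute values; continuous attributes would need to be discretized first.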
Pages: 233-238
Number of pages: 6
Related papers
50 in total
  • [1] Dimensionality Reduction for Registration of High-Dimensional Data Sets
    Xu, Min
    Chen, Hao
    Varshney, Pramod K.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (08) : 3041 - 3049
  • [2] Dimensionality Reduction Approaches and Evolving Challenges in High Dimensional Data
    Ullah, Adnan
    Qamar, Usman
    Khan, Farhan Hassan
    Bashir, Saba
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND MACHINE LEARNING (IML'17), 2017,
  • [3] Dimensionality reduction of clustered data sets
    Sanguinetti, Guido
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (03) : 535 - 540
  • [4] A Framework for Local Supervised Dimensionality Reduction of High Dimensional Data
    Aggarwal, Charu C.
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 360 - 371
  • [5] Utilizing differential characteristics of high dimensional data as a mechanism for dimensionality reduction
    Xing, Samuel S.
    Islam, Md Tauhidul
    PATTERN RECOGNITION LETTERS, 2022, 157 : 1 - 7
  • [6] Overview and comparative study of dimensionality reduction techniques for high dimensional data
    Ayesha, Shaeela
    Hanif, Muhammad Kashif
    Talib, Ramzan
    INFORMATION FUSION, 2020, 59 : 44 - 58
  • [7] New dimensionality reduction methods for the representation of high dimensional 'omics' data
    Becavin, Christophe
    Benecke, Arndt
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2011, 11 (01) : 27 - 34
  • [8] Frequent item sets based dimensionality reduction algorithm in data mining research
    Bao Yong
    Lu Jia-yuan
    Wu Hui-zhong
    Proceedings of 2005 Chinese Control and Decision Conference, Vols 1 and 2, 2005, : 1433 - 1435
  • [9] Dependence maps, a dimensionality reduction with dependence distance for high-dimensional data
    Lee, Kichun
    Gray, Alexander
    Kim, Heeyoung
    DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 26 (03) : 512 - 532
  • [10] A Positive Region-based Dimensionality Reduction from High Dimensional data
    Dai Zhe
    Liu Jianhui
    2015 8TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI), 2015, : 624 - 628