Bag of little bootstraps on features for enhancing classification performance

Cited by: 1
Authors
Wang, Haocheng [1, 2]
Zhuang, Fuzhen [1]
Jin, Xin [3]
Ao, Xiang [1]
He, Qing [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Huawei Technol Co Ltd, Cent Software Inst, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Ensemble learning; bag of little bootstraps on features; high-dimensional data; classification; EXTREME LEARNING-MACHINE;
DOI
10.3233/IDA-160857
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ensemble learning via manipulating the training set is an effective technique for improving classification accuracy. In this work, we investigate how to combine learning-set resampling with the random subspace method in high-dimensional domains. We propose a new procedure, Bag of Little Bootstraps on Features (BLBF), which works by combining the results of bootstrapping multiple feature subsets of the original dataset using the random subspace method. Our experiments on various high-dimensional datasets demonstrate that the proposed approach outperforms the state-of-the-art instance-based resampling algorithm BLB and two related variants in terms of classification performance. We also investigate the effect of hyperparameters on classification performance; the results show that the parameters can be set easily while maintaining good performance.
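The abstract does not spell out the BLBF procedure, so the following is only a minimal sketch of the general idea it describes: draw several random feature subsets (random subspace method), bootstrap the training instances within each subset, and combine the resulting classifiers. Everything here is an assumption made for illustration: the function names (`blbf_fit`, `blbf_predict`), the parameters (`n_subsets`, `subset_size`, `n_boot`), the nearest-centroid base learner, and the majority-vote combination. It does not reproduce the authors' algorithm.

```python
import random
from collections import Counter


def fit_centroids(X, y, features):
    """Stand-in base learner: per-class mean over the chosen feature indices."""
    sums, counts = {}, Counter()
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for j, f in enumerate(features):
            acc[j] += row[f]
        counts[label] += 1
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}


def predict_centroid(centroids, features, row):
    """Return the class whose centroid is closest (squared Euclidean distance)."""
    def dist(c):
        return sum((row[f] - m) ** 2 for f, m in zip(features, centroids[c]))
    return min(sorted(centroids), key=dist)


def blbf_fit(X, y, n_subsets=5, subset_size=2, n_boot=10, seed=0):
    """Hypothetical sketch: n_subsets random feature subsets, n_boot bootstraps each."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    ensemble = []
    for _ in range(n_subsets):
        # Random subspace step: sample feature indices without replacement.
        features = rng.sample(range(d), subset_size)
        for _ in range(n_boot):
            # Bootstrap step: sample n instances with replacement.
            idx = [rng.randrange(n) for _ in range(n)]
            model = fit_centroids([X[i] for i in idx], [y[i] for i in idx], features)
            ensemble.append((features, model))
    return ensemble


def blbf_predict(ensemble, row):
    """Combine all base classifiers by majority vote (an assumed combination rule)."""
    votes = Counter(predict_centroid(m, f, row) for f, m in ensemble)
    return votes.most_common(1)[0][0]


# Toy data: 10 points near 0 (class 0) and 10 points near 10 (class 1), 4 features.
X = [[c * 10 + 0.1 * ((i + j) % 3) for j in range(4)] for c in (0, 1) for i in range(10)]
y = [c for c in (0, 1) for _ in range(10)]
ensemble = blbf_fit(X, y)
```

In this sketch the ensemble size is `n_subsets * n_boot`, so those two hypothetical parameters trade off feature diversity against instance-level resampling; the abstract reports only that the method's actual hyperparameters are easy to set.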
Pages: 1085-1099
Page count: 15
Related papers
31 in total
  • [1] [Anonymous], 1990, Applied Linear Statistical Models: Regression, Analysis of Variance, and Experimental Designs
  • [2] [Anonymous], MODERN REGRESSION ME
  • [3] [Anonymous], 2008, Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM '08
  • [4] [Anonymous], 1996, J ECON LIT
  • [5] Bay S. D., 1999, Intelligent Data Analysis, V3, P191, DOI 10.1016/S1088-467X(99)00018-9
  • [6] Bickel PJ, 1997, STAT SINICA, V7, P1
  • [7] Bin Linghu, 2010, 2010 3rd International Symposium on Knowledge Acquisition and Modeling (KAM 2010), P80, DOI 10.1109/KAM.2010.5646323
  • [8] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
  • [9] Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets
    Bryll, R
    Gutierrez-Osuna, R
    Quek, F
    [J]. PATTERN RECOGNITION, 2003, 36 (06) : 1291 - 1302
  • [10] Graph Regularized Nonnegative Matrix Factorization for Data Representation
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1548 - 1560