Bag of little bootstraps on features for enhancing classification performance

Cited by: 1
Authors
Wang, Haocheng [1 ,2 ]
Zhuang, Fuzhen [1 ]
Jin, Xin [3 ]
Ao, Xiang [1 ]
He, Qing [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Huawei Technol Co Ltd, Cent Software Inst, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Ensemble learning; bag of little bootstraps on features; high-dimensional data; classification; EXTREME LEARNING-MACHINE;
DOI
10.3233/IDA-160857
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ensemble learning via manipulation of the training set is an effective technique for improving classification accuracy. In this work, we investigate how to combine training-set resampling with the random subspace method in high-dimensional domains. We propose a new procedure, Bag of Little Bootstraps on Features (BLBF), which combines the results of bootstrapping multiple feature subsets of the original dataset, with the subsets drawn using the random subspace method. Our experiments on various high-dimensional datasets show that the proposed approach outperforms the state-of-the-art instance-based resampling algorithm BLB and two related variants in terms of classification performance. We also investigate the effect of the hyperparameters on classification performance and find that they can be set easily while maintaining good performance.
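The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it describes: draw several random feature subsets (random subspace method), bootstrap the training instances within each subset, and combine the resulting classifiers by majority vote. It is not the authors' BLBF procedure. The nearest-centroid base learner, the plain bootstrap (rather than the weighted resampling used by BLB), and all function and parameter names (`fit_ensemble`, `n_subsets`, `subset_size`, `n_boot`) are assumptions made for illustration.

```python
import random
from collections import Counter


def fit_centroids(X, y, feats):
    """Assumed base learner: per-class mean vectors over the features in feats."""
    sums, counts = {}, Counter(y)
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(feats))
        for j, f in enumerate(feats):
            acc[j] += row[f]
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}


def predict_centroids(centroids, row, feats):
    """Return the class whose centroid is closest (squared Euclidean distance)."""
    def dist(c):
        return sum((row[f] - m) ** 2 for f, m in zip(feats, centroids[c]))
    return min(sorted(centroids), key=dist)


def fit_ensemble(X, y, n_subsets=5, subset_size=2, n_boot=5, seed=0):
    """Train n_subsets * n_boot base learners.

    Each learner sees one random feature subset and one bootstrap
    resample (sampling with replacement) of the training instances.
    """
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    models = []
    for _ in range(n_subsets):
        feats = sorted(rng.sample(range(d), subset_size))  # random subspace
        for _ in range(n_boot):
            idx = [rng.randrange(n) for _ in range(n)]  # bootstrap indices
            Xb = [X[i] for i in idx]
            yb = [y[i] for i in idx]
            models.append((feats, fit_centroids(Xb, yb, feats)))
    return models


def predict_ensemble(models, row):
    """Combine the base learners by majority vote (ties go to the smallest label)."""
    votes = Counter(predict_centroids(c, row, feats) for feats, c in models)
    return max(sorted(votes), key=votes.get)
```

In this sketch, `subset_size` controls how many features each base learner sees and `n_boot` how many resamples are drawn per subset; these stand in for the kind of hyperparameters the abstract mentions, without claiming to match the paper's settings.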
Pages: 1085-1099
Number of pages: 15