Interval extreme learning machine for big data based on uncertainty reduction

被引:8
作者
Li, Yingjie [1 ]
Wang, Ran [2 ,3 ]
Shiu, Simon C. K. [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab High Performance Data Min, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Extreme learning machine; interval; uncertainty reduction; big data; SELECTION;
D O I
10.3233/IFS-141520
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Choosing representative samples and removing data redundancy are two key issues in large-scale data classification. This paper proposes a new model, named interval extreme learning machine (ELM), for big data classification with continuous-valued attributes. The interval ELM model is built up based on two techniques, i.e., discretization of conditional attributes and fuzzification of class labels. First, inspired by the traditional decision tree (DT) induction algorithm, each conditional attribute is discretized into a number of intervals based on uncertainty reduction scheme. Then, the center and range of each interval are calculated as the mean and standard deviation of the values in it. Afterwards, the samples in the same intervals with regard to all the conditional attributes are merged as one record, and a fuzzification process is performed on the class labels. As a result, the original data set is transferred into a smaller one with fuzzy classes, and the interval ELM model is developed. Experimental comparisons demonstrate the feasibility and effectiveness of the proposed approach.
引用
收藏
页码:2391 / 2403
页数:13
相关论文
共 34 条
  • [1] [Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
  • [2] On optimization based extreme learning machine in primal for regression and classification by functional iterative method
    Balasundaram, S.
    Gupta, Deepak
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2016, 7 (05) : 707 - 728
  • [3] Billard L, 2000, ST CLASS DAT ANAL, P369
  • [4] Bradley P. S., 1998, INFORMS Journal on Computing, V10, P209, DOI 10.1287/ijoc.10.2.209
  • [5] Bradley P. S., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P82
  • [6] Handwritten character recognition using wavelet energy and extreme learning machine
    Chacko, Binu P.
    Krishnan, V. R. Vimal
    Raju, G.
    Anto, P. Babu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2012, 3 (02) : 149 - 161
  • [7] A survey on feature selection methods
    Chandrashekar, Girish
    Sahin, Ferat
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) : 16 - 28
  • [8] Dougherty J., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P194
  • [9] Forman G., 2003, Journal of Machine Learning Research, V3, P1289, DOI 10.1162/153244303322753670
  • [10] Gioia F., 2005, STAT APPL, V17