MR-ELM: a MapReduce-based framework for large-scale ELM training in big data era

Cited by: 0
Authors
Jiaoyan Chen
Huajun Chen
Xiangyi Wan
Guozhou Zheng
Affiliations
[1] Zhejiang University, College of Computer Science and Technology
Source
Neural Computing and Applications | 2016 / Vol. 27
Keywords
Extreme learning machine; Big data; MapReduce; Distributed
DOI
Not available
Abstract
In the big data era, the extreme learning machine (ELM) can be a good solution for learning from large sample data, as it offers high generalization performance and fast training speed. However, emerging big, distributed data blocks may still challenge the method, since they can lead to large-scale training that is hard to finish on a single commodity machine in limited time. In this paper, we propose a MapReduce-based distributed framework named MR-ELM to enable large-scale ELM training. Under the framework, ELM submodels are trained in parallel on the distributed data blocks across the cluster and then combined into a complete single-hidden-layer feedforward neural network. Both the classification and regression capabilities of MR-ELM have been theoretically proven, and its generalization performance is shown, on many typical benchmarks, to be as high as that of the original ELM and some common ELM ensemble methods. Compared with the original ELM and other parallel ELM algorithms, MR-ELM is a general and scalable ELM training framework for both classification and regression, and it is suitable for big data learning in cloud environments, where the data are usually distributed rather than located on one machine.
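The train-then-combine idea in the abstract can be illustrated with a small single-machine sketch. It is not the authors' implementation. The abstract does not state the combination rule, so averaging the submodel outputs is assumed here, since an average of ELMs can be rewritten exactly as one single-hidden-layer network. The tanh activation, the ridge term `reg`, and the names `train_elm`, `combine`, and `predict` are likewise assumptions, and a plain Python loop stands in for the MapReduce job.

```python
import numpy as np

def train_elm(X, y, n_hidden, rng, reg=1e-3):
    """Train one ELM submodel on a single data block (the 'map' step)."""
    W = rng.normal(size=(X.shape[1], n_hidden))  # random input weights, never trained
    b = rng.normal(size=n_hidden)                # random hidden biases
    H = np.tanh(X @ W + b)                       # hidden-layer output matrix
    # Output weights by ridge-regularized least squares (assumed variant).
    beta = np.linalg.solve(H.T @ H + reg * np.eye(n_hidden), H.T @ y)
    return W, b, beta

def combine(submodels):
    """Merge k submodels into one network (the 'reduce' step).

    Assumption: the merged model is the average of the submodels.
    Stacking all hidden nodes side by side and dividing the output
    weights by k gives exactly that average.
    """
    k = len(submodels)
    W = np.hstack([m[0] for m in submodels])
    b = np.concatenate([m[1] for m in submodels])
    beta = np.concatenate([m[2] for m in submodels]) / k
    return W, b, beta

def predict(model, X):
    W, b, beta = model
    return np.tanh(X @ W + b) @ beta

# Toy regression problem split into 4 blocks, one submodel per block.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(2000, 1))
y = np.sin(3 * X[:, 0])
blocks = np.array_split(np.arange(len(X)), 4)
submodels = [train_elm(X[idx], y[idx], 20, rng) for idx in blocks]
model = combine(submodels)
```

Under these assumptions the merged model has 4 × 20 = 80 hidden nodes, and `predict(model, X)` matches the mean of the four submodel predictions up to floating-point error.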
Pages: 101-110
Page count: 9