A Fast SVD-Hidden-nodes based Extreme Learning Machine for Large-Scale Data Analytics

被引:29
作者
Deng, Wan-Yu [1 ,2 ]
Bai, Zuo [3 ]
Huang, Guang-Bin [3 ]
Zheng, Qing-Hua [4 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp, Shaanxi, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[4] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian, Peoples R China
基金
美国国家科学基金会;
关键词
Extreme Learning Machine; Singular value decomposition; Big data; Big dimensional data; Fast approximation method; DIMENSIONAL FEATURE-SELECTION; CLASSIFICATION; RECOGNITION; NETWORKS; SOLVER;
D O I
10.1016/j.neunet.2015.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big dimensional data is a growing trend that is emerging in many real world contexts, extending from web mining, gene expression analysis, protein-protein interaction to high-frequency financial data. Nowadays, there is a growing consensus that the increasing dimensionality poses impeding effects on the performances of classifiers, which is termed as the "peaking phenomenon'' in the field of machine intelligence. To address the issue, dimensionality reduction is commonly employed as a preprocessing step on the Big dimensional data before building the classifiers. In this paper, we propose an Extreme Learning Machine (ELM) approach for large-scale data analytic. In contrast to existing approaches, we embed hidden nodes that are designed using singular value decomposition (SVD) into the classical ELM. These SVD nodes in the hidden layer are shown to capture the underlying characteristics of the Big dimensional data well, exhibiting excellent generalization performances. The drawback of using SVD on the entire dataset, however, is the high computational complexity involved. To address this, a fast divide and conquer approximation scheme is introduced to maintain computational tractability on high volume data. The resultant algorithm proposed is labeled here as Fast Singular Value Decomposition-Hidden-nodes based Extreme Learning Machine or FSVD-H-ELM in short. In FSVD-H-ELM, instead of identifying the SVD hidden nodes directly from the entire dataset, SVD hidden nodes are derived from multiple random subsets of data sampled from the original dataset. Comprehensive experiments and comparisons are conducted to assess the FSVD-H-ELM against other state-of-the-art algorithms. The results obtained demonstrated the superior generalization performance and efficiency of the FSVD-H-ELM. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:14 / 28
页数:15
相关论文
共 90 条
[1]  
Achlioptas D., 2007, J ACM, V54, P611
[2]  
[Anonymous], 2001, Improving Multiclass Text Classification with the Support Vector Machine
[3]  
[Anonymous], 2012, Proceedings of the fifteenth International Conference on Artificial Intelligence and Statistics
[4]  
[Anonymous], 2004, P 21 INT C MACH LEAR, DOI DOI 10.1145/1015330.1015332
[5]  
[Anonymous], 2012, MATRIX COMPUTATIONS
[6]  
[Anonymous], 2010, Journal of Machine Learning Research, DOI DOI 10.5555/1756006.1859899
[7]  
[Anonymous], 2001, GMD Report 148
[8]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[9]  
[Anonymous], 2009, P 26 ANN INT C MACH, DOI DOI 10.1145/1553374.1553462
[10]   Sparse Extreme Learning Machine for Classification [J].
Bai, Zuo ;
Huang, Guang-Bin ;
Wang, Danwei ;
Wang, Han ;
Westover, M. Brandon .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (10) :1858-1870