Big data analytics with swarm intelligence

被引:35
作者
Cheng, Shi [1 ]
Zhang, Qingyu [2 ,3 ]
Qin, Quande [2 ,3 ,4 ]
机构
[1] Univ Nottingham, Div Comp Sci, Ningbo, Zhejiang, Peoples R China
[2] Shenzhen Univ, Dept Management Sci, Shenzhen, Peoples R China
[3] Res Inst Business Analyt & Supply Chain Managemen, Shenzhen, Peoples R China
[4] Beijing Inst Technol, Ctr Energy & Environm Policy Res, Beijing, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Evolutionary computation; Optimization; Data mining; Big data; Swarm intelligence; Big data analytics; PARTICLE SWARM; OBJECTIVE REDUCTION; OPTIMIZATION; CONVERGENCE; DIVERSITY; FRAMEWORK; COLONY;
D O I
10.1108/IMDS-06-2015-0222
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Purpose - The quality and quantity of data are vital for the effectiveness of problem solving. Nowadays, big data analytics, which require managing an immense amount of data rapidly, has attracted more and more attention. It is a new research area in the field of information processing techniques. It faces the big challenges and difficulties of a large amount of data, high dimensionality, and dynamical change of data. However, such issues might be addressed with the help from other research fields, e.g., swarm intelligence (SI), which is a collection of nature-inspired searching techniques. The paper aims to discuss these issues. Design/methodology/approach - In this paper, the potential application of SI in big data analytics is analyzed. The correspondence and association between big data analytics and SI techniques are discussed. As an example of the application of the SI algorithms in the big data processing, a commodity routing system in a port in China is introduced. Another example is the economic load dispatch problem in the planning of a modern power system. Findings - The characteristics of big data include volume, variety, velocity, veracity, and value. In the SI algorithms, these features can be, respectively, represented as large scale, high dimensions, dynamical, noise/surrogates, and fitness/objective problems, which have been effectively solved. Research limitations/implications - In current research, the example problem of the port is formulated but not solved yet given the ongoing nature of the project. The example could be understood as advanced IT or data processing technology, however, its underlying mechanism could be the SI algorithms. This paper is the first step in the research to utilize the SI algorithm to a big data analytics problem. The future research will compare the performance of the method and fit it in a dynamic real system. Originality/value - Based on the combination of SI and data mining techniques, the authors can have a better understanding of the big data analytics problems, and design more effective algorithms to solve real-world big data analytical problems.
引用
收藏
页码:646 / 666
页数:21
相关论文
共 71 条
[1]  
Abraham A, 2006, IEEE C EVOL COMPUTAT, P1769
[2]   Multiple cooperating swarms for data clustering [J].
Ahmadi, Abbas ;
Karray, Fakhri ;
Kamel, Mohamed .
2007 IEEE SWARM INTELLIGENCE SYMPOSIUM, 2007, :206-+
[3]   Big Data GUEST EDITORS' INTRODUCTION [J].
Alexander, Francis J. ;
Hoisie, Adolfy ;
Szalay, Alexander .
COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (06) :10-12
[4]  
[Anonymous], 2012, P 2012 IEEE C EV COM
[5]  
[Anonymous], 2014, SPRINGER P MATH STAT
[6]  
[Anonymous], 2006, STUDIES COMPUTATIONA
[7]  
[Anonymous], 2008, Proc. of 2008 IEEE Congress on Evolutionary Computation, DOI DOI 10.1109/CEC.2008.4631121
[8]  
[Anonymous], 2012, Mining of massive datasets
[9]  
Bellman R., 1961, Adaptive Control Processes: A Guided Tour, DOI DOI 10.1515/9781400874668
[10]   The balance between proximity and diversity in multiobjective evolutionary algorithms [J].
Bosman, PAN ;
Thierens, D .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2003, 7 (02) :174-188