An improved firefly heuristics for efficient feature selection and its application in big data

Cited by: 0
Authors
Selvi, Senthamil R. [1 ]
Valarmathi, M. L. [2 ]
Affiliations
[1] MIET Coll, Tiruchirappalli, Tamil Nadu, India
[2] Govt Coll Technol, Coimbatore, Tamil Nadu, India
Source
BIOMEDICAL RESEARCH-INDIA | 2017 / Vol. 28
Keywords
Big data; Feature Selection; NP-hard; Firefly Algorithm (FA)
DOI
Not available
Chinese Library Classification number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract
Big Data is highly useful for business applications and is rapidly growing as a domain of the IT industry. It has attracted considerable interest in several areas, including the manufacturing of health care machines, bank transactions, and social media. Because Big Data datasets are large and diverse, the effective representation, access, and analysis of unstructured and semi-structured data remain problematic. Feature selection requires deciding how to search the space of all possible variable subsets, how to assess the prediction performance of a learning machine in order to guide that search, and which predictor to use. An exhaustive search is feasible when the number of variables is small. However, the problem is NP-hard, and the search quickly becomes computationally intractable. A wide range of search strategies may be used instead, including best-first search, branch-and-bound, simulated annealing, and genetic algorithms. This paper proposes a feature selection method based on the Firefly Algorithm (FA) to improve big data analysis. FA is a meta-heuristic technique, modelled on the behaviour of fireflies, for solving optimization problems. The proposed technique was tested on a large Twitter data set, and the results demonstrated its effectiveness.
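The abstract does not give the details of the improved heuristic, so the following is only a minimal sketch of a generic binary firefly search over feature subsets, not the authors' method. Everything named here is an assumption made for illustration: the parameters `beta0`, `gamma`, and `alpha`, the use of normalised Hamming distance between masks, the sigmoid transfer step that turns a continuous move into a bit, and the toy fitness function, which stands in for a wrapper score such as cross-validated classifier accuracy.

```python
import math
import random


def firefly_feature_selection(fitness, n_features, n_fireflies=15, n_iter=40,
                              beta0=1.0, gamma=1.0, alpha=0.2, seed=0):
    """Binary firefly search (illustrative). Returns (best_mask, best_score)."""
    rng = random.Random(seed)
    # Each firefly is a 0/1 mask: 1 means the feature is selected.
    swarm = [[rng.randint(0, 1) for _ in range(n_features)]
             for _ in range(n_fireflies)]
    scores = [fitness(mask) for mask in swarm]
    best_i = max(range(n_fireflies), key=scores.__getitem__)
    best_mask, best_score = swarm[best_i][:], scores[best_i]

    for _ in range(n_iter):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if scores[j] <= scores[i]:
                    continue  # firefly i only moves toward brighter fireflies
                # Normalised Hamming distance plays the role of the distance r.
                r = sum(a != b for a, b in zip(swarm[i], swarm[j])) / n_features
                beta = beta0 * math.exp(-gamma * r * r)  # attractiveness
                new_mask = []
                for d in range(n_features):
                    # Continuous firefly move, then a sigmoid transfer to a bit
                    # (the slope 10.0 is an arbitrary illustrative choice).
                    x = (swarm[i][d]
                         + beta * (swarm[j][d] - swarm[i][d])
                         + alpha * (rng.random() - 0.5))
                    prob = 1.0 / (1.0 + math.exp(-10.0 * (x - 0.5)))
                    new_mask.append(1 if rng.random() < prob else 0)
                swarm[i] = new_mask
                scores[i] = fitness(new_mask)
                if scores[i] > best_score:
                    best_mask, best_score = new_mask[:], scores[i]
    return best_mask, best_score


def make_toy_fitness(relevant, penalty=0.3):
    """Hypothetical fitness: reward relevant features, penalise the rest."""
    def fitness(mask):
        hits = sum(mask[d] for d in relevant)
        extras = sum(mask) - hits
        return hits - penalty * extras
    return fitness


# Usage: search 10 features where indices 0, 3 and 5 are marked as relevant.
toy_fitness = make_toy_fitness([0, 3, 5])
mask, score = firefly_feature_selection(toy_fitness, n_features=10, seed=1)
print(mask, score)
```

In a real wrapper setting, `fitness` would train and score a predictor on the selected columns, which makes each evaluation expensive. That cost is why the swarm size and iteration count matter far more on large data than they do in this toy example.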
Pages: S236-S241
Page count: 6