An improved firefly heuristics for efficient feature selection and its application in big data

被引:0
作者
Selvi, Senthamil R. [1 ]
Valarmathi, M. L. [2 ]
机构
[1] MIET Coll, Tiruchirappalli, Tamil Nadu, India
[2] Govt Coll Technol, Coimbatore, Tamil Nadu, India
来源
BIOMEDICAL RESEARCH-INDIA | 2017年 / 28卷
关键词
Big data; Feature Selection; NP-hard; Firefly Algorithm (FA);
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Big Data is exceedingly useful for business applications and is fast rising as a domain of the IT industry. It has created considerable interest in several domains, which includes the manufacturing of health care machine, bank transaction, social media, and so on. Due to the diversity and size of datasets in Big Data, effective representation, access as well as analyses of unstructured as well as semi-structured data are still problematic. It is required to determine the way of searching space of all potential variable sub-sets as well as the assessment of prediction performance of learning machines for guiding searches and also which predictor to utilize. Extensive searches may be carried out if the quantity of parameters is not too much. However the issue is NP-Hard and search rapidly turns operationally intractable. Vast set of search schemes may be utilized, which include best-first, branch-and-bound, simulated annealing, genetic algorithm. In the current paper, a features selection method on the basis of Firefly Algorithm ( FA) is suggested to improve the big data analysis. FA meta-heuristic techniques modelled on the behaviour of the fireflies solve the optimization problems. The suggested technique was tested through a huge twitter data set and effectiveness of the proposed method was proven.
引用
收藏
页码:S236 / S241
页数:6
相关论文
共 50 条
  • [21] New heuristics in feature selection for high dimensional data
    Ruiz, Roberto
    AI COMMUNICATIONS, 2007, 20 (02) : 129 - 131
  • [22] Towards Ultrahigh Dimensional Feature Selection for Big Data
    Tan, Mingkui
    Tsang, Ivor W.
    Wang, Li
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 1371 - 1429
  • [23] Enhancing Big Data Feature Selection Using a Hybrid Correlation-Based Feature Selection
    Mohamad, Masurah
    Selamat, Ali
    Krejcar, Ondrej
    Crespo, Ruben Gonzalez
    Herrera-Viedma, Enrique
    Fujita, Hamido
    ELECTRONICS, 2021, 10 (23)
  • [24] Improved multi-layer binary firefly algorithm for optimizing feature selection and classification of microarray data
    Xie, Weidong
    Wang, Linjie
    Yu, Kun
    Shi, Tengfei
    Li, Wei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [25] Investigating Random Undersampling and Feature Selection on Bioinformatics Big Data
    Hasanin, Tawfiq
    Khoshgoftaar, Taghi M.
    Leevy, Joffrey
    Seliya, Naeem
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 346 - 356
  • [26] A greedy feature selection algorithm for Big Data of high dimensionality
    Ioannis Tsamardinos
    Giorgos Borboudakis
    Pavlos Katsogridakis
    Polyvios Pratikakis
    Vassilis Christophides
    Machine Learning, 2019, 108 : 149 - 202
  • [27] Feature Selection and Classification of Big Data Using MapReduce Framework
    Devi, D. Renuka
    Sasikala, S.
    INTELLIGENT COMPUTING, INFORMATION AND CONTROL SYSTEMS, ICICCS 2019, 2020, 1039 : 666 - 673
  • [28] Feature Selection in Big Data using Filter Based Techniques
    Srinivas, Sumitra K.
    Kancharla, Gangadhara Rao
    2019 4TH MEC INTERNATIONAL CONFERENCE ON BIG DATA AND SMART CITY (ICBDSC), 2019, : 139 - 145
  • [29] Feature selection techniques in the context of big data: taxonomy and analysis
    Abdulwahab, Hudhaifa Mohammed
    Ajitha, S.
    Saif, Mufeed Ahmed Naji
    APPLIED INTELLIGENCE, 2022, 52 (12) : 13568 - 13613
  • [30] Feature selection based on an improved cat swarm optimization algorithm for big data classification
    Kuan-Cheng Lin
    Kai-Yuan Zhang
    Yi-Hung Huang
    Jason C. Hung
    Neil Yen
    The Journal of Supercomputing, 2016, 72 : 3210 - 3221