Stateful MapReduce Framework for mRMR Feature Selection Using Horizontal Partitioning

被引:0
|
作者
Yelleti, Vivek [1 ]
Prasad, P. S. V. S. Sai [1 ]
机构
[1] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad 500046, Telangana, India
来源
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021 | 2024年 / 13102卷
关键词
Feature Selection; mRMR; Big data; Horizontal partitioning; MapReduce; Iterative MapReduce;
D O I
10.1007/978-3-031-12700-7_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection (FS) is an important pre-processing step in building machine learning models. minimum Redundancy and Maximum Relevance (mRMR) approach has emerged as one of the successful algorithms in obtaining irredundant feature subset involving only bivariate computations. In the current digital age, owing to the prevalence of very large scale datasets, an imminent need has arisen for scalable solutions using distributed/parallel algorithms. MapReduce solutions are proven to be one of the best approaches to design fault-tolerant and scalable solutions. This work analyses the existing Horizontal MapReduce approaches for mRMR feature selection and identifies the limitations thereof. It is observed that existing approaches involve redundant and repetitive computations and lacks a metadata framework to diminish them. This motivated us to propose Horizontal partitioning based MapReduce solutions namely HMR_mRMR, is an Iterative MapReduce algorithms and is designed under Apache Spark. Appropriate usage of metadata framework and solution formulation optimizes the computations in the proposed approaches. The comparative experimental study is conducted with existing approaches to establish the importance of HMR_mRMR.
引用
收藏
页码:317 / 327
页数:11
相关论文
共 50 条
  • [1] Feature Selection Methods in the Framework of mRMR
    Wang, Xiujuan
    Tao, Yuanrui
    Zheng, Kangfeng
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1490 - 1495
  • [2] Feature Selection and Classification of Big Data Using MapReduce Framework
    Devi, D. Renuka
    Sasikala, S.
    INTELLIGENT COMPUTING, INFORMATION AND CONTROL SYSTEMS, ICICCS 2019, 2020, 1039 : 666 - 673
  • [3] Differentially private feature selection under MapReduce framework
    CHEN Kai
    WAN Wen-qiang
    LI Yun
    The Journal of China Universities of Posts and Telecommunications, 2013, (05) : 85 - 90
  • [4] Analog Filter Circuits Feature Selection Using MRMR and SVM
    Sun, Yongkui
    Ma, Lei
    Qin, Na
    Zhang, Meilan
    Lv, Qianyong
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1543 - 1547
  • [5] Improved Measures of Redundancy and Relevance for mRMR Feature Selection
    Jo, Insik
    Lee, Sangbum
    Oh, Sejong
    COMPUTERS, 2019, 8 (02)
  • [6] MRMR Based Feature Selection for the Classification of Stress Using EEG
    Subhani, Ahmad Rauf
    Mumtaz, Wajid
    Kamil, Nidal
    Saad, Naufal M.
    Nandagopal, Nanda
    Malik, Aamir Saeed
    2017 ELEVENTH INTERNATIONAL CONFERENCE ON SENSING TECHNOLOGY (ICST), 2017, : 466 - 469
  • [7] Feature Selection by mRMR Method for Heart Disease Diagnosis
    Wang, Gaoshuai
    Lauri, Fabrice
    El Hassani, Amir Hajjam
    IEEE ACCESS, 2022, 10 : 100786 - 100796
  • [8] mRMR plus : An Effective Feature Selection Algorithm for Classification
    Chowdhury, Hussain A.
    Bhattacharyya, Dhruba K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 424 - 430
  • [9] Fed-mRMR: A lossless federated feature selection method
    Hermo, Jorge
    Bolon-Canedo, Veronica
    Ladra, Susana
    INFORMATION SCIENCES, 2024, 669
  • [10] Detection of Thymoma Disease Using mRMR Feature Selection and Transformer Models
    Agar, Mehmet
    Aydin, Siyami
    Cakmak, Muharrem
    Koc, Mustafa
    Togacar, Mesut
    DIAGNOSTICS, 2024, 14 (19)