Fed-mRMR: A lossless federated feature selection method

被引:3
|
作者
Hermo, Jorge [1 ]
Bolon-Canedo, Veronica [2 ]
Ladra, Susana [2 ]
机构
[1] Univ A Coruna, Dept Comp Sci, La Coruna 15071, Spain
[2] Univ A Coruna, CITIC, La Coruna 15071, Spain
关键词
Machine learning; Federated learning; Feature selection; Privacy preservation; Edge computing; Non-IID data; BIG DATA;
D O I
10.1016/j.ins.2024.120609
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection has become a mandatory task in data mining, due to the overwhelming amount of features in Big Data problems. To handle this high-dimensional data and avoid the wellknown curse of dimensionality, we need to pre-select an optimal subset of features to reduce redundant computations. Federated learning is a machine learning technique based on training an algorithm over many decentralized edge devices holding local rather than global data on a centralized server. Application of this technique is extending to fields such as self-driving cars, medicine and health, and Industry 4.0, where data privacy is compulsory. Feature selection through federated learning is a complicated task since suboptimal features calculated by feature selection methods may be different in heterogeneous datasets from different nodes. In this paper, we propose a lossless federated version of the classic minimum redundancy maximum relevance (mRMR) feature selection algorithm, called federated mRMR (fed-mRMR), which, without losing any effectiveness of the original mRMR method, is applicable to federated learning approaches and capable of dealing with data that are not independent and identically distributed (non-IID data). Implementation can be found at: https://github .com /jorgehermo9 /fed -mrmr
引用
收藏
页数:15
相关论文
共 50 条
  • [31] MSEs Credit Risk Assessment Model Based on Federated Learning and Feature Selection
    Xu, Zhanyang
    Cheng, Jianchun
    Cheng, Luofei
    Xu, Xiaolong
    Bilal, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 5573 - 5595
  • [32] Towards federated feature selection: Logarithmic division for resource-conscious methods
    Suarez-Marcote, Samuel
    Moran-Fernandez, Laura
    Bolon-Canedo, Veronica
    NEUROCOMPUTING, 2024, 596
  • [33] Designing a feature selection method based on explainable artificial intelligence
    Zacharias, Jan
    von Zahn, Moritz
    Chen, Johannes
    Hinz, Oliver
    ELECTRONIC MARKETS, 2022, 32 (04) : 2159 - 2184
  • [34] Correlation based feature selection method
    Michalak, K.
    Kwasnicka, H.
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2010, 2 (05) : 319 - 332
  • [35] Feature extraction technique based on Shapley value method and improved mRMR algorithm *
    Jiang, Degang
    Shi, Xiuyong
    Liang, Yunfang
    Liu, Hua
    MEASUREMENT, 2024, 237
  • [36] Improved aquila optimizer with mRMR for feature selection of high-dimensional gene expression data
    Qin, Xiwen
    Zhang, Siqi
    Dong, Xiaogang
    Shi, Hongyu
    Yuan, Liping
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09): : 13005 - 13027
  • [37] Human-Centered Video Feature Selection via mRMR-SCMMCCA for Preference Extraction
    Ogawa, Takahiro
    Yamaguchi, Yoshiaki
    Asamizu, Satoshi
    Haseyama, Miki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (02): : 409 - 412
  • [38] mRMR-PSO: A Hybrid Feature Selection Technique with a Multiobjective Approach for Sign Language Recognition
    BansalnAff, Sandhya Rani
    Wadhawan, Savita
    Goel, Rajeev
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 10365 - 10380
  • [39] Nested ensemble selection: An effective hybrid feature selection method
    Kamalov, Firuz
    Sulieman, Hana
    Moussa, Sherif
    Reyes, Jorge Avante
    Safaraliev, Murodbek
    HELIYON, 2023, 9 (09)
  • [40] mRMR-PSO: A Hybrid Feature Selection Technique with a Multiobjective Approach for Sign Language Recognition
    Sandhya Rani Bansal
    Savita Wadhawan
    Rajeev Goel
    Arabian Journal for Science and Engineering, 2022, 47 : 10365 - 10380