Data heterogeneous federated learning algorithm for industrial entity extraction

被引:2
|
作者
Fu, Shengze [1 ]
Zhao, Xiaoli [1 ]
Yang, Chi [1 ]
Fang, Zhijun [2 ]
机构
[1] Shanghai Univ Engn Sci, Coll Elect & Elect Engn, Shanghai 201600, Peoples R China
[2] Donghua Univ, Sch Comp Sci & technol, Shanghai, Peoples R China
关键词
Entity extraction; Federated learning; Non-IID; Data quality performance; BLIND QUALITY ASSESSMENT;
D O I
10.1016/j.displa.2023.102504
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Entity extraction is an important part to realize digital transformation in the industrial field. Building an entity extraction model in the industrial field requires a lot of data. The parties in industry often cannot share data due to commercial competition and security and privacy issues, thus forming "Data Island". Federated learning provides a solution to this problem. Federated learning is a distributed machine learning framework that allows each party to train locally and independently using their own private data. The model parameters or gradient information of each party will be aggregated to the central server, thus forming a model jointly trained by all parties. This approach can not only protect the security and privacy of data from all parties, but also fully utilize their data resources. Federated learning can effectively solve the problem of data island, but it still faces some problems and challenges, among which the most typical problem is data heterogeneity. To address the data islanding problem and data heterogeneity problem faced by industrial entity extraction, this paper uses a federated learning framework to solve the data islanding problem and proposes the FedDP algorithm. This algorithm assigns weights based on the data quality performance of each participant. Participants with relatively good data quality performance have higher weights in the aggregation stage, while participants with relatively poor data quality performance have lower weights in the aggregation stage, thus optimizing the performance of federated learning in heterogeneous data scenarios.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Distributional Knowledge Transfer for Heterogeneous Federated Learning
    Wang, Luau
    Wang, Lijuan
    Shcn, Jun
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 747 - 754
  • [32] An effective Federated Learning system for Industrial IoT data streaming
    Wu, Yi
    Yang, Hongxu
    Wang, Xidong
    Yu, Hongjun
    El Saddik, Abdulmotaleb
    Hossain, M. Shamim
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 105 : 414 - 422
  • [33] Probabilistic Node Selection for Federated Learning with Heterogeneous Data in Mobile Edge
    Wu, Hongda
    Wang, Ping
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2453 - 2458
  • [34] Clustered Federated Learning Based on Momentum Gradient Descent for Heterogeneous Data
    Zhao, Xiaoyi
    Xie, Ping
    Xing, Ling
    Zhang, Gaoyuan
    Ma, Huahong
    ELECTRONICS, 2023, 12 (09)
  • [35] Evolutionary Multi-model Federated Learning on Malicious and Heterogeneous Data
    Shang, Chikai
    Gu, Fangqing
    Jiang, Jiaqi
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 386 - 395
  • [36] DFedSN: Decentralized federated learning based on heterogeneous data in social networks
    Yikuan Chen
    Li Liang
    Wei Gao
    World Wide Web, 2023, 26 : 2545 - 2568
  • [37] Dynamic Sample Selection for Federated Learning with Heterogeneous Data in Fog Computing
    Cai, Lingshuang
    Lin, Di
    Zhang, Jiale
    Yu, Shui
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [38] Stabilizing and Accelerating Federated Learning on Heterogeneous Data With Partial Client Participation
    Zhang, Hao
    Li, Chenglin
    Dai, Wenrui
    Zheng, Ziyang
    Zou, Junni
    Xiong, Hongkai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 67 - 83
  • [39] Federated Learning in Heterogeneous Data Settings for Virtual Assistants - A Case Study
    Pardela, Pawel
    Fajfer, Anna
    Gora, Mateusz
    Janicki, Artur
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 451 - 463
  • [40] Telemedicine data secure sharing scheme based on heterogeneous federated learning
    Wang, Nansen
    Zhang, Jianing
    Huang, Ju
    Ou, Wei
    Han, Wenbao
    Zhang, Qionglu
    CYBERSECURITY, 2024, 7 (01):