Neural network training method for materials science based on multi-source databases

被引:8
作者
Guo, Jialong [1 ,2 ]
Chen, Ziyi [1 ,2 ]
Liu, Zhiwei [1 ,2 ]
Li, Xianwei [3 ]
Xie, Zhiyuan [4 ]
Wang, Zongguo [1 ,2 ]
Wang, Yangang [1 ,2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100083, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] China Petr Pipeline Engn Co Ltd, Langfang 065000, Hebei, Peoples R China
[4] Renmin Univ China, Dept Phys, Beijing 100872, Peoples R China
关键词
D O I
10.1038/s41598-022-19426-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The fourth paradigm of science has achieved great success in material discovery and it highlights the sharing and interoperability of data. However, most material data are scattered among various research institutions, and a big data transmission will consume significant bandwidth and tremendous time. At the meanwhile, some data owners prefer to protect the data and keep their initiative in the cooperation. This dilemma gradually leads to the "data island" problem, especially in material science. To attack the problem and make full use of the material data, we propose a new strategy of neural network training based on multi-source databases. In the whole training process, only model parameters are exchanged and no any external access or connection to the local databases. We demonstrate its validity by training a model characterizing material structure and its corresponding formation energy, based on two and four local databases, respectively. The results show that the obtained model accuracy trained by this method is almost the same to that obtained from a single database combining all the local ones. Moreover, different communication frequencies between the client and server are also studied to improve the model training efficiency, and an optimal frequency is recommended.
引用
收藏
页数:10
相关论文
共 27 条
  • [1] Benchmarking graph neural networks for materials chemistry
    Fung, Victor
    Zhang, Jiaxin
    Juarez, Eric
    Sumpter, Bobby G.
    [J]. NPJ COMPUTATIONAL MATERIALS, 2021, 7 (01)
  • [2] Gradient Shielding: Towards Understanding Vulnerability of Deep Neural Networks
    Gu, Zhaoquan
    Hu, Weixiong
    Zhang, Chuanjing
    Lu, Hui
    Yin, Lihua
    Wang, Le
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 921 - 932
  • [3] A multiple-kernel clustering based intrusion detection scheme for 5G and IoT networks
    Hu, Ning
    Tian, Zhihong
    Lu, Hui
    Du, Xiaojiang
    Guizani, Mohsen
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3129 - 3144
  • [4] Patient clustering improves efficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records
    Huang, Li
    Shea, Andrew L.
    Qian, Huining
    Masurkar, Aditya
    Deng, Hao
    Liu, Dianbo
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 99
  • [5] Commentary: The Materials Project: A materials genome approach to accelerating materials innovation
    Jain, Anubhav
    Shyue Ping Ong
    Hautier, Geoffroy
    Chen, Wei
    Richards, William Davidson
    Dacek, Stephen
    Cholia, Shreyas
    Gunter, Dan
    Skinner, David
    Ceder, Gerbrand
    Persson, Kristin A.
    [J]. APL MATERIALS, 2013, 1 (01):
  • [6] On-the-fly closed-loop materials discovery via Bayesian active learning
    Kusne, A. Gilad
    Yu, Heshan
    Wu, Changming
    Zhang, Huairuo
    Hattrick-Simpers, Jason
    DeCost, Brian
    Sarker, Suchismita
    Oses, Corey
    Toher, Cormac
    Curtarolo, Stefano
    Davydov, Albert V.
    Agarwal, Ritesh
    Bendersky, Leonid A.
    Li, Mo
    Mehta, Apurva
    Takeuchi, Ichiro
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [7] A review of applications in federated learning
    Li, Li
    Fan, Yuxi
    Tse, Mike
    Lin, Kuo-Yi
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 149
  • [8] Liang T., 2020, J COMPUT APPL, V8, P1
  • [9] A universal model for accurately predicting the formation energy of inorganic compounds
    Liang, Yingzong
    Chen, Mingwei
    Wang, Yanan
    Jia, Huaxian
    Lu, Tenglong
    Xie, Fankai
    Cai, Guanghui
    Wang, Zongguo
    Meng, Sheng
    Liu, Miao
    [J]. SCIENCE CHINA-MATERIALS, 2023, 66 (01) : 343 - 351
  • [10] A Secure Federated Transfer Learning Framework
    Liu, Yang
    Kang, Yan
    Xing, Chaoping
    Chen, Tianjian
    Yang, Qiang
    [J]. IEEE INTELLIGENT SYSTEMS, 2020, 35 (04) : 70 - 82